This repository contains datasets and the extended version of the following paper
Do Opt-Outs Really Opt Me Out? [Download]
Duc Bui, Brian Tang, and Kang G. Shin
In Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security (CCS ’22), November 7–11, 2022, Los Angeles, CA, USA. ACM, New York, NY, USA, 19 pages.
-
Folder
optout_policy_datasetcontains datasets used to train and test opt-out policy classifiers.-
File
multi_label_export.csvcontains labeled sentences extracted from the privacy policies of online trackers. The file has the following columns:-
cookie_domain: contains domains of the online trackers which stated the policies. -
stmt: contains policy statements extracted from the privacy policies of the trackers. -
No-data-collection: contains value 1 if the statement is classified asNo-data-collectionopt-out policy class, or value 0 if otherwise. A sentence of this class means that opting out will stop data collection via third-party cookies. -
No-tracking: contains value 1 if the statement is classified asNo-trackingopt-out policy class, or value 0 if otherwise. A sentence of this class means that opting out will stop the tracking of users' activities via third-party cookies. -
No-data-coll.-for-oba: contains value 1 if the statement is classified asNo-data-coll.-for-obaopt-out policy class, or value 0 if otherwise. A sentence of this class means that opting out will stop data collection of users' activities for targeted advertising. -
No-display-OBA: contains value 1 if the statement is classified asNo-display-OBAopt-out policy class, or value 0 if otherwise. A sentence of this class means that opting out will stop the display of targeted advertising to users. -
other: contains value 1 if the statement is not classified as any of the above opt-out policy classes, or value 0 if otherwise.
-
-
Folder
train_datahas data files that are more convenient for training the opt-out policy classifiers. These files contain rows extracted from themulti_label_export.csvfile.- File
No-tracking.csvcontains thestmtandNo-trackingcolumns frommulti_label_export.csv. - File
No-data-collection.csvcontains thestmtandNo-data-collectioncolumns frommulti_label_export.csv.
- File
-
- Folder
optout_cookie_datasetcontains files used to train (train.csv) and test (test.csv) the opt-out cookie classifier. Each of the files contains the following columns:name: contains cookie names.value: contains cookie values.is_optout_cookie: contains value 1 if the cookie is an opt-out cookie, or value 0 if otherwise.