Skip to content

[SSTFI'25 HM] TrackCrawler is an optimized crawler through dynamic interactions + preference counting to understand changes in tracking in modern web applications from GDPR.

License

Notifications You must be signed in to change notification settings

mzen17/TrackCrawl

Repository files navigation

TrackCrawler

TrackCrawler is an optimized crawler through dynamic interactions + preference counting to understand changes in tracking in modern web applications from GDPR. This serves as the codebase for TrackCrawler.

image

Author: Mike Zeng

Previous Work: Understanding the Effect of GDPR-like Policies for Cookies in Web Applications

Note: This project is a reference for TrackCrawler. TrackCrawler is a complete rework of the old scrapper including a significantly wider scope for research and a different research purpose (understanding tracking at a wider scale vs just cookies).

A web scrapper for understanding cookies. This is the codebase for our paper "Understanding the Effect of GDPR-like Policies for Cookies in Web Applications".

Authors: Mike Zeng, Frank Forliano

Abstract

Majority of websites use cookies, as a direct consequence of needing data. In the past, they have caused many violations to privacy laws like GDPR or CCPA. The EU has made many lawsuits to companies for breaking GDPR laws, and by using a web scraper, we found out that these policies and enforcements have been working sufficiently well at reducing cookie usage in web applications, and causing the overall amount of cookies that load initially to decrease as compared to places that do not have such GDPR-like policies.

Quick Results

https://github.com/mzen17/cookiemonster/blob/main/CookieGraphData.png

The full length paper can be found in the repository.

About

[SSTFI'25 HM] TrackCrawler is an optimized crawler through dynamic interactions + preference counting to understand changes in tracking in modern web applications from GDPR.

Resources

License

Stars

Watchers

Forks