Maybe you are interested in contributing to TrackerDB, an open source data set on online tracking?
TrackerDB: https://github.com/ghostery/trackerdb
If you want to help us improve the data quality but are not sure where to start, you can look for a good starting topic here:
https://github.com/ghostery/trackerdb/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22
TrackerDB is published under a Creative Commons license (free for all non-commercial projects). It is currently used in WhoTracks.me and in the Ghostery browser extension.