Starting to see (and getting a bit excited about) some components of openwebsearch.eu, and I was wondering if the EU will finally get its own Common Crawl, like dataset (commoncrawl.org).
It seems the crawling results aren't publicly accessible yet, and there's already some discussion about GDPR implications.
At this pace, we're still far from being able to compete with US-scale open data efforts 🤦♂️
#europe #commoncrawl #openwebsearch
🔗 https://pipeline.shared-search.eu/
🔗 https://pipeline.shared-search.eu/explain/license.html