🎉 scrapy-docs-l10n is published!
🚀 Preview:
https://projects.localizethedocs.org/scrapy-docs-l10n
🌐 Crowdin:
https://localizethedocs.crowdin.com/scrapy-docs-l10n
🐙 GitHub:
https://github.com/localizethedocs/scrapy-docs-l10n
#Crowdin #GitHub #Sphinx #Python #Scrapy #WebCrawling #WebScraping
Smart TVs are now running Bright SDK to silently crawl the web for AI training, using residential proxies to bypass Google policy. The move sparks a compliance backlash and raises privacy concerns for home devices. How will regulators respond, and what does this mean for open‑source AI? Dive into the details. #SmartTV #BrightSDK #WebCrawling #DeviceCompliance
🔗 https://aidailypost.com/news/smart-tvs-using-bright-sdk-crawl-web-ai-amid-compliance-backlash
📡 NOT4BFLU55 is now live in machine-readable form.
Image panels with transcription + context → openly archived: https://git.not4bflu55.de/
Sitemap for crawlers: https://git.not4bflu55.de/sitemap.xml
Why?
Because text inside an image, without matching text on the web, is just noise to machines.
Here the meaning sits as text next to the image again.
#NOT4BFLU55 #Infologie #Filterblase #OpenArchive #WebCrawling #GitHubPages #Sitemap #Fediverse #Maschinenfutter
wxpath – Declarative web crawling in XPath
https://github.com/rodricios/wxpath
#HackerNews #wxpath #webcrawling #XPath #technology #open-source #GitHub