One Open-source Project Daily

Fast and simple video download library and CLI tool written in Go

https://github.com/iawia002/lux

#1ospd #opensource #bilibili #crawler #download #downloader #go #golang #iqiyi #qq #scraper #tumblr #video #youku #youtube

Added https://github.com/laylavish/uBlockOrigin-HUGE-AI-Blocklist to PriEco #crawler

PriEco will no longer return results from #websites that are clearly #AI slop

Our fight against AI #slop doesn't end here, and we are figuring out better ways to handle it
#crawler #AI #websites #slop

GitHub - laylavish/uBlockOrigin-HUGE-AI-Blocklist: A huge blocklist of manually curated sites that contain AI generated imagery for uBlock Origin & uBlacklist.
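How PriEco applies the blocklist isn't described in the post. As a rough sketch only, here is one way a search front end might drop results whose host appears on a domain blocklist. It assumes the list has already been reduced to one bare hostname per line (the real list ships as uBlock Origin / uBlacklist rules, which would need converting first); `load_blocklist`, `is_blocked`, and `filter_results` are hypothetical names, not PriEco code.

```python
from urllib.parse import urlsplit


def load_blocklist(lines):
    """Collect hostnames, skipping blank lines and '!' or '#' comment lines.

    Assumption: one bare hostname per line, not raw uBlock Origin rules.
    """
    blocked = set()
    for line in lines:
        entry = line.strip().lower()
        if not entry or entry.startswith(("!", "#")):
            continue
        blocked.add(entry)
    return blocked


def is_blocked(url, blocked):
    """Return True if the URL's host or any parent domain is on the list."""
    host = (urlsplit(url).hostname or "").lower()
    parts = host.split(".")
    # Check "www.a.example", then "a.example", then "example".
    return any(".".join(parts[i:]) in blocked for i in range(len(parts)))


def filter_results(urls, blocked):
    """Keep only the result URLs whose host is not blocked."""
    return [u for u in urls if not is_blocked(u, blocked)]
```

Matching parent domains means a listed `slop.example` also covers `www.slop.example`, while an unrelated host such as `notslop.example` is left alone.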


RE: https://rheinneckar.social/@admin/116554880838480005

This might also be of interest to @milan. #crawler #bots

Ooh, OpenAI is on one of my sites right now, and I'm wondering why there's so much traffic on the server at the moment
#Crawler
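For anyone wanting to check the same thing on their own server, a minimal sketch: count access-log lines whose user agent contains one of OpenAI's published crawler tokens (`GPTBot`, `ChatGPT-User`, `OAI-SearchBot`). It assumes the common "combined" log format, where the user agent is the last quoted field; `count_ai_hits` is a hypothetical helper, and user agents can be spoofed, so treat the numbers as a hint rather than proof.

```python
import re
from collections import Counter

# User-agent tokens OpenAI documents for its crawlers.
AI_TOKENS = ("GPTBot", "ChatGPT-User", "OAI-SearchBot")

# Assumption: combined log format, user agent is the last quoted field.
UA_RE = re.compile(r'"([^"]*)"\s*$')


def count_ai_hits(log_lines):
    """Count requests per crawler token found in the user-agent field."""
    hits = Counter()
    for line in log_lines:
        match = UA_RE.search(line)
        if not match:
            continue  # line does not end with a quoted field
        user_agent = match.group(1).lower()
        for token in AI_TOKENS:
            if token.lower() in user_agent:
                hits[token] += 1
                break  # count each request once
    return hits
```

Something like `count_ai_hits(open("access.log", encoding="utf-8", errors="replace"))` would tally a whole log file; the path is only an example.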
Welcome to the future, where AI agents hunt down alleged online copyright infringement

As readers of this blog have doubtless noticed, the latest hot tech – and investment – area involves “agentic AI”, where AI systems are allowed to operate autonomously on allocated tasks. There’s no doubt there are some exciting possibilities here, as well as some troubling issues concerning lack of control. It’s a rapidly-evolving area of research and experimentation, which makes […]

#agenticAi #agents #ai #ceaseAndDesist #crawler #digitalWatermarks #infringement #licensing #llms #patents #pricing #takedowns #universalMusicGroup https://walledculture.org/welcome-to-the-future-where-ai-agents-hunt-down-alleged-online-copyright-infringement/

To all #webmasters who use their service: This may be a quick fix for your #crawler woes. But you're not going to like the future they usher in. Your own #descendents will ask you one day why you ceded the control of this wonderful public resource to the likes of CF.

[3/4]

If I'm visiting a site from a country that you don't expect me to be from, does that mean that I'm not a human being interested in the content? Your solution to the AI vacuum cleaner is to arbitrarily blanket ban the IP blocks we're in? Why are we denied the full benefits of the internet because of your incompetence and/or unwillingness to solve the #LLM #crawler issue technically?

[2/4]

Understanding Google ranking: what's behind the search results

How is the Google ranking put together? Google looks closely at the content of web pages to determine who gets to be at the top.

TARNKAPPE.INFO

Turned people into crawlers today :V

#crawler #fresnocrawler