New Jersey Globe: Nearly 400 local newspapers sue OpenAI, Microsoft over alleged copyright theft. “The massive coalition of local newspaper publishers filed a federal lawsuit today against OpenAI and Microsoft, alleging the technology companies systematically copied copyrighted reporting from nearly 400 local newspapers to train and develop commercial artificial intelligence products, including […]

https://rbfirehose.com/2026/06/25/new-jersey-globe-nearly-400-local-newspapers-sue-openai-microsoft-over-alleged-copyright-theft/
New Jersey Globe: Nearly 400 local newspapers sue OpenAI, Microsoft over alleged copyright theft

New Jersey Globe: Nearly 400 local newspapers sue OpenAI, Microsoft over alleged copyright theft. “The massive coalition of local newspaper publishers filed a federal lawsuit today against Op…

ResearchBuzz: Firehose

Engadget: Meta is ‘pausing’ employee tracking program after it let the whole company see sensitive data. “Meta has paused use of an AI training program that tracks its own employees’ keystrokes and mouse movements. The company has suspended the Model Capability Initiative, not because of workers’ understandable displeasure around being (almost) perpetually monitored or for potentially breaking […]

https://rbfirehose.com/2026/06/25/engadget-meta-is-pausing-employee-tracking-program-after-it-let-the-whole-company-see-sensitive-data/
Engadget: Meta is ‘pausing’ employee tracking program after it let the whole company see sensitive data

Engadget: Meta is ‘pausing’ employee tracking program after it let the whole company see sensitive data. “Meta has paused use of an AI training program that tracks its own employe…

ResearchBuzz: Firehose

9to5 Google: New Google Search setting saves images and audio you upload; how to turn it off. “Google says it’s saving files and media uploaded during searches in user search history to train AI and improve the experience after announcing the change last month. Search history will include pictures, screenshots taken with Circle to Search, and audio files used in voice searches, but you can […]

https://rbfirehose.com/2026/06/23/new-google-search-setting-saves-images-and-audio-you-upload-how-to-turn-it-off-9to5-google/
New Google Search setting saves images and audio you upload; how to turn it off (9to5 Google)

9to5 Google: New Google Search setting saves images and audio you upload; how to turn it off. “Google says it’s saving files and media uploaded during searches in user search history to train…

ResearchBuzz: Firehose

The Verge: The Atlantic created a searchable database of the music used to train AI. “Atlantic reporter Alex Reisner recently uncovered four datasets of music being used to train AI models and made them fully searchable for the public. Two of the sets are absolutely enormous at 12 million and 9 million tracks. The other two are much smaller, but still represent a significant amount of training […]

https://rbfirehose.com/2026/06/22/the-verge-the-atlantic-created-a-searchable-database-of-the-music-used-to-train-ai/
The Verge: The Atlantic created a searchable database of the music used to train AI

The Verge: The Atlantic created a searchable database of the music used to train AI. “Atlantic reporter Alex Reisner recently uncovered four datasets of music being used to train AI models an…

ResearchBuzz: Firehose

The Decoder: Website “In the Weights” shows whether AI models know who you are. “Those ‘weights’ are billions of numerical values where AI models encode their knowledge. If you show up in them, the model considered you relevant enough during training to recall without tools like web search. The site queries several models to figure out who a specific person is, combines the results, and assigns […]

https://rbfirehose.com/2026/06/19/the-decoder-website-in-the-weights-shows-whether-ai-models-know-who-you-are/
The Decoder: Website “In the Weights” shows whether AI models know who you are

The Decoder: Website “In the Weights” shows whether AI models know who you are. “Those ‘weights’ are billions of numerical values where AI models encode their knowledg…

ResearchBuzz: Firehose

TorrentFreak: Major Publishers Sue ‘WeLib’, a Pirate Site Built on Anna’s Archive Code. “Less than a month after a New York court issued a default judgment against shadow library Anna’s Archive, thirteen major publishers have sued WeLib. The publishers characterize WeLib as a young but popular pirate site that was largely copied from Anna’s Archive. The site is allegedly used by tech […]

https://rbfirehose.com/2026/06/19/torrentfreak-major-publishers-sue-welib-a-pirate-site-built-on-annas-archive-code/

Engadget: Investigation by The Atlantic reveals many millions of songs used for AI music training . “We’re always glad to see more publications and groups digging deeper into artificial intelligence and its impact. Today, The Atlantic has published four searchable databases of music that has been used to train AI models. The scope is pretty staggering, with 12 million tracks in one database, 9 […]

https://rbfirehose.com/2026/06/18/engadget-investigation-by-the-atlantic-reveals-many-millions-of-songs-used-for-ai-music-training/
Engadget: Investigation by The Atlantic reveals many millions of songs used for AI music training

Engadget: Investigation by The Atlantic reveals many millions of songs used for AI music training . “We’re always glad to see more publications and groups digging deeper into artificial…

ResearchBuzz: Firehose

TechSpot: Spammers are flooding Reddit with fake posts designed to show up in AI search results. “Moderators of the /biohackers subreddit say they are dealing with spam that isn’t just about pushing sales, but about shaping how AI systems answer questions. They say companies are seeding discussions with posts intended to appear in AI-generated answers, effectively turning the subreddit into a […]

https://rbfirehose.com/2026/06/05/techspot-spammers-are-flooding-reddit-with-fake-posts-designed-to-show-up-in-ai-search-results/
TechSpot: Spammers are flooding Reddit with fake posts designed to show up in AI search results

TechSpot: Spammers are flooding Reddit with fake posts designed to show up in AI search results. “Moderators of the /biohackers subreddit say they are dealing with spam that isn’t just …

ResearchBuzz: Firehose

Northeastern University: This researcher put AI in the big game. It did not play well. “Northeastern University researcher Lorenzo Torresani wanted to test whether AI can help a group facing a challenge, and he found an interesting dataset with which to evaluate various popular AI models: sports footage. The result? Let’s just say the AI models were no slam dunk.”

https://rbfirehose.com/2026/06/02/northeastern-university-this-researcher-put-ai-in-the-big-game-it-did-not-play-well/
Northeastern University: This researcher put AI in the big game. It did not play well

Northeastern University: This researcher put AI in the big game. It did not play well. “Northeastern University researcher Lorenzo Torresani wanted to test whether AI can help a group facing …

ResearchBuzz: Firehose

Reuters: Why Tesla’s AI trainers don’t trust its self-driving tech – or its safety stats. “Tesla says its Full Self-Driving software is up to 10 times safer than human drivers. But the figures the company uses to support its claims don’t withstand scrutiny – and staffers who trained the technology say it isn’t close to safely delivering autonomous vehicles at scale.”

https://rbfirehose.com/2026/05/28/reuters-why-teslas-ai-trainers-dont-trust-its-self-driving-tech-or-its-safety-stats/
Reuters: Why Tesla’s AI trainers don’t trust its self-driving tech – or its safety stats

Reuters: Why Tesla’s AI trainers don’t trust its self-driving tech – or its safety stats. “Tesla says its Full Self-Driving software is up to 10 times safer than human drivers. But the figure…

ResearchBuzz: Firehose