Mastodawn

Federico Viticci

Jun 19, 2024

We were not crazy. We were right.

Amazing work by our @robb corroborated by extensive analysis at Wired:

Perplexity Is a Bullshit Machine https://www.wired.com/story/perplexity-is-a-bullshit-machine/

A WIRED investigation shows that the AI-powered search startup Forbes has accused of stealing its content is surreptitiously scraping—and making things up out of thin air.

WIRED

Federico Viticci

Jun 19, 2024

Regulation in this space cannot come soon enough.

AI companies that want to scrape the web for training purposes, or use their bots to summarize webpages, should follow a strict set of guidelines with identifiable user-agents and IP addresses.

Publishers should have a right to opt out of any AI access, request details as to whether their copyrighted content is included in any model, and if so, request that it gets removed and the model retrained.

Hopefully the EU's AI Act will help.

Stuart McHattie 👨🏼‍💻

@viticci I completely agree with all but the last bit. I get quite frustrated hearing that content is being slurped up by companies that generate a profit and share none of it with the contributors who made it possible. However, forcing a retraining is going to increase the carbon emissions of AI yet further. I want to see a solution that doesn’t, quite literally, cost the earth.