Mastodawn

Reed Mideke Jun 23, 2023

Also, not great plan to lie about being on vacation when responding to a show cause order "Mr. LoDuca’s statement was false and he knew it to be false at the time he made the statement. Under questioning by the Court at the sanctions hearing, Mr. LoDuca admitted that he was not out of the office on vacation"

Show thread

Reed Mideke Jun 23, 2023

Mr. Schwartz fares no better
"Mr. Schwartz’s statement in his May 25 affidavit that ChatGPT “supplemented” his research was a misleading attempt to mitigate his actions by creating the false impression that he had done other, meaningful research on the issue and did not rely exclusive on an AI chatbot, when, in truth and in fact, it was the only source"

Show thread

Reed Mideke Jun 24, 2023

More on clickworkers allegedly using #AI to automate their AI training tasks. Back door Habsburg AI https://www.technologyreview.com/2023/06/22/1075405/the-people-paid-to-train-ai-are-outsourcing-their-work-to-ai/

The people paid to train AI are outsourcing their work… to AI

It’s a practice that could introduce further errors into already error-prone models.

MIT Technology Review

Show thread

Reed Mideke Jun 28, 2023

James Vincent with a good look at the real #AI threat: Enshitification
https://www.theverge.com/2023/6/26/23773914/ai-large-language-models-data-scraping-generation-remaking-web

AI is killing the old web, and the new web struggles to be born

AI language models and chatbots show that AI can generate content cheaply but at a lower quality. These characteristics mean AI will remake the web as we know it — from Google Search to Wikipedia and more.

The Verge

Show thread

Reed Mideke Jun 30, 2023

Janelle Shane on the recent #AI detector paper, with succinct advice "Don't use AI detectors for anything important"

https://www.aiweirdness.com/dont-use-ai-detectors-for-anything-important/

Don't use AI detectors for anything important

I've noted before that because AI detectors produce false positives, it's unethical to use them to detect cheating. Now there's a new study that shows it's even worse. Not only do AI detectors falsely flag human-written text as AI-written, the way in which they do it is biased. This is

AI Weirdness

Show thread

Reed Mideke Jul 4, 2023

This. This is what people reporting on #AI / #LLM hype need to understand
https://mastodon.social/@amydentata@tech.lgbt/110651829564300496

Show thread

Reed Mideke Jul 4, 2023

Above could also have helped @[email protected] avoid the whole #AI explain train wreck, which thankfully seems to have been rolled back https://github.com/mdn/yari/issues/9208#issuecomment-1615411943

MDN can now automatically lie to people seeking technical information · Issue #9208 · mdn/yari

Summary MDN's new "ai explain" button on code blocks generates human-like text that may be correct by happenstance, or may contain convincing falsehoods. this is a strange decision for a technical ...

GitHub

Show thread

Reed Mideke Jul 6, 2023

Owners of Gizmodo jump on the #AI #enshitification bandwagon, with predictable results https://variety.com/2023/digital/news/io9-ai-generated-star-wars-article-errors-1235662194/

io9 Published an AI-Generated Star Wars Article Filled With Errors

A new byline showed up Wednesday on the site of io9, the genre-entertainment section of Gizmodo tech website: “Gizmodo Bot.” And the site’s editorial staff appears to have not had…

Variety

Show thread

Reed Mideke Jul 6, 2023

Additional comment from io9 deputy editor James Whitbrook "that's the formal part, here's my own personal comment: lmao, it's fucking dogshit"
https://twitter.com/Jwhitbrook/status/1676704102754004996

James Whitbrook on Twitter

“that's the formal part, here's my own personal comment: lmao, it's fucking dogshit”

Twitter

Show thread

Reed Mideke Jul 7, 2023

Oh FFS @[email protected] @[email protected] "readers also pointed out a handful of concrete cases where an incorrect answer was rendered. This feedback is enormously helpful, and the MDN team is now investigating these bug reports"

They aren't "bugs" - #LLMs by definition just put together plausible sounding words with no regard to correctness. Pointing out individual errors demonstrates this, but does not provide any mechanism by which it might be "fixed" in the general case

https://blog.mozilla.org/en/products/mdn/responsibly-empowering-developers-with-ai-on-mdn/

Responsibly empowering developers with AI on MDN | The Mozilla Blog

Generative AI technologies powered by Large Language Models (LLMs), such as OpenAI’s ChatGPT, have shown themselves to be both a big boon to productivity

Show thread

Reed Mideke Jul 7, 2023

The post also notes that many users were happy with the answers, ignoring that the target audience of people who *came to MDN looking for help with something they didn't already know* may not immediately recognize that the answer is subtly wrong, or just plausible looking #AI gibberish

Show thread

Reed Mideke Jul 7, 2023

It also says "even extraordinarily well-trained LLMs — like humans — will sometimes be wrong"

which is true as far as it goes, but here's the thing: They are not *wrong like humans* … yes, you'll find some overconfident bullshitters on stack overflow, but generally humans in these contexts have some awareness of the limits of their knowledge and don't drift seamlessly between accurate explanation and complete BS

Show thread

Reed Mideke Jul 7, 2023

@[email protected] post also makes no mention of the apparent lack of communication with the rest of the MDN team https://github.com/mdn/yari/issues/9208#issuecomment-1615411943

MDN can now automatically lie to people seeking technical information · Issue #9208 · mdn/yari

GitHub

Show thread

Reed Mideke Jul 7, 2023

Anyway, there's a new bug, so if you have thoughts on #MDN adding #AI stochastic bullshit to what has, up to now, been the premier technical reference for web developers, you could make them heard there https://github.com/mdn/yari/issues/9230

The AI help button is very good but it links to a feature that should not exist · Issue #9230 · mdn/yari

Summary I made a previous issue pointing out that the AI Help feature lies to people and should not exist because of potential harm to novices. This was renamed by @caugner to "AI Help is linked on...

GitHub

Show thread

Reed Mideke Jul 8, 2023

No, you shouldn't use <s>#AI</s> spicy autocomplete to evaluate grant proposals 😬 https://www.theguardian.com/technology/2023/jul/08/australian-research-council-scrutiny-allegations-chatgpt-artifical-intelligence

Are Australian Research Council reports being written by ChatGPT?

Multiple accounts from researchers suggest that feedback for Discovery Project grant funding was written by artificial intelligence

The Guardian

Show thread

Reed Mideke Jul 8, 2023

More on the #Gizmodo #AI debacle: After publishing error-ridden #LLM garbage which their own editorial team called "fucking dogshit" 'a G/O Media spokesman, said the company would be “derelict” if it did not experiment with AI. “We think the AI trial has been successful,”'

(free link)

https://wapo.st/43iHlmP

How an AI-written Star Wars story created chaos at Gizmodo

A Gizmodo story on Star Wars, generated by artificial intelligence, was riddled with errors. The irony that the problem happened at a tech publication was undeniable.

The Washington Post

Show thread

Reed Mideke Jul 12, 2023

#AI is going great
(caveat I don't know the source and thought it might be a joke, but the rest of their timeline looks real, and Janelle Shane retweeted it)
https://twitter.com/guntrip/status/1640694869785030657

Steve Guntrip on Twitter

“Digistore EU have promoted an AI to their website's chat function. It's not working particuarly well. Follow my attempts to get a tracking number that result in a milkshake recipe and a rude poem. (1/2)”

Twitter

Show thread

Reed Mideke Jul 12, 2023

Not gonna screenshot the thread of screenshots here, but it's archived if you don't want to visit the bird site https://web.archive.org/web/20230329232724/https://twitter.com/guntrip/status/1640694869785030657

Steve Guntrip on Twitter

Twitter

Show thread

Reed Mideke Jul 14, 2023

New religion dropped. From this great @arstechnica article on #AI detectors https://arstechnica.com/information-technology/2023/07/why-ai-detectors-think-the-us-constitution-was-written-by-ai/

Why AI writing detectors don’t work

Can AI writing detectors be trusted? We dig into the theory behind them.

Ars Technica

Show thread

Reed Mideke Jul 29, 2023

A thing that occurs to me about that last boost from @zoe (https://mastodon.social/@[email protected]imeprincess.net/110797643482092764): #AI scrapers refusing to play nice with ROBOTS.TXT is that it encourages adversarial approaches…

Show thread

Reed Mideke Jul 29, 2023

People building models will be keen to exclude AI generated content from the training set. So, would interspersing stuff that scores high as AI-generated (whether it actually is or not) cause entire pages to be excluded? You could separate it from the real content in ways that humans would understand. OTOH, if you care about SEO it'd be pretty risky

Show thread

Reed Mideke Jul 29, 2023

There's also been talk about standards to identify AI generated content, leading to hilarious option of falsely identifying your real content as AI generated to stop people from training AI on it

Show thread

Reed Mideke Jul 29, 2023

Folks have suggested CSS based approaches to poison models (like white text on white background) but there's a significant risk of breaking accessibility. Also risk of search engines thinking it looks spammy again

Show thread

Reed Mideke Jul 29, 2023

A general problem with poisoning like this is that any technique which becomes really widespread will likely be noticed and filtered out. OTOH, if the goal is to not have your content used, that may be OK!

Show thread

Reed Mideke Aug 6, 2023

Good to see mainstream press finally touching the question of whether #LLM #AI BSing is fixable or an inherent property of the tech, even if it gets a bit of he said, she said treatment.

Also uh "Those errors are not a huge problem for the marketing firms turning to Jasper AI for help writing pitches…" marketing doesn't care if their pitches are BS? KNOCK ME OVER WITH A FEATHER

https://fortune.com/2023/08/01/can-ai-chatgpt-hallucinations-be-fixed-experts-doubt-altman-openai/

Tech experts are starting to doubt that ChatGPT and A.I. ‘hallucinations’ will ever go away: ‘This isn’t fixable’

Experts are starting to doubt it, and even OpenAI CEO Sam Altman is a bit stumped.

Fortune

Show thread

Reed Mideke Aug 7, 2023

So @[email protected] points out (https://mastodon.social/@Toke@helvede.net/110848880977610283) that #OpenAI does claim to have unique user agent and honor robots.txt when scraping text for #ChatGPT #AI training. Not clear whether this is the only or even primary way publicly accessible web content gets into their training set though https://platform.openai.com/docs/gptbot

OpenAI Platform

Explore developer resources, tutorials, API docs, and dynamic examples to get the most out of OpenAI's platform.

Show thread

Reed Mideke Aug 7, 2023

Just hypothetically speaking, many web platforms could easily be configured to serve specially tailored content based on the user agent, but that would be mean and wrong and potentially waste resources of VC backed billionaires freeloading off the public web to build their BS machines so definitely don't do that 😉

Show thread

Reed Mideke Aug 7, 2023

Complete gibberish will likely get weeded out. Common knowledge will tend to be overwhelmed by other sources. So the sweet spot for influence would seem to be obscure topics, or unique tokens that only appear in your content (though to what end isn't obvious).

Bring on the SolidGoldMagikarp https://www.lesswrong.com/posts/Ya9LzwEbfaAMY8ABo/solidgoldmagikarp-ii-technical-details-and-more-recent

SolidGoldMagikarp II: technical details and more recent findings — LessWrong

tl;dr: This is a follow-up to our original post on prompt generation and the anomalous token phenomenon which emerged from that research. Work done b…

Show thread

Reed Mideke Aug 7, 2023

Of course, these things don't just scrape human readable text, many of them do code too. Serving up a special vulnerable version of your input sanitization code when you see GPTBot is left as an exercise to the reader

Show thread

Reed Mideke Aug 17, 2023

"It's highly unlikely that ChatGPT's training data includes the entire text of each book under question, though the data may include references to discussions about the book's content—if the book is famous enough"
Highlights a pernicious problem with ChatGPT style #LLM #AI: It's far more likely to give reasonable answers on well-known subjects. If you spot check with say, Dickens and Hunter S. Thomson, you might think it was pretty good at spotting naughty books

https://arstechnica.com/information-technology/2023/08/an-iowa-school-district-is-using-chatgpt-to-decide-which-books-to-ban/

An Iowa school district is using ChatGPT to decide which books to ban

Official: "It is simply not feasible to read every book" for depictions of sex.

Ars Technica

Show thread

Reed Mideke Aug 17, 2023

But for more obscure ones, it's probably no better than a coin toss. Being relatively good at stuff "everyone knows" gives people false confidence that it's also good at stuff they don't know

(we should also note that even if the entire text of the books were in the training set, that wouldn't mean it would provide accurate answers about the content!)

Show thread

Reed Mideke Aug 29, 2023

Cool, cool, #Amazon #AI book spammers have expanded from travel guides to mushroom foraging, what could possibly go wrong?
https://www.404media.co/ai-generated-mushroom-foraging-books-amazon/

‘Life or Death:’ AI-Generated Mushroom Foraging Books Are All Over Amazon

Experts are worried that books produced by ChatGPT for sale on Amazon, which target beginner foragers, could end up killing someone.

404 Media

Show thread

Reed Mideke Aug 29, 2023

Scale and the way they've structured things to profit off resellers insulates them quite a bit, but at some point it seems like this kind of is going to cut into Amazon's bottom line or open up opportunities for competition

Show thread

Reed Mideke Aug 30, 2023

The Verge reports copyright office will solicit comments on #AI starting tomorrow https://www.theverge.com/2023/8/29/23851126/us-copyright-office-ai-public-comments

US Copyright Office wants to hear what people think about AI and copyright

The agency is open to receiving comments around copyright and AI until October. It may use the comments to create new rules.

The Verge

Show thread

Reed Mideke Sep 6, 2023

G/O Media management continue their #AI enshitification of #Gizmodo, laying off staff of Spanish language site and switching to "AI" translation of English content

They know people who want shitty machine translations of the English content can already get that with Chrome or google translate, right?

https://arstechnica.com/information-technology/2023/09/ai-took-my-job-literally-gizmodo-fires-spanish-staff-amid-switch-to-ai-translator/

“AI took my job, literally”—Gizmodo fires Spanish staff amid switch to AI translator

Meanwhile, readers say that some AI-penned articles switch languages halfway through.

Ars Technica

Show thread

Reed Mideke Sep 6, 2023

Type II #AI (https://twitter.com/reedmideke/status/1137496639856189440) spotted in the wild "One of the sources said workers at one point produced the 3D design wholecloth themselves without the help of machine learning at all"

https://www.404media.co/kaedim-ai-startup-2d-to-3d-used-cheap-human-labor/

Reed Mideke on X

This is Type I AI. Type II AI is three mechanical turk workers in a trench coat https://t.co/9tW2zIMPVf

X (formerly Twitter)

Show thread

Reed Mideke Sep 6, 2023

Spicy autocomplete dishing out tax advice? I for one cannot imagine any way this could possibly go wrong https://arstechnica.com/information-technology/2023/09/talk-to-your-taxes-turbotaxs-new-ai-agent-makes-it-possible/

TurboTax-maker Intuit offers an AI agent that provides financial tips

AI-generated financial assistance also arrives in Credit Karma, QuickBooks, and Mailchimp.

Ars Technica

Show thread

Reed Mideke Sep 10, 2023

#OpenAI, on their flagship product "Additionally, ChatGPT has no 'knowledge' of what content could be AI-generated. It will sometimes make up responses to questions like 'did you write this [essay]?' or 'could this have been written by AI?' These responses are random and have no basis in fact."

Nominally this refers only to using #ChatGPT as an #AI detector. Extrapolating to other topics is left as an exercise to the reader ¯\_(ツ)_/¯

https://arstechnica.com/information-technology/2023/09/openai-admits-that-ai-writing-detectors-dont-work/

OpenAI confirms that AI writing detectors don’t work

No detectors “reliably distinguish between AI-generated and human-generated content.”…

Ars Technica

Show thread

Reed Mideke Sep 11, 2023

Also #OpenAI's suggestion for dealing with the lack of reliable #AI bullshit detectors is of course… make using their AI bullshit generator part of the assignment: "One technique some teachers have found useful is encouraging students to share specific conversations from ChatGPT" https://help.openai.com/en/articles/8313351-how-can-educators-respond-to-students-presenting-ai-generated-content-as-their-own

How can educators respond to students presenting AI-generated content as their own? | OpenAI Help Center

Show thread

Reed Mideke Sep 24, 2023

Another great illustration of how #LLM #AI are BS machines, from @janellecshane: If you ask them to explain a meme that doesn't exist, they'll happily oblige by making something up https://www.aiweirdness.com/trolling-chatbots-with-made-up-memes/

Trolling chatbots with made-up memes

ChatGPT, Bard, GPT-4, and the like are often pitched as ways to retrieve information. The problem is they'll "retrieve" whatever you ask for, whether or not it exists. Tumblr user @indigofoxpaws sent me a few screenshots where they'd asked ChatGPT for an explanation of the nonexistent "Linoleum harvest" Tumblr meme,

AI Weirdness

The people paid to train AI are outsourcing their work… to AI

AI is killing the old web, and the new web struggles to be born

Don't use AI detectors for anything important

MDN can now automatically lie to people seeking technical information · Issue #9208 · mdn/yari

io9 Published an AI-Generated Star Wars Article Filled With Errors

James Whitbrook on Twitter

Responsibly empowering developers with AI on MDN | The Mozilla Blog

MDN can now automatically lie to people seeking technical information · Issue #9208 · mdn/yari

The AI help button is very good but it links to a feature that should not exist · Issue #9230 · mdn/yari

Are Australian Research Council reports being written by ChatGPT?

How an AI-written Star Wars story created chaos at Gizmodo

Steve Guntrip on Twitter

Steve Guntrip on Twitter

Why AI writing detectors don’t work

Tech experts are starting to doubt that ChatGPT and A.I. ‘hallucinations’ will ever go away: ‘This isn’t fixable’

OpenAI Platform

SolidGoldMagikarp II: technical details and more recent findings — LessWrong

An Iowa school district is using ChatGPT to decide which books to ban

‘Life or Death:’ AI-Generated Mushroom Foraging Books Are All Over Amazon

US Copyright Office wants to hear what people think about AI and copyright

“AI took my job, literally”—Gizmodo fires Spanish staff amid switch to AI translator

Reed Mideke on X

TurboTax-maker Intuit offers an AI agent that provides financial tips

OpenAI confirms that AI writing detectors don’t work

How can educators respond to students presenting AI-generated content as their own? | OpenAI Help Center

Trolling chatbots with made-up memes

Registration Form ‹ Berryville Institute of Machine Learning — WordPress