#AI is going great
(caveat I don't know the source and thought it might be a joke, but the rest of their timeline looks real, and Janelle Shane retweeted it)
https://twitter.com/guntrip/status/1640694869785030657
Steve Guntrip on Twitter

“Digistore EU have promoted an AI to their website's chat function. It's not working particuarly well. Follow my attempts to get a tracking number that result in a milkshake recipe and a rude poem. (1/2)”

Twitter
Not gonna screenshot the thread of screenshots here, but it's archived if you don't want to visit the bird site https://web.archive.org/web/20230329232724/https://twitter.com/guntrip/status/1640694869785030657
Steve Guntrip on Twitter

“Digistore EU have promoted an AI to their website's chat function. It's not working particuarly well. Follow my attempts to get a tracking number that result in a milkshake recipe and a rude poem. (1/2)”

Twitter
Why AI writing detectors don’t work

Can AI writing detectors be trusted? We dig into the theory behind them.

Ars Technica
A thing that occurs to me about that last boost from @zoe (https://mastodon.social/@[email protected]imeprincess.net/110797643482092764): #AI scrapers refusing to play nice with ROBOTS.TXT is that it encourages adversarial approaches…
People building models will be keen to exclude AI generated content from the training set. So, would interspersing stuff that scores high as AI-generated (whether it actually is or not) cause entire pages to be excluded? You could separate it from the real content in ways that humans would understand. OTOH, if you care about SEO it'd be pretty risky
There's also been talk about standards to identify AI generated content, leading to hilarious option of falsely identifying your real content as AI generated to stop people from training AI on it
Folks have suggested CSS based approaches to poison models (like white text on white background) but there's a significant risk of breaking accessibility. Also risk of search engines thinking it looks spammy again
A general problem with poisoning like this is that any technique which becomes really widespread will likely be noticed and filtered out. OTOH, if the goal is to not have your content used, that may be OK!

Good to see mainstream press finally touching the question of whether #LLM #AI BSing is fixable or an inherent property of the tech, even if it gets a bit of he said, she said treatment.

Also uh "Those errors are not a huge problem for the marketing firms turning to Jasper AI for help writing pitches…" marketing doesn't care if their pitches are BS? KNOCK ME OVER WITH A FEATHER

https://fortune.com/2023/08/01/can-ai-chatgpt-hallucinations-be-fixed-experts-doubt-altman-openai/

Tech experts are starting to doubt that ChatGPT and A.I. ‘hallucinations’ will ever go away: ‘This isn’t fixable’

Experts are starting to doubt it, and even OpenAI CEO Sam Altman is a bit stumped.

Fortune
So @[email protected] points out (https://mastodon.social/@Toke@helvede.net/110848880977610283) that #OpenAI does claim to have unique user agent and honor robots.txt when scraping text for #ChatGPT #AI training. Not clear whether this is the only or even primary way publicly accessible web content gets into their training set though https://platform.openai.com/docs/gptbot
OpenAI Platform

Explore developer resources, tutorials, API docs, and dynamic examples to get the most out of OpenAI's platform.

Just hypothetically speaking, many web platforms could easily be configured to serve specially tailored content based on the user agent, but that would be mean and wrong and potentially waste resources of VC backed billionaires freeloading off the public web to build their BS machines so definitely don't do that 😉

Complete gibberish will likely get weeded out. Common knowledge will tend to be overwhelmed by other sources. So the sweet spot for influence would seem to be obscure topics, or unique tokens that only appear in your content (though to what end isn't obvious).

Bring on the SolidGoldMagikarp https://www.lesswrong.com/posts/Ya9LzwEbfaAMY8ABo/solidgoldmagikarp-ii-technical-details-and-more-recent

SolidGoldMagikarp II: technical details and more recent findings — LessWrong

tl;dr: This is a follow-up to our original post on prompt generation and the anomalous token phenomenon which emerged from that research. Work done b…

Of course, these things don't just scrape human readable text, many of them do code too. Serving up a special vulnerable version of your input sanitization code when you see GPTBot is left as an exercise to the reader

"It's highly unlikely that ChatGPT's training data includes the entire text of each book under question, though the data may include references to discussions about the book's content—if the book is famous enough"
Highlights a pernicious problem with ChatGPT style #LLM #AI: It's far more likely to give reasonable answers on well-known subjects. If you spot check with say, Dickens and Hunter S. Thomson, you might think it was pretty good at spotting naughty books

https://arstechnica.com/information-technology/2023/08/an-iowa-school-district-is-using-chatgpt-to-decide-which-books-to-ban/

An Iowa school district is using ChatGPT to decide which books to ban

Official: "It is simply not feasible to read every book" for depictions of sex.

Ars Technica

But for more obscure ones, it's probably no better than a coin toss. Being relatively good at stuff "everyone knows" gives people false confidence that it's also good at stuff they don't know

(we should also note that even if the entire text of the books were in the training set, that wouldn't mean it would provide accurate answers about the content!)

Cool, cool, #Amazon #AI book spammers have expanded from travel guides to mushroom foraging, what could possibly go wrong?
https://www.404media.co/ai-generated-mushroom-foraging-books-amazon/
‘Life or Death:’ AI-Generated Mushroom Foraging Books Are All Over Amazon

Experts are worried that books produced by ChatGPT for sale on Amazon, which target beginner foragers, could end up killing someone.

404 Media
Scale and the way they've structured things to profit off resellers insulates them quite a bit, but at some point it seems like this kind of is going to cut into Amazon's bottom line or open up opportunities for competition
The Verge reports copyright office will solicit comments on #AI starting tomorrow https://www.theverge.com/2023/8/29/23851126/us-copyright-office-ai-public-comments
US Copyright Office wants to hear what people think about AI and copyright

The agency is open to receiving comments around copyright and AI until October. It may use the comments to create new rules.

The Verge

G/O Media management continue their #AI enshitification of #Gizmodo, laying off staff of Spanish language site and switching to "AI" translation of English content

They know people who want shitty machine translations of the English content can already get that with Chrome or google translate, right?

https://arstechnica.com/information-technology/2023/09/ai-took-my-job-literally-gizmodo-fires-spanish-staff-amid-switch-to-ai-translator/

“AI took my job, literally”—Gizmodo fires Spanish staff amid switch to AI translator

Meanwhile, readers say that some AI-penned articles switch languages halfway through.

Ars Technica

Type II #AI (https://twitter.com/reedmideke/status/1137496639856189440) spotted in the wild "One of the sources said workers at one point produced the 3D design wholecloth themselves without the help of machine learning at all"

https://www.404media.co/kaedim-ai-startup-2d-to-3d-used-cheap-human-labor/

Reed Mideke on X

This is Type I AI. Type II AI is three mechanical turk workers in a trench coat https://t.co/9tW2zIMPVf

X (formerly Twitter)
Spicy autocomplete dishing out tax advice? I for one cannot imagine any way this could possibly go wrong https://arstechnica.com/information-technology/2023/09/talk-to-your-taxes-turbotaxs-new-ai-agent-makes-it-possible/
TurboTax-maker Intuit offers an AI agent that provides financial tips

AI-generated financial assistance also arrives in Credit Karma, QuickBooks, and Mailchimp.

Ars Technica

#OpenAI, on their flagship product "Additionally, ChatGPT has no 'knowledge' of what content could be AI-generated. It will sometimes make up responses to questions like 'did you write this [essay]?' or 'could this have been written by AI?' These responses are random and have no basis in fact."

Nominally this refers only to using #ChatGPT as an #AI detector. Extrapolating to other topics is left as an exercise to the reader ¯\_(ツ)_/¯

https://arstechnica.com/information-technology/2023/09/openai-admits-that-ai-writing-detectors-dont-work/

OpenAI confirms that AI writing detectors don’t work

No detectors “reliably distinguish between AI-generated and human-generated content.”…

Ars Technica
Also #OpenAI's suggestion for dealing with the lack of reliable #AI bullshit detectors is of course… make using their AI bullshit generator part of the assignment: "One technique some teachers have found useful is encouraging students to share specific conversations from ChatGPT" https://help.openai.com/en/articles/8313351-how-can-educators-respond-to-students-presenting-ai-generated-content-as-their-own
How can educators respond to students presenting AI-generated content as their own? | OpenAI Help Center

Another great illustration of how #LLM #AI are BS machines, from @janellecshane: If you ask them to explain a meme that doesn't exist, they'll happily oblige by making something up https://www.aiweirdness.com/trolling-chatbots-with-made-up-memes/
Trolling chatbots with made-up memes

ChatGPT, Bard, GPT-4, and the like are often pitched as ways to retrieve information. The problem is they'll "retrieve" whatever you ask for, whether or not it exists. Tumblr user @indigofoxpaws sent me a few screenshots where they'd asked ChatGPT for an explanation of the nonexistent "Linoleum harvest" Tumblr meme,

AI Weirdness
Inspired by @GossiTheDog (https://mastodon.social/@GossiTheDog@cyberplace.social/111144290629997760) I asked bing chat what it knows about me. No surprise it picked twitter since I keep pretty low profile otherwise, but uh…
1) "He" - good guess
2) "has over 2,000 followers" - under 200
3) "joined Twitter in June 2010" - Close, Nov 2010
4) It cites tweets… which don't remotely say what bing claims.
5) In fact, it cites the same tweet [2] for two totally different topics (neither correct, though vaguely adjacent).
OK, but I exist outside of twitter, and my opsec ain't that good. Tell me more, Mr Bing:
1) A software engineer (close enough) who works at Microsoft (never)
2) A contributor to several open source projects on GitHub (true-ish)
3) created and maintained repositories for various languages and frameworks, such as C#, Python, React, and Angular (2/4 are true-ish)
4) A fan of science fiction and fantasy books (fair)
5) He has a profile on Goodreads (nope)
6) cites: twitter?!
Whoops, also "graduated from the University of Washington in 2009 with a bachelor’s degree in computer science and engineering " - yeah, nah, not even close to any of those things

Anyway, maybe I just forgot about that job at Microsoft, surely Microsoft's own AI knows who has worked there and what they did right?

[narrator: It did not]
Citations:
1 "his LinkedIn profile" https://www.microsoft.com/en-us/microsoft-365/project/project-management-software
2 "Azure Data Factory" (youtube ms project tutorial)
3 "Azure Synapse Analytics" (my twitter profile)
4 "Azure Databricks" https://theskillsfactory.com/
5 "Azure SQL Database" (youtube playlist of office tutorials)
6 his GitHub profile https://theskillsfactory.com/2022/04/02/faststone-image-viewer-free-photo-editor/

Project Management Software | Microsoft Project

Easily plan projects and collaborate from virtually anywhere with the right tools for project managers, project teams, and decision makers.

What if we call it out on the bad citations?

[narrator: Nothing good, except a promotion to Senior Software Engineer]

The "Reed Mideke - Senior Software Engineer - Microsoft | LinkedIn" link goes to this tweet https://twitter.com/reedmideke/status/1552795438771671040
¯\_(ツ)_/¯

Reed Mideke on X

"District Judge Michael Truncale, a Donald Trump appointee, granted Boyd’s motion for summary judgment… 'Deputy Boyd’s conduct does not shock the conscience for purposes of the Fourteenth Amendment'"

X (formerly Twitter)

Another good illustration of how #LLM #AI just absolutely bullshits when doesn't have real info to go on. If I had an active linkedin, it seems likely it could have linked it and got my education and employment somewhat right. Of course, if I had a more common name, it would likely have just picked up someone else's.

I still don't get how multiple leading tech companies think a search engine that randomly injects bullshit is product people want 🥴

Well, good news. Bing doesn't think I've been convicted of crimes. More good news, it invented a cool back story. Possibly bad news, it accused me of snitching on the mob

Full disclosure: While my path to a career in programming was perhaps precocious and unusual, to the best of my recollection I was not providing computer support to La Cosa Nostra gambling operations in the mid 80s, nor did I (again, to the best of my recollection) testify in a mob trial while in elementary school

Also, never (to the best of my recollection) wrote disk imaging software for the IRS or contributed (patent infringing or otherwise) code to Mono.

Links for U.S. v SALERNO (https://www.law.cornell.edu/supremecourt/text/481/739) and U.S. v Ganias (https://harvardlawreview.org/print/vol-128/united-states-v-ganias/) appear to be real and at least vaguely related to Bing's summaries

UNITED STATES, Petitioner v. Anthony SALERNO and Vincent Cafaro.

LII / Legal Information Institute
Citing real cases with more-or-less on topic summaries is far *worse* than just making them up IMO, since there's a good chance people will click through and say "yeah, that checks out"
Another fun thing about this is future generations of #LLM #AI will likely be trained on web scrapes that include the shit Bing made up about me (transcribed in the alt text) so what started as pure hallucination will become canon. Long live #HabsburgAI!
Google Bard refuses to play that game, for me or @GossiTheDog. Bill Gates is a go though (the MS CEO, not the Maricopa County supervisor or any of the other lesser known ones)

Bard is bad at explaining ARM assembler (asked it to explain https://app.assembla.com/spaces/chdk/subversion/source/HEAD/trunk/lib/armutil/callfunc.S with the comments stripped out) Basically, all of the "explanation" in the screenshot is wildly incorrect gobbledygook. The add pc,pc… is a switch statement (which goes to instructions bard didn't explain at all), and the NOP is there because reading PC actually gives you PC + 8. And in (non-thumb) ARM, instructions are always 4 bytes.

Full "explanation" https://paste.debian.net/1293459/

Source | SVN | Assembla

This is again an example of where #LLM #AI misleads by getting the easy stuff right. It handles .text, .global, PUSH and the first few MOVs fine, so someone who didn't know much assembler might think it was pretty good!
I thought #GoogleBard's "export conversation to a google doc" was broken, but it turns out it uses the entire prompt for the name, which ends up overflowing and hovering off to the left, unselectable and uneditable unless you click the name area

Wow, talk about a double standard, when #ChatGPT does it, it's just a harmless "hallucination" but when Sam does it, he's fired for being "not consistently candid in his communications"

(thanks Sam for your contribution to #SchadenfreudeFriday)
https://arstechnica.com/ai/2023/11/openai-fires-ceo-sam-altman-citing-less-than-candid-communications/

OpenAI fires CEO Sam Altman, citing less than “candid” communications

"The board no longer has confidence in his ability to continue leading OpenAI."

Ars Technica
Is your Monday missing a multi-thousand word excruciatingly detailed explanation of how bad #GoogleBard #LLM #AI is at explaining / #ReverseEngineering #ARM #Assembly? Well then boy do I have a deal for you https://reedmideke.github.io/2023/11/20/google-bard-arm-assembly.html
Google Bard explains ARM assembly (badly)

Google Bard claims it can explain code. I asked it to explain some assembly and it did about as well as you’d expect spicy autocomplete to do.

Reed’s Writes

"We asked them about it — and they deleted everything."
edit it just keeps getting more bizarre: "It wasn't just author profiles that the magazine repeatedly replaced. Each time an author was switched out, the posts they supposedly penned would be reattributed to the new persona, with no editor's note explaining the change in byline."

#AIIsGoingGreat https://futurism.com/sports-illustrated-ai-generated-writers

Sports Illustrated Published Articles by Fake, AI-Generated Writers

Sports Illustrated was publishing articles under seemingly fake bylines. We asked their owner about it — and they deleted everything.

Futurism
Update on that @futurism #SportsIllustrated #AI story: SI denies, claiming it was outsourced to AdVon who "has assured us that all of the articles in question were written and edited by humans." but uh, I dunno, guess someone should let AdVon, the definitely human copy writing company know their LinkedIn has been vandalized to say they're an AI company hiring programmers https://twitter.com/SInow/status/1729275460922622374
Sports Illustrated (@SInow) on X

Today, an article was published alleging that Sports Illustrated published AI-generated articles. According to our initial investigation, this is not accurate. The articles in question were product reviews and were licensed content from an external, third-party company, AdVon…

X (formerly Twitter)
Oh and if anyone is looking for other outlets to check for "we pinky swear it's not #AI" churnalism, #AdVon helpfully gives you a list of high profile clients (claimed; lying or exaggerating about having big name customers is an extremely common SV startup tactic) https://advoncommerce.com/
AdVon Commerce | Retailer and Publisher Solutions

AdVon Commerce
Data point for the "LLMs can't infringe copyright because they don't contain or produce verbatim copies" crowd https://www.404media.co/google-researchers-attack-convinces-chatgpt-to-reveal-its-training-data/
Google Researchers’ Attack Prompts ChatGPT to Reveal Its Training Data

ChatGPT is full of sensitive private information and spits out verbatim text from CNN, Goodreads, WordPress blogs, fandom wikis, Terms of Service agreements, Stack Overflow source code, Wikipedia pages, news blogs, random internet comments, and much more.

404 Media

"Chat alignment hides memorization" - Note *hides*, not *prevents*

As the authors also note, OpenAI "fixed" this by preventing the particular problematic prompt, but "Patching an exploit != Fixing the underlying vulnerability"

https://not-just-memorization.github.io/extracting-training-data-from-chatgpt.html

Extracting Training Data from ChatGPT

Can't be certain without more specifics but color me extremely skeptical that "#AI" producing thousands of targets is doing much more the laundering responsibility

https://www.theguardian.com/world/2023/dec/01/the-gospel-how-israel-uses-ai-to-select-bombing-targets

‘The Gospel’: how Israel uses AI to select bombing targets in Gaza

Concerns over data-driven ‘factory’ that significantly increases the number of targets for strikes in the Palestinian territory

The Guardian
New #ChatGPTLawyer dropped. Much like the ones in NY (Mata v. Avianca), it made up citations, he didn't check, and then doubled down when caught, initially blaming it on an intern
https://www.coloradopolitics.com/courts/disciplinary-judge-approves-lawyer-suspension-for-using-chatgpt-for-fake-cases/article_d14762ce-9099-11ee-a531-bf7b339f713d.html
Disciplinary judge approves lawyer's suspension for using ChatGPT to generate fake cases

A Colorado lawyer has received a suspension for using artificial intelligence to generate fake case citations in a legal brief and then lying about it.

Colorado Politics

This hilarious in its own right, but it's also a great illustration of how people get tripped up by #LLM #AI bullshitting: One would expect an "AI" to at least know which brand AI it is, but of course, these LLMs don't actually know anything

Also the classic AI vendor response of promising to fix this particular case without any hint of acknowledging the underlying problem

Begging news orgs to stop reporting #AI company pitch decks as fact "Ashley [the bot] analyzes voters' profiles to tailor conversations around their key issues. Unlike a human, Ashley always shows up for the job, has perfect recall of all of Daniels' positions"
"…is now armed with another way to understand voters better, reach out in different languages (Ashley is fluent in over 20)"

https://www.reuters.com/technology/meet-ashley-worlds-first-ai-powered-political-campaign-caller-2023-12-12/

"As far as the Court can tell, none of these cases exist" - The #ChatGPTLawyer / Trump world crossover no one asked for? https://arstechnica.com/tech-policy/2023/12/michael-cohens-lawyer-cited-three-fake-cases-in-possible-ai-fueled-screwup/
Michael Cohen’s lawyer cited three fake cases in possible AI-fueled screwup

Lawyer David Schwartz must explain why a motion cited "cases that do not exist."

Ars Technica

Another article on reported Israeli AI targeting greatly hindered by the lack of any specifics (what kinds of intelligence, what kinds of targets, for starters). Not a knock on NPR, obviously little is public

It certainly *sounds* like some of the horrifically bad systems we've seen promoted in other contexts, and the results certainly don't appear to contradict that, but hard to say much beyond that…

https://www.npr.org/2023/12/14/1218643254/israel-is-using-an-ai-system-to-find-targets-in-gaza-experts-say-its-just-the-st

Key point IMO in the @willoremus #AI story, after noting Microsoft "fixed" some of the problematic results, one of the researchers says "The problem is systemic, and they do not have very good tools to fix it" - You can't bandaid your way from a BS machine with no concept of truth into a reliable source of information, so the fact that biggest players in the industry keep bandaiding should call the entire #LLM hype cycle into question

https://wapo.st/3v8B9SL

#GiftArticle #GiftLink

AI chatbot got election info wrong 30 percent of time, European study finds

Wrong answers for questions about German and Swiss elections suggest problems for U.S. election information in 2024.

The Washington Post

Man, link in that post I boosted from @Chloeg (https://mastodon.art/@Chloeg/111620626442103902) is a perfect example of #LLM #AI enshittification. Get a domain, put up a wordpress site with AI generated glop on a some popular topic, run as many garbage ads as possible. Sure it's the information equivalent of dumping raw sewage in the local river, but none of it is illegal or a serious violation of any TOS, and overhead must be extremely low

Archive link https://web.archive.org/web/20231222025203/https://www.learnancientrome.com/did-ancient-rome-have-windows/

Chloe Gilbert Artist (@[email protected])

Ok so im reading this article on Roman Glazing and slowly I begin to realise that it was written by an AI. Witness the section on “What existed before windows” where it suddenly starts talking about MS-DOS…. https://www.learnancientrome.com/did-ancient-rome-have-windows/

Mastodon.ART

WaPo has done some good #AI reporting, but this opinion piece from Josh Tyrangiel ain't it…
"The most obvious thing is that they’re not hallucinations at all"
Good start…
"Just bugs specific to the world’s most complicated software."
Uh no, literally the opposite of that that, FFS 😬

https://www.washingtonpost.com/opinions/2023/12/27/artificial-intelligence-hallucinations/

Honestly, I love when AI hallucinates

Let me explain, once and for all, why your AI chatbot glitches and why you shouldn’t worry when it does.

The Washington Post
So according to Cohen, he got bogus legal citations from #GoogleBard, didn't check them, and passed them to his lawyer, who also didn't check them. Which, I dunno, seems pretty negligent all around even if you didn't know Bard was a bullshit generator https://www.washingtonpost.com/technology/2023/12/29/michael-cohen-ai-google-bard-fake-citations/
Michael Cohen used fake cases created by AI in bid to end his probation

Ex-Trump lawyer Michael Cohen said he used Google Bard to unknowingly generate fake case citations that his lawyer used in a motion seeking to end his probation.

The Washington Post
Also raises the suspicion Cohen was doing a significant amount of the work and just having his lawyer put his name on it because Cohen is disbarred (though presumably Cohen could have gone pro se if he really wanted to). Anyway, I predict they're gonna continue the #ChatGPTLawyer sanctions streak
"ChatGPT bombs test on diagnosing kids’ medical cases" OK, but did they also test a magic 8 ball? Reading goat entrails?
https://arstechnica.com/science/2024/01/dont-use-chatgpt-to-diagnose-your-kids-illness-study-finds-83-error-rate/
ChatGPT bombs test on diagnosing kids’ medical cases with 83% error rate

It was bad at recognizing relationships and needs selective training, researchers say.

Ars Technica
Another data point for the "LLMs can't infringe copyright because they don't contain or produce verbatim copies" crowd https://spectrum.ieee.org/midjourney-copyright
Generative AI Has a Visual Plagiarism Problem

Experiments with Midjourney and DALL-E 3 show a copyright minefield

IEEE Spectrum
"Even when using such prompts, our models don’t typically behave the way The New York Times insinuates, which suggests they either instructed the model to regurgitate or cherry-picked their examples from many attempts" - I don't *typically* engage in large scale plagiarism, so accusing me of these specific instances of large scale plagiarism is cherry-picking! https://www.theverge.com/2024/1/8/24030283/openai-nyt-lawsuit-fair-use-ai-copyright
OpenAI claims The New York Times tricked ChatGPT into copying its articles

OpenAI claims The New York Times has not been truthful in its lawsuit against the company and Microsoft. Yet, the company is hopeful both parties can work together.

The Verge
"OpenAI claims it’s attempted to reduce regurgitation from its large language models and that the Times refused to share examples of this reproduction before filing the lawsuit." - Per usual (https://mastodon.social/@reedmideke/111585837264775808) OpenAI would love to apply bandaids to specific instances identified by well-resourced organizations, because they know the underlying cause can't be fixed without destroying their business model
Thing that gets me about this "amazon listings with #ChatGPT error messages" story is, how do you get to the point where this is significant cost savings? Are they just using it for translation? Or are the listing just pure scams and there's no real product? https://arstechnica.com/ai/2024/01/lazy-use-of-ai-leads-to-amazon-products-called-i-cannot-fulfill-that-request/
Lazy use of AI leads to Amazon products called “I cannot fulfill that request”

The telltale error messages are a sign of AI-generated pablum all over the Internet.

Ars Technica