Mastodawn

I was disappointed to read Cory Doctorow's post where he got weirdly defensive about his LLM use and started arguing with an imaginary foe.

@tante has a very thoughtful reply here:

https://tante.cc/2026/02/20/acting-ethical-in-an-imperfect-world/
A few further comments, 🧵>>

Acting ethically in an imperfect world

Life is complicated. Regardless of what your beliefs or politics or ethics are, the way that we set up our society and economy will often force you to act against them: You might not want to fly somewhere but your employer will not accept another mode of transportation, you want to eat vegan but are […]

Smashing Frames

Prof. Emily M. Bender (she/her), Feb 21

It was particularly disappointing to see Doctorow misconstrue (and thus, if he is believed, undermine) the work that many of us are doing to shine a light on the ways in which the ideology of "AI" and the specific ways in which LLMs and other "AI" products are created do real harm.
>>

Prof. Emily M. Bender (she/her), Feb 21

I also want to point out (again) the ways in which lumping together all uses of LMs (like the lumping of technologies into "AI") obscures the issues at hand.

Language modeling is a useful component of many technologies that can be built without extractive, exploitative means. Take the automatic transcription built by and for the Māori people -- there's a te reo Māori language model that's part of that.
>>

Prof. Emily M. Bender (she/her), Feb 21

And the transformer architecture represented an important step forward in language modeling, one that brought improvements to things like spell checking (Doctorow's use case).
>>

Prof. Emily M. Bender (she/her), Feb 21

And you can build and use language models without turning them into the synthetic text extruding machines that are despoiling our information ecosystem.

And even if those are easily accessible, because OpenAI et al. want to burn through cash with their demos, we can still refute and refuse the narrative that synthetic text is somehow a panacea to be used across social services (medicine, education) and in science, etc.
>>

Prof. Emily M. Bender (she/her), Feb 21

Doctorow could have gone into these details; could have said something about how the particular LLM he chose was built (whose data, trained how, how much data, what kind of further data work in RLHF); could have drawn a distinction about use cases.
>>

Martin Hamilton

@emilymbender Hi from a random Internet person! I wondered if you have a view on "Sovereign" models like Apertus? Per https://raw.githubusercontent.com/swiss-ai/apertus-tech-report/main/Apertus_Tech_Report.pdf

FWIW I am a genAI sceptic who started out feeling quite positive about this development, but then cooled on it rapidly once I realised that it doesn't address a) environmental impacts, or b) potential harms when genAI is used naively - or for plausible deniability by people doing bad stuff ¯\_(ツ)_/¯

For anyone reading this who hasn't come across Apertus before, there are now several models like this with characteristics such as:

Full disclosure of training data
robots.txt is respected during scraping
Training corpus includes under-represented languages/cultures
Measures taken to mitigate harm are documented
Code base is open source, not just the weights