Yesterday Cory Doctorow argued that refusal to use LLMs is mere "neoliberal purity culture". I think his argument is a strawman, doesn't align with his own actions, and delegitimizes important political actions we need to take in order to build a better cyberphysical world.

EDIT: Discussions under this are fine, but I do not want this to turn into an ad hominem attack on Cory. Be fucking respectful

https://tante.cc/2026/02/20/acting-ethical-in-an-imperfect-world/

Acting ethically in an imperfect world

Life is complicated. Regardless of what your beliefs or politics or ethics are, the way that we set up our society and economy will often force you to act against them: You might not want to fly somewhere but your employer will not accept another mode of transportation, you want to eat vegan but are […]

Smashing Frames

@tante

That doesn't seem to be the best idea @pluralistic

AI and LLM output is 90% bullshit, and most people have neither the time nor the patience to work out which 10% might actually be useful.

That's completely ignoring the environmental and human impacts of the AI bubble.

Try buying DDR memory, a GPU or an SSD / HDD at the moment.

@simonzerafa @tante

What is the incremental environmental damage created by running an existing LLM locally on your own laptop?

As to "90% bullshit" - as I wrote, the false positive rate for punctuation errors and typos from Ollama/Llama2 is about 50%, which is substantially better than, say, Google Docs' grammar checker.

@pluralistic @tante

Of course, I am speaking in generalities.

Encouraging the use of LLMs is counterproductive in so many ways, as I highlighted.

Pop a power meter on that LLM adorned PC and let us all know what the power usage looks like with and without your chosen LLM running on a typical task 🙂

That's power that has to be generated somewhere, even if it's with renewable energy.

The main issue with LLMs is that they don't encourage critical thinking, in a world which is already suffering from a massive shortage of it.

@simonzerafa @pluralistic @tante

Pop a power meter on that LLM adorned PC and let us all know what the power usage looks like with and without your chosen LLM running on a typical task

challenge accepted! :D

my laptop draws about 6 W when idling, and 25 W when playing games or running inference

I'd attribute the difference, about 19 W, to inference

my 900 W microwave uses 15 Wh per minute

so microwaving a frozen burrito for two and a half minutes (~37.5 Wh) is equivalent to about two hours of inference (or games) on my laptop

also, that burrito was frozen. refrigerator wattage varies widely, but an average running draw of 150 W is nominal

at 150 W the refrigerator uses almost 8x more energy per hour than the laptop inference does, and the refrigerator runs 24/7/365!

most of my inference tasks complete in about 30 seconds, which works out to about 0.16 Wh per inference job. that's almost 940 inference jobs (assuming a 30 s average) for one hour of refrigerator energy
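The arithmetic above can be sketched in a few lines of Python. All the wattages are the poster's own measurements from this thread (idle vs. load draw, microwave rating, nominal fridge average), not general benchmarks:

```python
# Back-of-the-envelope energy math from the thread. Every constant here
# is an assumption taken from the posts above, not a measured benchmark.

IDLE_W = 6.0    # laptop draw at idle, watts
LOAD_W = 25.0   # laptop draw during inference or games, watts
INFERENCE_W = LOAD_W - IDLE_W          # ~19 W attributable to inference

MICROWAVE_W = 900.0
burrito_wh = MICROWAVE_W * (2.5 / 60)  # 2.5 minutes -> ~37.5 Wh

# hours of laptop inference equivalent to one microwaved burrito
hours_equiv = burrito_wh / INFERENCE_W           # ~2 h

FRIDGE_W = 150.0                                 # nominal average running draw
fridge_vs_inference = FRIDGE_W / INFERENCE_W     # ~7.9x

job_wh = INFERENCE_W * (30 / 3600)               # one 30 s job -> ~0.16 Wh
jobs_per_fridge_hour = FRIDGE_W / job_wh         # ~940-950 jobs

print(f"inference draw: {INFERENCE_W:.0f} W")
print(f"burrito: {burrito_wh:.1f} Wh = {hours_equiv:.1f} h of inference")
print(f"fridge uses {fridge_vs_inference:.1f}x the inference power")
print(f"{jobs_per_fridge_hour:.0f} thirty-second jobs per fridge-hour")
```

Running it reproduces the thread's figures within rounding: ~2 hours per burrito, ~8x for the fridge, and a bit over 940 jobs per fridge-hour.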

@memoria Not @-mentioning everyone here. I'm curious what kinds of useful inference you can do on your home machine, and with which models?

(I have a M1 Studio Ultra on my desk but only tried local inference long ago)

@santi

sure, i use inference in a few ways:

  • karakeep - tagging bookmarks, semantic search
  • immich - face detection, semantic search
  • paperless-gpt - document titles, tags, and OCR
  • libretranslate - language translations
  • Speakr - voice to text transcription, tagging, summaries, semantic search
  • audiomuse - sonic analysis on my music collection to generate sonically similar playlists and track queues

as for LLM models:

i really like the IBM granite4 models, specifically the 7B hybrid model (granite4:7b-a1b-h). it's hands down the best text-only model for its CPU and memory (4.2 GiB) requirements.

Gemma3:4b is an all-around good model for its size, and can output text from text and image inputs. it's a pure transformer model, so it's heavier to run than hybrid models, and 4B models do tend to go off the rails faster and more frequently.

qwen2.5vl:3b is the best image to text model i can run on my system. qwen3vl:4b is significantly better, but i can't reasonably run it

with an M1 Ultra you could probably run the largest of these models and have inference complete almost instantly

@memoria Thanks! I'll definitely play with this again one of these days!