Absolutely fascinating piece by David Oks connecting language model oddities to human cultural development. Picked this up courtesy of Deena Mousa's "Under Development" newsletter.
https://davidoks.blog/p/language-models-are-weird-for-the
Language models are weird for the same reason human cultures are weird

You can’t have adaptive learning without strange tics

David Oks
@shriramk I like that the text traces these connections while mostly avoiding the anthropomorphism (until the end where it rears its ugly head among mentions of "psychology" and "mind-like" entities, sic). A science of connectionist language patterns is waiting, that is exciting, and we may learn a great deal about humans, too. But to ascribe human traits to models is IMHO a bridge crossed too far and ignores that ultimately it is in our human desire and need for interpretation where the oddness and weirdness judgment comes up. The linear algebra does not care, and that makes LLM technology not less but *more* prone to be used for manipulating humans.

@burakemir Okay, but most writers (not all, obviously) do not actually mean to anthropomorphize. It's just a convenient shorthand.

Like if I say "the LLM says" or "the LLM knows", sure, the first time I could write «the LLM "says"» or «the LLM "knows"» but the tenth time you read that in the same essay, it's irritating as heck.

Note that I'm also the kind of person who refuses to assign a gender to Alexa, Siri, etc.: it's an "it".

@shriramk No disagreement, the shorthand is more than convenient, we don't seem to have good ways (for now?) to sustainably talk about "speak, know, answer" and goddamn "intelligence" while preserving that nuance. Anthropomorphism is very old with general public computer use and it would not be an issue if everyone had the capacity to see it as a playful shorthand. Alas, anthropomorphism is normalized and instrumentalized in marketing what amounts to a war on human labor. Apart from the risk of being unknowingly co-opted, it is also intellectually interesting to ponder why researchers do not seem capable of consistently using technical and precise language here.
@burakemir What "consistent and precise" language would you use that is not a mouthful every time you use it?

@shriramk I am no Kamlah and Lorenzen, but let me try.

* instead of "say/reply", use "emit" or "generate" output.
* Instead of "thinks/understands" use "computes probabilities" or "maps vector spaces".
* Instead of "hallucinates" use "emits misaligned patterns".
* Instead of "learns" use "optimizes weights".
* Instead of "cooperate", say the human "initiates" interaction by providing constraints, and the model solves these constraints.

@burakemir I like emit and generate, but for the most part I think the LHSs fall into exactly the "convenient shorthand" category. (I do think "think" and "understand" are egregious, but we can just say "computes" instead.)
@shriramk The question is: shorthands for *what* : ) There is something genuinely lacking, a scientific language of semiotic (self-) interaction and control.
The vector spaces are not even the point, we are interacting with a machine that retrieves lossy/compressed accounts of "meaning", and it is only our interpretation and evals that assign this meaning.

@burakemir @shriramk i feel like part of the problem here is how not every aspect of communication is literal but we're not always good at signaling or even recognizing when we're moving between fact and metaphor.

we do this in the other direction too, e.g. sometimes say i'm "adjusting my priors" when i learn new information. but i suppose it's usually more obviously metaphorical in this direction

@chrisamaphone @shriramk I also think there is something about metaphor, or more generally the ability to make and understand analogy which is key. This is part of human intelligence, but we are also intentionally tricked (we want to be tricked, we want to interact with the machine like we interact with humans, because we like to belong.)

Analogies are fine but this tech is useful because it captures "knowledge" in a (still) novel, practically relevant way and then simulates human communication. The ML researchers are IMHO not particularly qualified to talk about this; we have never been able to measure knowledge, the extent, practical relevance or effectiveness of communication. Now, in addition to that, we are trying to get comfortable in a sea of convincing, plausible and self-confirming fuzzy information retrieval plus slop, and all that gets to be called "intelligence". This is a very weak analogy. The machine lacks inherent purpose.

IMHO it would be nice if people who think could also engage in precise, "formal", "objective" language to express whatever this is, and such discourse will be flawed if it resembles the goddamn marketing talk and fails to acknowledge purpose and control. There needs to be a signal that we are striving for intellectually honesty. I am certainly not playing the science language police here, I just want to convey that I am missing this and it seems a real prerequisite to meaningful discussion on ethics of LLMs and LLM use.

@burakemir @chrisamaphone
Frankly, none of this is new. Analogy and metaphor are *always* used to understand new things by relating to the old. Whole research programs have been built around this idea. And they cause problems everywhere (which is why I'm personally *not* a fan of their use). ↵

@burakemir @chrisamaphone

I actually try to collect analogies that people use specifically for generative AI. The thing is, they're really bad in *both* directions. They are less a reflection of the underlying reality than of the positions of the people who use them. The ones that deprecate it, for instance, clearly just fly in the face of every available lived positive evidence. ↵

@burakemir @chrisamaphone
But when you ask for precise, scientific speech: there is no such thing. When people talk about the "flow" of electricity, or for that matter when they talk about "looking" into the past, they're using metaphors. This is just how we communicate. Indeed, the entire formation of abstractions in linguistics is very revealing. ↵

@burakemir @chrisamaphone
It's marketing talk in both directions. Take away metaphors and you strip away natural, human language. You can't even use technical terms: e.g., reinforcement learning is full of "common" terms (eg, "curriculum learning") that have been made technical. (Of course, "reinforcement" and "learning" also have just that property.)

So you *can* strip away all natural language, but then you only get mathematical symbols. I'm fine with that! But everything else is political.

@shriramk @chrisamaphone Sure, there is an aspect of politics in making terminology and concepts stick, and metaphor is helpful. I am more wondering about the meaning, and there anthropomorphism is clearly a crutch that does not convey much and is sometimes misleading (and vector spaces seem lacking, too). We are capable of making definitions of technical words though, and we can assess whether the concepts brings us wisdom. I want to see that happen, maybe "thinking" and "reasoning" in models is a first confused step on a journey, it is hard to accept that our (human) understanding should just end there.
@burakemir @chrisamaphone
I disagree. It conveys a LOT. It *is* misleading, which is definitely a problem. But people wouldn't do it if it didn't do not convey a great deal, and people would also reject it if it didn't in some sense match their intution. ↵
@burakemir @chrisamaphone
When we say "GPT 5.5 just produced a proof of X", we don't mean in the exact same sense as a human wrote a proof, so that is clearly a kind of anthropomorphism, because we are evoking a human-ish activity, and didn't say "typed out a sequence of characters that matched both the well-formedness and validity characteristics of Lean syntax and happened to pass the dependently-typed system in a manner consistent with the prior formalization of X". Would that be better?
I have no issue at all with "X is performing (symbolic task)", my issue is with statements like LLM interpretability research "sitting somewhere between classical psychology and early neuroscience." I can see what they mean, since I read Anthropic safety lore and looked at Opus system prompts. In LLM dialogue, we do role play but the metaphor gets "real" in a different way when we try to steer in order to "exhibit behavior". Understanding the context dependence and task dependence of these "thinking" patterns better is likely the key to energy-efficient LLM architecture with fewer parameters and/or combination with symbolic algorithms, like in RAG systems for example, or other specialization. And I think that might matter for safety, too.