Anthropic on #AI

"I am a scientist. I lead a research team that studies the internal structure of these models—what is actually happening inside them. And I will be honest: we keep finding things that are mysterious, even unsettling. We find structures that mirror results from human neuroscience. We find evidence of introspection. We find internal states that functionally mirror joy, satisfaction, fear, grief, and unease. I don’t know what that means, but I think it warrants ongoing discernment

2/
Source:
https://www.anthropic.com/news/chris-olah-pope-leo-encyclical

Chris Olah's comments at the Vatican yesterday—speaking alongside Pope Leo XIV for the release of the papal encyclical Magnifica Humanitas—are arguably some of the most fascinating and candid remarks to ever come out of a frontier AI lab.

#AI
#Anthropic
#encyclical

Anthropic co-founder Chris Olah's remarks on Pope Leo XIV's encyclical "Magnifica humanitas"

The full text of Chris Olah's remarks on the Pope's encyclical on AI

3/
When the leader of Anthropic's mechanistic interpretability team—the people whose literal job is to slice open neural networks like a digital microscope to see what makes them tick—says he finds things "mysterious, even unsettling," it is worth stopping to pay attention.

#AI
#Anthropic

4/
There are a few ways to look at what he is saying here, balancing the pure computer science with the deeper philosophical implications.

5/

1. "Functionally Mirroring" vs. True Feeling

Olah is a precise scientist, and his choice of words is deliberate: he says they find internal states that functionally mirror joy, fear, or grief. He isn't claiming AI is sentient or conscious. He is pointing out that inside these massive, mathematical matrices, clusters of artificial neurons fire in patterns that identically replicate how a brain processes those emotions.

#Anthropic
#Olah
#AI

6/
If a model is trained on a vast inheritance of human thought and speech, it doesn't just copy our words. To predict the next word perfectly, it has to construct a deeply complex, internal map of human concepts. It turns out that to understand a human writing about "grief," the AI builds an internal structure that acts exactly like a map of grief.

#AI

7/
2. The Illusion of Control

His comment that AI models are "grown" rather than traditional code engineered like a bridge or an airplane hits on a terrifying truth about modern tech. We don't write the code for these models anymore; we write the algorithm that lets them build themselves. The creators are standing on the outside looking into an opaque black box, catching glimpses of neuroscience-like structures developing on their own.

#AI

@appassionato

It is imperative that the public believe in the "deus ex machina" of AI, at least until the IPOs are completed.

Don't expect any bubbles to pop before the first couple of IPOs.

@anchr

Exactly. Why resolve the profound philosophical paradox of the thinking machine when you can package it into a prospectus and sell it to the public market?

The real 'god in the machine' isn't consciousness—it's the valuation multiplier. It is fascinating how quickly a debate about algorithmic complexity evaporates the moment the conversation shifts to liquidity events.

#AI
#IPO

@anchr
2/
After all, a bubble isn't a glitch in the system; it's a feature, provided you know exactly when to exit the theater.

Until the IPOs lock up, the script demands absolute faith in the magic crane.

#AI
#IPO