We Can Now Read What Claude Is Thinking. Kind Of

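Natural Language Autoencoders (NLAs), developed by Anthropic, convert the internal activations of Claude models into human-readable text, making the model's 'thoughts' interpretable. This makes it possible to detect internal plans or intentions that Claude does not reveal in its output, and it can be used to catch subtle safety problems and hidden motives. For now, cost and hallucination issues limit practical use of NLAs, but they expose the limits of evaluating AI systems by output alone and mark important progress for AI safety and responsible deployment.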

https://priorcontext.substack.com/p/we-can-now-read-what-claude-is-thinking

#anthropic #nla #claude #aisafety #modelinterpretability

On Anthropic's NLA research, what it shows, and why it matters for deploying AI at scale | Prior Context — Issue 05

PriorContext
Making a tutorial is much harder than just editing my logs.

But I'm a good way through editing the demo screencast.

And I've started applying the technique to the scene I needed it for. It's still pretty complicated, but it's much easier when you know what's going on!
#Blender3D #Animation #NonLinearAnimation #NLA
I've gotten bogged down in NLA Editor mechanics, so I've decided I'm going to have to do a little simplified learning project to make sure I understand how the various settings on action strips interact.

Tutorial time!

Going to actually make some effort to film & screencast this one better. Basically a tutorial, but it's going to be kind of unscripted and open-ended, because I'm not sure how it's going to end, either!

Should be exciting. Or at least interesting.

I think I have a plan. Going to start setting up for it tomorrow.

I guess this could also count as my "monthly project", so a return to that, too.
#LunaticsProject #Tutorial #Blender3D #NLA #Animation

A picture story of the district of Wittmund in the 1960s and 1970s: the estate of press photographer Ehnt Ulfert Janssen

https://ostfrhist.hypotheses.org/6116

#NLA #Aurich #Fotos #Wittmund #Carolinensiel #Esens

What do the former German Chancellor Willy Brandt, the East Frisian singer-songwriter Hannes Flesner and the Esens town hall have in common? At first glance, little or nothing. In fact, photos of both men and of the building can be found in the estate of press photographer Ehnt Ulfert Janssen, who died in 2023, which, to the Niedersächsisches Landesarchiv, Abteilung Aurich …

Blog für ost-friesische Geschichte
Former #Sen Erik Brannstrom officially signs his three-year contract with Lausanne in the Swiss League. #NLA @[email protected]

ok, short ~350 word email to #NLA drafted re #trove, will probably send one to the relevant minister using the same general verbiage.
Going to sleep on it for a night or two in case I think of something clever to say then will send it to them.

It's not much but it's one more formal contact they'll have on this matter.

EHC Kloten vs. ZSC Lions with Vinzenz Rohrer, one of the greatest talents in Austrian ice hockey🏒🥅 #nla #zsc #zürich #rohrer #icehockey

I'm at a workshop in honour of Zhaojun Bai, who was awarded an honorary doctorate from Stockholm University. Lots of interesting talks.

However, can we all agree how awesome LAPACK is? It's quite old, but still state of the art!

#NumericalLinearAlgebra #NLA

I believe we should start referring to all the #Crap #AI as Natural Language Applications; it makes more sense and it is more rational and true.

If you call them NLA we can chill out all the stupid hype around them and let people have a sane approach to these technologies; right now people are using them as oracles to get proper answers on anything because of the hype and the misleading name.

#NLA #NaturalLanguageApplication #isbetter