fly51fly (@fly51fly)

New paper: 'Interpreting and Controlling Model Behavior via Constitutions for Atomic Concept Edits'. It proposes a technique based on 'constitutions' for atomic concept edits, aimed at interpreting and controlling model behavior. The authors include N Kalibhat, Z Wang, P Bajpai, and D Proud of Google DeepMind, and the paper is posted on arXiv (link included).

https://x.com/fly51fly/status/2018802485377593565

#deepmind #modelediting #interpretability #arxiv

fly51fly (@fly51fly) on X

[LG] Interpreting and Controlling Model Behavior via Constitutions for Atomic Concept Edits N Kalibhat, Z Wang, P Bajpai, D Proud... [Google DeepMind] (2026) https://t.co/XOqcJibvQI
