Gemma Scope 2: Bộ công cụ mở cho khả năng giải thích mô hình ngôn ngữ, được xây dựng từ 1 nghìn tỷ thông số. Công cụ phân tích sâu trạng thái kích hoạt nội bộ của mô hình Gemma 3 và hành vi hội thoại. #GemmaScope #AI #MôHìnhNgônNgữ #Interpretability #CôngNghệAI #MachineLearning #OpenSource #TriTueNhanTao #KiemSoatMoHinh

https://www.reddit.com/r/LocalLLaMA/comments/1pqk7sd/gemma_scope_2_open_suite_of_tools_for_language/

AI's Philosophical Tech Challenge - Dean W Ball on 80000 Hours

#interpretability #ai #courtroom

“OpenAI’s work is part of a hot new field of research known as #mechanistic #interpretability, which is trying to map the internal mechanisms that #models use when they carry out different tasks.” www.technologyreview.com/2025/11/13/1... #OpenAI #LLMs

OpenAI’s new LLM exposes the s...
OpenAI’s new LLM exposes the secrets of how AI really works

The experimental model won't compete with the biggest and best, but it could tell us why they behave in weird ways—and how trustworthy they really are.

MIT Technology Review
Understanding neural networks through sparse circuits - OpenAI openai.com/index/understa… #AI #interpretability
Visual Features Across Modalities: SVG and ASCII Art Reveal Cross-Modal Understanding - Visual Features Across Modalities: SVG and ASCII Art Reveal Cross-Modal Unde... - https://simonwillison.net/2025/Oct/25/visual-features-across-modalities/#atom-everything #pelican-riding-a-bicycle #interpretability #generative-ai #anthropic #llms #svg #ai
Visual Features Across Modalities: SVG and ASCII Art Reveal Cross-Modal Understanding

New model interpretability research from Anthropic, this time focused on SVG and ASCII art generation. We found that the same feature that activates over the eyes in an ASCII face …

Simon Willison’s Weblog

"Anthropic rao v paper mới về mã Hogan điểm kê theo geometriậy. Tiêu Related cuộn vệ○ lea̛ا sản mã Hogan bo trọ bại tổi cậc phân outil để cháyTraits bọi phịnStraight. Đọc thêm trong #AI #Interpretability #Anthropic #countingtask #T/NewAI#Môi\tả #bài_tổi_đếm!"

https://www.reddit.com/r/singularity/comments/1od3mfw/when_models_manipulate_manifolds_the_geometry_of/

#CodemotionMilan è la prossima settimana !

Ci sarò anche io con un talk su
🔍 #ExploratoryDataAnalysis
📊 #DataVisualization

mercoledì 1️⃣5️⃣ ottobre alle 🕧 12:30 Gate 5️⃣

Corley Cloud at CodemotionMilan
#EDA #GraphTools #Accessibility #Interpretability #BestPractices

The Interpretable AI playbook: What Anthropic’s research means for your enterprise LLM strategy https://venturebeat.com/ai/the-interpretable-ai-playbook-what-anthropics-research-means-for-your-enterprise-llm-strategy/ #AI #interpretability