Steerling-8B, the first interpretable model that can trace any token it generates back to its input context, to concepts a human can understand, and to its training data.

https://www.guidelabs.ai/post/steerling-8b-base-model-release/

#AI #InterpretableAI #DiffusionModel #DiffusionModels

Steerling-8B: The First Inherently Interpretable Language Model

We release Steerling-8B, an 8B-parameter causal diffusion language model that is interpretable by construction: its predictions are routed through concepts you can measure, audit, and control.
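
For intuition, here's a minimal sketch of the concept-bottleneck idea that "predictions routed through concepts" points at. This is a generic illustration, not Guide Labs' actual architecture or API; every name here is hypothetical:

```python
# Hypothetical sketch of a concept-bottleneck layer -- NOT Guide Labs' code.
import torch
import torch.nn as nn

class ConceptBottleneck(nn.Module):
    def __init__(self, d_model: int, n_concepts: int, vocab_size: int):
        super().__init__()
        # Map hidden states to scores for human-readable concepts.
        self.to_concepts = nn.Linear(d_model, n_concepts)
        # The output head sees ONLY the concept scores, so every
        # prediction is attributable to (and steerable through) them.
        self.to_logits = nn.Linear(n_concepts, vocab_size)

    def forward(self, h: torch.Tensor):
        concepts = torch.sigmoid(self.to_concepts(h))  # measurable
        logits = self.to_logits(concepts)              # auditable path
        return logits, concepts

model = ConceptBottleneck(d_model=64, n_concepts=8, vocab_size=100)
h = torch.randn(2, 64)                  # stand-in hidden states
logits, concepts = model(h)
steered = concepts.clone()
steered[:, 3] = 0.0                     # "control": ablate one concept
steered_logits = model.to_logits(steered)  # predictions shift traceably
```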

🤖 Are protein language models the next revolution in functional genomics?

🔗 Fine-tuning protein language models to understand the functional impact of missense variants. Computational and Structural Biotechnology Journal, DOI: https://doi.org/10.1016/j.csbj.2025.05.022

📚 CSBJ: https://www.csbj.org/
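
For readers curious how a PLM scores a missense variant at all, here's a hedged sketch of the common log-likelihood-ratio recipe: compare the model's probability of the wild-type vs. mutant residue at the mutated position. The paper's fine-tuning pipeline is more involved; `plm_residue_logprobs` below is a stand-in, not their model:

```python
# Minimal sketch of masked-LM variant scoring -- not the paper's pipeline.
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def plm_residue_logprobs(sequence: str, pos: int) -> dict:
    # Placeholder: a real protein LM would mask position `pos` and
    # return log-probabilities over the 20 amino acids.
    rng = np.random.default_rng(pos)
    logits = rng.normal(size=len(AMINO_ACIDS))
    logp = logits - np.log(np.exp(logits).sum())
    return dict(zip(AMINO_ACIDS, logp))

def variant_effect_score(sequence: str, pos: int, wt: str, mut: str) -> float:
    """log P(mutant) - log P(wild-type); more negative = more damaging."""
    assert sequence[pos] == wt, "wild-type residue mismatch"
    logp = plm_residue_logprobs(sequence, pos)
    return logp[mut] - logp[wt]

seq = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"
print(variant_effect_score(seq, pos=5, wt="I", mut="P"))
```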

#Genomics #AI #ProteinLanguageModels #Bioinformatics #PrecisionMedicine #PLM #MissenseVariants #MachineLearning #ClinicalGenetics #AIinHealthcare #InterpretableAI #VariantInterpretation

🧠🤝 The 1st SemGenAge Workshop is now live at #ESWC2025!
📍 Room 5 – Adria II, Floor 11

SemGenAge is exploring how to bridge the gap between Large Language Models (LLMs) and Semantic Web technologies, with the goal of building intelligent agents that are interpretable, controllable, and socially aware.

Join us to shape the future of human-aligned, explainable AI! 🚀

#SemGenAge2025 #LLMs #SemanticWeb #IntelligentAgents #InterpretableAI #ResponsibleAI #KnowledgeGraphs #SocialAI #ESWC2025

🧠 Is AI ready to be your doctor's second opinion, or is it still a black box?

🔗 From explainable to interpretable deep learning for natural language processing in healthcare: How far from reality? Computational and Structural Biotechnology Journal, DOI: https://doi.org/10.1016/j.csbj.2024.05.004

📚 CSBJ Smart Hospital: https://www.csbj.org/smarthospital

#XIAI #ExplainableAI #InterpretableAI #HealthcareAI #NLPinHealthcare #Transformers #DeepLearning #ClinicalNLP #AIethics #MedicalAI #XAI #IAI

Stephen Hahn, Rico Zhu, Simon Mak, Cynthia Rudin, and Yue Jiang. 2023. An Interpretable, Flexible, and Interactive Probabilistic Framework for Melody Generation. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '23). Association for Computing Machinery, New York, NY, USA, 4089–4099. https://doi.org/10.1145/3580305.3599772 | I love #interpretableAI and generally the kinda stuff Cynthia Rudin produces. Made a few tunes using the tool and they are pretty damn good.
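
The paper's framework is a structured probabilistic model you can inspect and steer. As a toy stand-in for why that class of model is interpretable (purely illustrative, not the authors' method), here's a hand-editable Markov chain over notes, where every choice the generator makes is an explicit conditional probability you can read off a table:

```python
# Toy interpretable probabilistic melody model -- NOT the KDD paper's framework.
import random

# Transition table P(next note | current note), editable by hand.
transitions = {
    "C": {"D": 0.4, "E": 0.4, "G": 0.2},
    "D": {"C": 0.3, "E": 0.5, "F": 0.2},
    "E": {"D": 0.3, "F": 0.4, "G": 0.3},
    "F": {"E": 0.5, "G": 0.5},
    "G": {"E": 0.3, "F": 0.3, "C": 0.4},
}

def generate(start="C", length=8, seed=0):
    random.seed(seed)
    melody = [start]
    for _ in range(length - 1):
        notes, probs = zip(*transitions[melody[-1]].items())
        melody.append(random.choices(notes, weights=probs)[0])
    return melody

print(generate())  # every note is traceable to one table entry
```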

Understanding AI through the use of expressive Boolean formulas #InterpretableAI

Hashtags: #chatGPT #ExplainableAI #BooleanLogic

Summary: The rapid growth of artificial intelligence (AI) and machine learning applications in various industries has raised concerns about the complexity and lack of transparency in these systems. In fields like finance and medicine, where regulations and best practices require explainability, the current black box algorithms used in AI can…

https://webappia.com/understanding-ai-through-the-use-of-expressive-boolean-formulas-interpretableai/

The Fidelity Center for Applied Technology (FCAT) and the Amazon Quantum Solutions Lab have collaborated to propose an interpretable machine learning model for Explainable AI (XAI) based on expressive Boolean formulas. This approach aims to address the complexity of AI algorithms and meet the transparency requirements of industries like finance and medicine. The model has been successfully implemented and benchmarked on public datasets, showing competitive performance. The use of special purpose hardware or quantum devices can further enhance the model's efficiency. This XAI model has potential applications in healthcare and finance, providing insights for product development and marketing optimization.
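
The core idea is easy to see in miniature: the classifier is a short Boolean formula over binarized features, so "explaining" it is just reading the formula, and auditing it can be exhaustive. A hypothetical sketch (not the FCAT/Amazon model or its training procedure, and the credit-decision rule below is invented for illustration):

```python
# Toy expressive-Boolean-formula classifier -- NOT the FCAT/Amazon model.
from itertools import product

# Hypothetical rule that some search procedure might have found:
# approve = (income_high AND NOT recent_default) OR long_history
def formula(income_high: bool, recent_default: bool, long_history: bool) -> bool:
    return (income_high and not recent_default) or long_history

# Audit the model exhaustively -- feasible precisely because it is a
# short formula, unlike a black-box network.
for x in product([False, True], repeat=3):
    inputs = dict(zip(["income_high", "recent_default", "long_history"], x))
    print(inputs, "->", formula(*x))
```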

Our latest paper has now been published in #ImmunoInformatics! 🎉

Predicting #TCR #epitope binding is extremely challenging. 🤯 We used #InterpretableAI techniques to explore how these prediction models work, to achieve a deeper understanding of TCR–epitope interactions and learn how these computational tools can be improved. 🕵️

Publication: https://www.sciencedirect.com/science/article/pii/S2667119023000071
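
One technique from this general family of analyses, sketched under assumptions: occlusion, where each residue of a CDR3 sequence is masked in turn to see how the binding score moves. `score_binding` below is a toy stand-in, not the models studied in the paper:

```python
# Occlusion-based attribution sketch -- toy scorer, not the paper's models.
def score_binding(cdr3: str, epitope: str) -> float:
    # Placeholder scorer (ignores the epitope): fraction of hydrophobic residues.
    hydrophobic = set("AILMFWV")
    return sum(r in hydrophobic for r in cdr3) / len(cdr3)

def occlusion_importance(cdr3: str, epitope: str, mask: str = "X") -> dict:
    # Importance of position i = score drop when residue i is masked.
    base = score_binding(cdr3, epitope)
    return {
        i: base - score_binding(cdr3[:i] + mask + cdr3[i + 1:], epitope)
        for i in range(len(cdr3))
    }

imp = occlusion_importance("CASSLAPGATNEKLFF", "GILGFVFTL")
for pos, delta in sorted(imp.items(), key=lambda kv: -abs(kv[1]))[:3]:
    print(f"position {pos}: score change = {delta:+.3f}")
```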

Interpretable AI really wants to understand what neurons in LLMs are doing. But this effort is very likely to fail – and it's not the right approach to understand what AI is doing and why.

Like, today, there's weirdly a lot of press about how OpenAI just showed that "Language models can explain neurons in language models" (https://openai.com/research/language-models-can-explain-neurons-in-language-models). But look at the metrics – this was a failed effort. GPT-4 *cannot explain* what neurons in GPT-2 are doing.

More importantly, single-unit interpretability in LLMs is not the same as understanding what LLMs as a whole are doing and why. Even if you did understand when a handful of units activate, you will never be able to stitch these together into a general understanding of why an LLM says the words that it does.

LLMs may someday be able to explain themselves in plain language. But describing (in plain language) when each neuron fires is not going to get us there.

#interpretableAI #LLMs #openai

Language models can explain neurons in language models

We use GPT-4 to automatically write explanations for the behavior of neurons in large language models and to score those explanations. We release a dataset of these (imperfect) explanations and scores for every neuron in GPT-2.
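
The scoring recipe, in miniature: simulate the neuron's activations from the explanation alone, then compare the simulated trace against the real one. A toy sketch with a keyword lookup standing in for GPT-4 and correlation as the comparison (illustrative only; the data below is made up):

```python
# Toy version of explanation scoring: simulate-then-compare.
import numpy as np

tokens = ["the", "movie", "was", "great", "and", "the", "acting", "superb"]
real_activations = np.array([0.0, 0.1, 0.0, 0.9, 0.0, 0.0, 0.2, 0.8])

def simulate(explanation_keywords: set, tokens: list) -> np.ndarray:
    # Stand-in for "GPT-4 predicts activations given the explanation".
    return np.array([1.0 if t in explanation_keywords else 0.0 for t in tokens])

sim = simulate({"great", "superb"}, tokens)
score = np.corrcoef(real_activations, sim)[0, 1]
print(f"explanation score (correlation): {score:.2f}")
# The post's point: across most GPT-2 neurons, scores like this stay low.
```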

“Why is it that neurons sometimes align with features and sometimes don't? Why do some models and tasks have many of these clean neurons, while they're vanishingly rare in others?

In this paper, we use toy models — small ReLU networks trained on synthetic data with sparse input features — to investigate how and when models represent more features than they have dimensions.”

https://transformer-circuits.pub/2022/toy_model/index.html
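
That toy setup is small enough to sketch directly: n sparse features pushed through m < n dimensions and reconstructed with a ReLU. A compact PyTorch version, simplified from the paper's setup (the paper also weights features by importance):

```python
# Simplified superposition toy model, after the transformer-circuits paper.
import torch

n_feats, n_dims, sparsity = 20, 5, 0.95
W = torch.randn(n_dims, n_feats, requires_grad=True)
b = torch.zeros(n_feats, requires_grad=True)
opt = torch.optim.Adam([W, b], lr=1e-2)

for step in range(2000):
    # Synthetic sparse data: each feature is zero with probability `sparsity`.
    x = torch.rand(256, n_feats) * (torch.rand(256, n_feats) > sparsity)
    # h = W x, then reconstruct with ReLU(W^T h + b).
    x_hat = torch.relu(x @ W.T @ W + b)
    loss = ((x - x_hat) ** 2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()

# If many more than n_dims columns of W end up near unit norm, the model
# has packed extra features into its 5 dimensions: superposition.
norms = W.norm(dim=0)
print((norms > 0.5).sum().item(), "features represented in", n_dims, "dims")
```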

#AnthropicAI #InterpretableAI #superposition