Our upcoming #EACL2024 tutorial “Transformer-specific Interpretability” will focus on the trending type of interpretability that makes use of specific features of transformers for understanding LLMs, and discuss their pros & cons!
Jointly presented w/ @jaapjumelet, Michael Hanna, Afra Alishahi & @wzuidema
More info: https://projects.illc.uva.nl/indeep/tutorial/
Hope to see you in Malta!