Gabriele Sarti

@gsarti
26 Followers
41 Following
71 Posts
PhD Student in Explainable NMT at GroNLP. Interpretability ∩ NLG #NLProc. Prev: AWS, Aindo, ItaliaNLP Lab. he/him
Location: Groningen, Netherlands
Website: https://gsarti.com
Twitter: https://twitter.com/gsarti_
GitHub: https://github.com/gsarti

Huge congrats @[email protected]! πŸš€ The future of creative MT will be "made in @[email protected]"! 😎

RT @[email protected]

We're proud of our colleague @[email protected] for becoming an @[email protected] #awardee for her project INCREC – Studying the creative translation process in the intersect with technology! πŸŽ‰ https://twitter.com/univgroningen/status/1620376478130049025

πŸ¦πŸ”—: https://twitter.com/GroNlp/status/1620395831663935488

University of Groningen on Twitter

β€œGreat news for @DocTinaK and @AnaGuerberof! They both got an @ERC_Research #consolidator grant of €2 million for research into, respectively, #parenthood and the creative process of #translating. Congrats!πŸŽ€ More πŸ‘‰ https://t.co/v5o0SBfzu5 @rug_gmw @FacultyofArtsUG”

Twitter

RT @[email protected]

πŸ₯³Thrilled to announce our paper got accepted to #EACL2023!
We introduce Value Zeroing, a new interpretability method for quantifying context mixing in Transformers.

A joint work w/ me, @[email protected], @[email protected], and @[email protected]

πŸ“‘Paper: https://arxiv.org/abs/2301.12971

#NLProc #InDeep

πŸ¦πŸ”—: https://twitter.com/hmohebbi75/status/1620351439855063040

Quantifying Context Mixing in Transformers

Self-attention weights and their transformed variants have been the main source of information for analyzing token-to-token interactions in Transformer-based models. But despite their ease of interpretation, these weights are not faithful to the models' decisions as they are only one part of an encoder, and other components in the encoder layer can have considerable impact on information mixing in the output representations. In this work, by expanding the scope of analysis to the whole encoder block, we propose Value Zeroing, a novel context mixing score customized for Transformers that provides us with a deeper understanding of how information is mixed at each encoder layer. We demonstrate the superiority of our context mixing score over other analysis methods through a series of complementary evaluations with different viewpoints based on linguistically informed rationales, probing, and faithfulness analysis.

arXiv.org
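For intuition, here is a toy sketch of the core idea (my own simplification: a single self-attention mix over value vectors, not the paper's full encoder block): zero out one token's value vector, re-run the layer, and score how far each output representation moves, e.g. by cosine distance. Large movement means that token's value contributed strongly to the output.

```python
import math

def attention_layer(values, weights):
    """One attention mix: output[i] = sum_j weights[i][j] * values[j].
    values: [n][d] value vectors; weights: [n][n] attention weights."""
    return [
        [sum(weights[i][j] * values[j][k] for j in range(len(values)))
         for k in range(len(values[0]))]
        for i in range(len(values))
    ]

def cosine_distance(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    if na == 0 or nb == 0:
        return 1.0  # a zeroed-out representation counts as maximally changed
    return 1 - dot / (na * nb)

def value_zeroing(values, weights):
    """scores[i][j] = how much output i changes when token j's value
    vector is zeroed out (a toy version of the Value Zeroing score)."""
    base = attention_layer(values, weights)
    n = len(values)
    scores = [[0.0] * n for _ in range(n)]
    for j in range(n):
        ablated = [v[:] for v in values]
        ablated[j] = [0.0] * len(values[0])
        out = attention_layer(ablated, weights)
        for i in range(n):
            scores[i][j] = cosine_distance(base[i], out[i])
    return scores
```

In the paper the ablation is applied inside a full encoder block (including the MLP and residual stream), which is exactly why the resulting scores can diverge from raw attention weights.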

Shout-out to @[email protected] et al. from our #InDeep consortium for their awesome work "Quantifying Context Mixing in Transformers", introducing Value Zeroing as a new promising post-hoc interpretability approach for NLP! πŸŽ‰ Paper: https://arxiv.org/abs/2301.12971 https://t.co/RxX2LenHqv

RT @[email protected]

As Gregor Samsa awoke one morning from uneasy dreams, he found himself rewritten in Rust for performance reasons

πŸ¦πŸ”—: https://twitter.com/vboykis/status/1620072731994914819

Vicki on Twitter

β€œAs Gregor Samsa awoke one morning from uneasy dreams, he found himself rewritten in Rust for performance reasons”

Twitter

Welcome to @[email protected] πŸ€— Looking forward to promising future collaborations!

RT @[email protected]

Happy to announce that, from February 1st, I'll start a new job as a lecturer πŸ‘¨β€πŸ« at @[email protected]
Excited to start this new adventure!

πŸ¦πŸ”—: https://twitter.com/marco_zul/status/1618977505632985089

Marco Zullich on Twitter

β€œHappy to announce that, from February 1st, I'll start a new job as a lecturer πŸ‘¨β€πŸ« at @univgroningen Excited to start this new adventure!”

Twitter

Prediction: LMaaS companies will use private RNG keys for their APIs and sell LM plagiarism detection tools on subscription for $$$ to schools and universities

RT @[email protected]

#OpenAI is planning to stop #ChatGPT users from making social media bots and cheating on homework by "watermarking" outputs. How well could this really work? Here's just 23 words from a 1.3B parameter watermarked LLM. We detected it with 99.999999999994% confidence. Here's how 🧡

πŸ¦πŸ”—: https://twitter.com/tomgoldsteincs/status/1618287665006403585


Attention attribution is here! Thanks to all the awesome contributors! πŸš€

RT @[email protected]

Version 0.3.3 is finally out! πŸŽ‰ Highlights: attention attribution, new L2 norm default attribution aggregation, ruff linting (tip hat @[email protected]), improved save/reload of attributions. See release notes for usage examples: https://github.com/inseq-team/inseq/releases/tag/v0.3.3

πŸ¦πŸ”—: https://twitter.com/InseqDev/status/1616419539616694273

Release v0.3.3: Attention attribution, new aggregation, improved saving/reloading and more Β· inseq-team/inseq

What’s Changed Attention attribution (#148 ) This release introduces a new category of attention attribution methods and adds support for AttentionAttribution (id: attention). This method attribute...

GitHub

If you are interested in the topic, our new @[email protected] library greatly simplifies access to model internals, with support for the most recent gradient, attention, and (soon!) occlusion methods. Find it here: https://github.com/inseq-team/inseq
GitHub - inseq-team/inseq: Interpretability for sequence generation models πŸ› πŸ”

GitHub
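To make the idea concrete, here is a toy illustration of what attention attribution computes (my own sketch, not the Inseq API): treat attention weights as importance scores and aggregate them over heads, yielding one score per source token for a given generated token.

```python
def attention_attribution(attn, target_pos):
    """Toy attention attribution (illustrative only, not the Inseq API).

    attn: nested lists indexed [layer][head][target][source] holding
    attention weights. Returns per-source-token scores for one target
    position, averaging heads in the last layer."""
    last_layer = attn[-1]
    n_heads = len(last_layer)
    src_len = len(last_layer[0][target_pos])
    # Average the attention each head pays from target_pos to every
    # source token; higher average weight = higher attributed importance.
    return [
        sum(head[target_pos][j] for head in last_layer) / n_heads
        for j in range(src_len)
    ]
```

Head averaging is only one possible aggregation; the release notes above mention an L2-norm default instead, and the point of a library like Inseq is precisely that these aggregation choices are swappable.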