Ya están disponibles los videos de nuestra participación en #EMNLP2025

Compartimos la presentación que hizo @guido_ivetta del paper: “HESEIA: un conjunto de datos comunitario para evaluar sesgos sociales en grandes modelos de lenguaje”:

🔗 https://youtu.be/nKrBH5_NIZs

También les traemos la sesión: “¿Cómo moldearán los modelos de lenguaje multimodales la industria y la sociedad?” dentro del panel de la sesión industrial, en la que participó @LucianaBenotti

🔗 https://youtu.be/Bp3NNQiFKvQ

#EMNLP2025 - Guido Ivetta

YouTube
#EMNLP2025 konferentzian gaude

And consider following the authors Sheng Lu, Ilia Kuznetsov, and Iryna Gurevych (all Ubiquitous Knowledge Processing (UKP) Lab/Technische Universität Darmstadt).

See you at the #EMNLP conference in Suzhou 🏯

(3/3)

#EMNLP2025 #UKPLab #PeerReview #LLM #AIResearch #NLProc

📊 𝗦𝗘𝗘𝗘𝗗 outperforms #GPT-4o and #Phi-4 by up to +𝟴 𝗽𝗽 across multiple datasets.

📄 𝗣𝗮𝗽𝗲𝗿: https://www.arxiv.org/abs/2509.10833
💻 𝗖𝗼𝗱𝗲: https://github.com/UKPLab/emnlp2025-automatic-error-discovery
🔗 𝗣𝗿𝗼𝗷𝗲𝗰𝘁: https://ukplab.github.io/emnlp2025-automatic-error-discovery/

Be sure to follow the authors: Dominic Petrak, Thy Thy Tran, and Iryna Gurevych from Ubiquitous Knowledge Processing (UKP) Lab/Technische Universität Darmstadt.

See you at the #EMNLP in Suzhou!

(2/2)

#NLProc #ConversationalAI #Agents #EMNLP2025

Towards Automated Error Discovery: A Study in Conversational AI

Although LLM-based conversational agents demonstrate strong fluency and coherence, they still produce undesirable behaviors (errors) that are challenging to prevent from reaching users during deployment. Recent research leverages large language models (LLMs) to detect errors and guide response-generation models toward improvement. However, current LLMs struggle to identify errors not explicitly specified in their instructions, such as those arising from updates to the response-generation model or shifts in user behavior. In this work, we introduce Automated Error Discovery, a framework for detecting and defining errors in conversational AI, and propose SEEED (Soft Clustering Extended Encoder-Based Error Detection), as an encoder-based approach to its implementation. We enhance the Soft Nearest Neighbor Loss by amplifying distance weighting for negative samples and introduce Label-Based Sample Ranking to select highly contrastive examples for better representation learning. SEEED outperforms adapted baselines -- including GPT-4o and Phi-4 -- across multiple error-annotated dialogue datasets, improving the accuracy for detecting unknown errors by up to 8 points and demonstrating strong generalization to unknown intent detection.

arXiv.org

And consider following the authors Haishuo Fang (UKP Lab), Xiaodan Zhu (Department of Electrical and Computer Engineering, Smith Engineering at Queen's University), and Iryna Gurevych (UKP Lab/ATHENE Center) if you are interested in more information or an exchange of ideas.

See you at the #EMNLP conference in Suzhou 🏯

(3/3)

#NLProc #AI #EMNLP2025 #LLMAgent

The #EMNLP2025 conference is starting in two weeks in Suzhou, #China.

Language Technology Group #Oslo will be there with at least four papers, see the thread🧵 :

#NLProc

👏 Congratulations to all authors and collaborators for their excellent work! We are looking forward to presenting these results at EMNLP 2025 in #Suzhou this November.

Stay tuned for more details!

#NLProc #MachineLearning #UKPLab #Research #EMNLP2025

🎉 𝗨𝗞𝗣 𝗟𝗮𝗯 𝗮𝘁 𝗘𝗠𝗡𝗟𝗣 𝟮𝟬𝟮𝟱 𝗶𝗻 𝗦𝘂𝘇𝗵𝗼𝘂

Researchers from the UKP Lab at Technische Universität Darmstadt and their collaborators celebrate the acceptance of 9 papers in the Main Conference and 1 paper in Findings at #EMNLP2025 in Suzhou.

Here is the full list of accepted papers 🔎

Congratulations to our colleague Max Glockner for receiving an Outstanding Reviewer Award – well-deserved recognition for his dedication to the field!🎉

And many thanks to Jonathan Tonglet for his impressions and photos.

We’re already looking forward to #EMNLP2025 in Suzhou!