
My master's student Lukáš Eigler just defended his thesis (co-supervised with David Hurych from Valeo.ai) 🎉 Congrats!

#NLP metric validation needs 🐌💰 human judgment data. Our fix: generate synthetic data for metric validation instead. ✅ Tested on MT, QA, summarization.

To appear at #ACL2026 Student Research Workshop:
https://arxiv.org/abs/2603.09403

#NLProc #MachineLearning

LLM as a Meta-Judge: Synthetic Data for NLP Evaluation Metric Validation

Validating evaluation metrics for NLG typically relies on expensive and time-consuming human annotations, which predominantly exist only for English datasets. We propose LLM as a Meta-Judge, a scalable framework that utilizes LLMs to generate synthetic evaluation datasets via controlled semantic degradation of real data, replacing human judgment. We validate our approach using meta-correlation, measuring the alignment between metric rankings derived from synthetic data and those from standard human benchmarks. Experiments across Machine Translation, Question Answering, and Summarization demonstrate that synthetic validation serves as a reliable proxy for human judgment, achieving meta-correlations exceeding 0.9 in multilingual QA, and proves to be a viable alternative where human judgments are unavailable or too expensive to obtain. Our code and data are publicly available at https://github.com/eiglerl/meta-judge.

arXiv.org
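
As a rough illustration of the meta-correlation idea in the abstract above, the sketch below compares how two sources rank the same set of metrics. It assumes Spearman rank correlation as the agreement measure (the paper may use a different coefficient), and the function names, metric names, and scores are all hypothetical, not taken from the paper or its repository.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for pos in range(start, end + 1):
            ranks[order[pos]] = shared
        start = end + 1
    return ranks


def pearson(xs, ys):
    """Plain Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def meta_correlation(synthetic_scores, human_scores):
    """Spearman correlation between two per-metric quality scores.

    Both arguments map a metric name to how well that metric agreed with
    the reference labels (synthetic or human). Only shared metrics count.
    """
    names = sorted(set(synthetic_scores) & set(human_scores))
    synthetic = average_ranks([synthetic_scores[n] for n in names])
    human = average_ranks([human_scores[n] for n in names])
    return pearson(synthetic, human)


# Made-up numbers for illustration only, not results from the paper.
synthetic = {"metric_a": 0.41, "metric_b": 0.52, "metric_c": 0.63, "metric_d": 0.74}
human = {"metric_a": 0.30, "metric_b": 0.38, "metric_c": 0.55, "metric_d": 0.50}
print(round(meta_correlation(synthetic, human), 2))  # 0.8
```

A value near 1 would mean the synthetic data orders the metrics almost the same way the human benchmark does; here the two sources disagree only on the order of the top two metrics.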


UKP Lab Jun 3

🔗 𝗥𝗲𝗹𝗮𝘁𝗲𝗱 𝗥𝗲𝘀𝗼𝘂𝗿𝗰𝗲𝘀
ARR Data Collection: https://arr-data.aclweb.org/

Dagstuhl Seminar 2024 on Peer Review: https://www.dagstuhl.de/en/seminars/seminar-calendar/seminar-details/24052

#NLP #NLProc #PeerReview #MachineLearning #ArtificialIntelligence #NLPeer #ARR #EACL2026 #OpenScience #OpenData #ResearchData #AIResearch #LanguageTechnology #UKPLab #TUDarmstadt

ACL Rolling Review Data Collection (ARR-DC)

Collecting and curating a large-scale dataset of peer reviews and associated metadata from the ACL community.

ARR Data Collection

Sophie Schneider Jun 2

Our #PanoramaText series continues on 🗓️ June 22 with the workshop “Der interaktive Vergleich unterschiedlicher Textfassungen eines Werkes mittels LERA” (interactive comparison of different text versions of a work using LERA), in which Marcus Pöckelmann will present how to collate different versions of a text with the tool LERA (https://lera.uzi.uni-halle.de).

The workshop takes place in person at FU (Fabeckstr. 32, Room 106).

👉 Registration link: https://www.it.fu-berlin.de/unsere-services/kompetenzentwicklung/fortbildungen/workshops/E-Research/2026-06-22-LERA.html

#nlproc #digitalhumanities #cls #multilingualDH

LERA

kristallpirat Jun 1

RE: https://chaos.social/@kristallpirat/116606625977516503

To humans, "§ 92 I ZPO" and "§ 92 Abs. 1 ZPO" are identical.
To most search systems they are not, which is one reason normalization is needed.

#SchrifttumsLinguistik #LegalTech #Rechtsinformatik #NLProc #InformationRetrieval #DigitalesHandwerk #jurabubble
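
A minimal sketch of the kind of normalization the post above refers to, assuming the only variant to handle is a Roman-numeral paragraph ("I") that should become the explicit "Abs. 1" form. The pattern and the function names are hypothetical; real German legal citations have many more variants (sentences, numbers, ranges) that this does not cover.

```python
import re

ROMAN_VALUES = {"I": 1, "V": 5, "X": 10}

# "§", a section number, a Roman numeral, then the start of a statute
# abbreviation. The lookahead keeps the abbreviation itself untouched.
CITATION = re.compile(r"§\s*(\d+[a-z]?)\s+([IVX]+)\s+(?=[A-ZÄÖÜ])")


def roman_to_int(numeral):
    """Convert a small Roman numeral such as 'IV' to an integer."""
    total = 0
    for char, following in zip(numeral, numeral[1:] + " "):
        value = ROMAN_VALUES[char]
        # A smaller value before a larger one is subtracted (IV = 4).
        total += -value if ROMAN_VALUES.get(following, 0) > value else value
    return total


def normalize_citation(text):
    """Rewrite '§ 92 I ZPO' style citations as '§ 92 Abs. 1 ZPO'."""
    def expand(match):
        section, paragraph = match.group(1), roman_to_int(match.group(2))
        return f"§ {section} Abs. {paragraph} "
    return CITATION.sub(expand, text)


print(normalize_citation("§ 92 I ZPO"))       # § 92 Abs. 1 ZPO
print(normalize_citation("§ 92 Abs. 1 ZPO"))  # § 92 Abs. 1 ZPO
print(normalize_citation("§ 1 InsO"))         # § 1 InsO
```

Applying the same function to both documents and queries before indexing means the two spellings map to one search key. The whitespace required after the numeral is what keeps abbreviations that begin with I, V, or X (such as InsO) from being misread as a paragraph number.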

Tatjana Scheffler May 30

RE: https://fediscience.org/@tschfflr/116659137954249730

I guess that’s the butt end of “just throw in the data, LLM can write the paper no problem”. I really appreciate the intentions behind the ARR system but it doesn’t work. I first realized just how broken it was several years ago as area chair when some of my reviewers were undergraduate students #nlproc

Long Live Moonjoy May 28

welp, I got laid off.

if anyone you know is looking for a slightly-used computational linguist -- remote or in #Syracuse, NY -- get in touch. I'm also open to "data science" positions if they're "languagey" #GetFediHired #NLProc #NLP


UKP Lab May 21

Learn more about Phu and his work: https://phusroyal.github.io/

Welcome to the team, Phu! 👋

#UKPLab #TUDarmstadt #MBZUAI #NLP #NLProc #MechanisticInterpretability #LLMs #AIInterpretability

Homepage - PhusRoyal

SIGGEN May 20

📢 Reminder: we accept 2-page workshop proposals for #INLG2026 until May 25th! More details on the website: https://2026.inlgmeeting.org/calls.html
#NLProc #INLG

INLG2026

The 19th International Natural Language Generation Conference is scheduled to be held in Utrecht, the Netherlands from October 17 to 21, 2026.

Michal Ptaszynski May 18

🚀 ML-Ask Official v0.5 is on PyPI!

This is an official Python release of my emotion analysis system for Japanese.

✓ 10 emotion classes, ~4,700 dictionary entries
✓ emoji, kaomoji, gyaru-go, katakana
✓ 50k sentences/sec (100k parallel)
✓ Streamlit app with JA/EN toggle
✓ CLI + streaming + multiprocessing

📦 pip install mlask-official
🧪 Live demo: mlask-official.streamlit.app
🐙 github.com/ptaszynski/mlask-official

#JapaneseNLP #感情分析 #NLProc #AffectiveComputing