EsoLang-Bench: Evaluating Genuine Reasoning in LLMs via Esoteric Languages

https://esolang-bench.vercel.app/

#HackerNews #EsoLangBench #GenuineReasoning #LLMs #EsotericLanguages #AIresearch

EsoLang-Bench: Evaluating LLMs via Esoteric Programming Languages

EsoLang-Bench: A benchmark of 80 problems across 5 esoteric languages to evaluate genuine reasoning in LLMs.