Yet another #LLM #benchmark. 😉

EsoLang-Bench: Evaluating genuine reasoning in large language models via esoteric #programming languages https://esolang-bench.vercel.app/ #esolang #GenAI #AI

EsoLang-Bench: Evaluating LLMs via Esoteric Programming Languages

EsoLang-Bench: A benchmark of 80 problems across 5 esoteric languages to evaluate genuine reasoning in LLMs.