Yet another #LLM #benchmark. 😉
EsoLang-Bench: Evaluating genuine reasoning in large language models via esoteric #programming languages https://esolang-bench.vercel.app/ #esolang #GenAI #AI
Yet another #LLM #benchmark. 😉
EsoLang-Bench: Evaluating genuine reasoning in large language models via esoteric #programming languages https://esolang-bench.vercel.app/ #esolang #GenAI #AI