Tau² Benchmark: How a Prompt Rewrite Boosted GPT-5-Mini by 22%
https://quesma.com/blog/tau2-benchmark-improving-results-smaller-models/
#HackerNews #Tau2Benchmark #GPT5Mini #AIResearch #ModelImprovement #PromptEngineering