"Our view is that #reliability lags #capability and that reliability will remain a barrier to deployment unless researchers and developers focus effort on improving reliability as a separate dimension from accuracy."

https://www.normaltech.ai/p/new-paper-towards-a-science-of-ai

#llm #ai

New Paper: Towards a science of AI agent reliability

Quantifying the capability-reliability gap

AI as Normal Technology