ML models pass dev tests but break on real, unpredictable inputs. InferProbe lets you test the unpredictable locally. What's the most surprising real-world input that's ever broken one of your models?
ML models pass dev tests but break on real, unpredictable inputs. InferProbe lets you test the unpredictable locally. What's the most surprising real-world input that's ever broken one of your models?