the noise suppression model I am rolling out has a ~10% failure rate with regard to perfect suppression of dog barks, but a ~50% failure rate with regard to perfect suppression of cat meows. this, i believe, aligns with customer expectations
@nickappleton surprisingly, these are comparatively easy scenarios that it generally performs okay at, though they are probably rare. We don't have chips or trains, but we have "vacuum cleaner" (20% less than perfect) and "sirens" (25% less than perfect), which should be a good approximation.
the real trouble is interfering speakers for the most part, humans sound a lot like other humans, annoyingly
@nickappleton I don't have singing validation set numbers for this model, it was doing fine on emotional data, though, which is similar usually
we introduce 10ms latency, it's a prod RTC model and more is intolerable