... I find it strange that there isn't a suite of tests used to help evaluate the effectiveness of an agent skill.
@mattiem What’s the point if passing said tests is entirely non-deterministic and seed dependent?
@dimitribouniol I don’t know! How does one evaluate if a skill “works” or not?
@mattiem Vibes 🤪