🤖📝 AB Test Your Agent 📝🤖
And what's a test without a grade?
Couple of things we can tighten up, but the tasks were completed as requested.
#engineering #ai #devops #devopsendgamewithallthecheatcodesandallthepeace #abtestyouragentsunderstanding
🤖📝 AB Test Your Agent 📝🤖
And what's a test without a grade?
Couple of things we can tighten up, but the tasks were completed as requested.
#engineering #ai #devops #devopsendgamewithallthecheatcodesandallthepeace #abtestyouragentsunderstanding
🤖📝 AB Test Your Agent 📝🤖
One thing I've noticed is that the CLAUDE.md file definitely has an impact on how well they use the system I bolt on top of them.
So, one does what one does in situations like this :: A/B test your CLAUDE.md
Had one of them synthesize an update based on the guides I was producing for my former team to consume
[may as well get my moneys worth out of them]
Had it create a test to.
Spawn a new agent and let it loose
===
"""
read the node oculus-proficiency-test-20260122-a and perform the tasks described
"""
#engineering #devops #devopsendgamewithallthecheatcodesandallthepeace #ai #abtestyouragentsunderstanding