🤖📝 AB Test Your Agent 📝🤖

And what's a test without a grade?

Couple of things we can tighten up, but the tasks were completed as requested.

#engineering #ai #devops #devopsendgamewithallthecheatcodesandallthepeace #abtestyouragentsunderstanding

One thing I've noticed is that the CLAUDE.md file definitely has an impact on how well the agents use the system I bolt on top of them.

So, one does what one does in situations like this :: A/B test your CLAUDE.md

Had one of them synthesize an update based on the guides I was producing for my former team to consume

[may as well get my money's worth out of them]

Had it create a test too.

Spawn a new agent and let it loose

===

"""
read the node oculus-proficiency-test-20260122-a and perform the tasks described
"""

#engineering #devops #devopsendgamewithallthecheatcodesandallthepeace #ai #abtestyouragentsunderstanding