Just make them talk to each other: take each bot's response and wrap it with something like “I was thinking about <response>, do you have a recommendation?” Then feed that wrapped message into the next one, in a giant loop of fast food bots…
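Roughly what that loop could look like, as a minimal sketch. The bots here are plain Python callables standing in for whatever chat interface you actually have; `relay`, `wrap`, and `canned_bot` are made-up names for illustration, not any real bot API.

```python
from typing import Callable, List

# Assumption: a "bot" is anything that takes a prompt string and returns a reply.
Bot = Callable[[str], str]


def wrap(response: str) -> str:
    """Wrap one bot's reply in the template from the post."""
    return f"I was thinking about {response}, do you have a recommendation?"


def relay(bots: List[Bot], opener: str, rounds: int) -> List[str]:
    """Pass a message around the bots in order, wrapping each reply."""
    transcript: List[str] = []
    message = opener
    for turn in range(rounds):
        bot = bots[turn % len(bots)]   # round-robin over the bots
        response = bot(message)
        transcript.append(response)
        message = wrap(response)       # becomes the next bot's prompt
    return transcript


def canned_bot(reply: str) -> Bot:
    """Hypothetical stand-in bot that always answers with the same text."""
    return lambda prompt: reply
```

With two stand-ins, `relay([canned_bot("burger special"), canned_bot("taco special")], "hi", 3)` alternates between them for three turns. The `rounds` cap is there so the "giant loop" actually stops.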
You can access the Windows 11 Copilot API easily, but since MS has basically unlimited compute, I never bothered to make a token-burning program. Tokens cost them truly nothing.
First have the LLM write a Python script that translates images into high-resolution ASCII art. Have the script identify objects it finds in the art, given as an input variable. Point that script at Captchas. Profit?
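For the image-to-ASCII step only, a bare-bones sketch: it maps grayscale values (0 to 255) to characters by brightness. It assumes the image has already been loaded as a list of pixel rows (an imaging library would be needed for that part, not shown), and it makes no attempt at the object-identification step. `RAMP` and `to_ascii` are names picked for this example.

```python
from typing import List

# Characters ordered from darkest pixel (dense glyph) to brightest (blank).
RAMP = "@%#*+=-:. "


def to_ascii(pixels: List[List[int]]) -> str:
    """Convert rows of grayscale values (0-255) into lines of ASCII art."""
    last = len(RAMP) - 1
    lines = []
    for row in pixels:
        # Scale each 0-255 value to an index into RAMP.
        lines.append("".join(RAMP[value * last // 255] for value in row))
    return "\n".join(lines)


if __name__ == "__main__":
    # Tiny made-up 2x2 "image": black, white / mid-gray, black.
    print(to_ascii([[0, 255], [128, 0]]))
```

"High resolution" here just means more pixels per row and a longer ramp; the mapping stays the same.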