A big problem with Apple's Private Cloud Compute is that the servers 'merely' have the same horsepower as M2 Ultra machines you can buy at retail and run on your desk, where the cloud-based LLMs they're supposed to compete with are datacenters full of orders-of-magnitude-more-powerful Nvidia GPUs. And as we've seen, even with Apple's Foundation Models, a simple request can take tens of seconds to process. There is a gulf between what Apple wants to do and the hardware it's trying to build it on