Increasingly, my number one feature hope for OS 27 is a larger context window for Apple's on-device model. There are so many features I am trying to build that I would love to run 100% locally, but I can't.

As a basic example, you literally can't use it as a meeting note generator, because the transcript of any meeting longer than 5-10 minutes is going to exceed the window.

In my other apps, I've fallen back to using Gemini for this sort of thing. But that means there's now an external connection, plus a usage-based cost that I have to offset by charging customers.

And because power users are going to generate far more cost than the average user, I have to raise the subscription price for everyone to offset those users, even though most people would be comfortably profitable for me at a lower price.

@matt_birchler it’s not a great solution, but could you have it set up so that after X basic usage the power user has to input their own AI tool API key to use their own credits? Or does Apple then consider that locked functionality, or whatever?
@andyn Yeah, I could do that. Apple would have no issue with that sort of gating. Like you say, it's just a bit messy and adds complexity to the app. Something to consider, though!
@matt_birchler @andyn Or you could sell a limited volume in the subscription and offer the rest as one-time payments?
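
For illustration only, the gating ideas in this thread could be combined into one routing decision: stay on-device when the input fits the context window, otherwise use the requests included in the subscription, then any purchased one-time credits, then the user's own API key. This is a minimal sketch, not anyone's actual implementation. Every name in it (`Route`, `UserState`, `choose_route`) and every number passed to it is hypothetical, and the context window size is a parameter because the thread does not state it.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Route(Enum):
    ON_DEVICE = "on_device"            # fits the local model, no external call
    INCLUDED_CLOUD = "included_cloud"  # hosted model, covered by the subscription
    PURCHASED_CLOUD = "purchased"      # hosted model, paid for with one-time credits
    USER_KEY_CLOUD = "user_key"        # hosted model, billed to the user's own key
    NEEDS_KEY = "needs_key"            # out of quota: ask for a key or a top-up


@dataclass
class UserState:
    # Hypothetical per-user bookkeeping for the current billing period.
    cloud_requests_this_period: int
    purchased_requests_remaining: int = 0
    own_api_key: Optional[str] = None


def choose_route(prompt_tokens: int,
                 context_window_tokens: int,
                 included_cloud_requests: int,
                 user: UserState) -> Route:
    """Pick the cheapest backend that can handle this request."""
    # Prefer the on-device model whenever the input fits its window.
    if prompt_tokens <= context_window_tokens:
        return Route.ON_DEVICE
    # Otherwise spend the quota bundled with the subscription first.
    if user.cloud_requests_this_period < included_cloud_requests:
        return Route.INCLUDED_CLOUD
    # Then any one-time credits the user has bought.
    if user.purchased_requests_remaining > 0:
        return Route.PURCHASED_CLOUD
    # Then fall back to the user's own API key, if they supplied one.
    if user.own_api_key:
        return Route.USER_KEY_CLOUD
    return Route.NEEDS_KEY
```

With made-up numbers, `choose_route(20000, 4000, 50, UserState(50))` returns `Route.NEEDS_KEY`: the transcript is too large for the local model, the included quota is used up, and there are no credits or key to fall back on. The ordering is a design choice; an app could just as reasonably let users opt into their own key before touching purchased credits.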