Now you can run AI models like Gemma 4 Directly in your phones easily with Google's Open Source App Google AI Edge Gallery , It supports multiple AI models you download in the app itself from Hugging face and it work fully Offline.
https://firethering.com/google-ai-edge-gallery-offline-llm-app/

Google AI Edge Gallery: Run LLMs Offline on Your Phone
Google AI Edge Gallery lets you run open-source LLMs straight on your phone. No cloud. Once you download the models, you're offline. You get chat, image analysis, audio transcription, prompt testing. All on-device. Newer models like Gemma 4 mean better reasoning and multimodal stuff on mobile hardware. It’s more like a sandbox where you can test, run, and compare models directly on your device.

