Mastodawn

- Anthropic says it's on the way to Recursive Self-Improvement (also, just ahead of IPO): https://www.anthropic.com/institute/recursive-self-improvement

- New Gemma 4 12b model + AI Edge app also does audio editing: https://developers.googleblog.com/bringing-gemma-4-12b-to-your-laptop-unlocking-local-agentic-workflows-with-google-ai-edge/

- NSA is using Mythos now: https://www.ft.com/content/d02d91b3-2636-454e-9442-dc7e69f51815

- Meta adding name recognition to it's smart glasses (this was always inevitable.. and it would be super useful to me but it's also distopian/surveillance state stuff): https://www.wired.com/story/meta-smart-glasses-face-recognition-nametag-connections/

#AI #AINews #anthropic #nsa #meta #gemma #audio

When AI builds itself

Our progress toward recursive self-improvement, and its implications.

Alessio Pomaro 3h ago

🧠 #Google ha rilasciato #Gemma 4 12B, che introduce il supporto alla Multi-Token Prediction (#MTP) e porta capacità multimodali avanzate su hardware consumer.

👉 Per approfondire: https://www.linkedin.com/posts/alessiopomaro_google-mtp-gemma-ugcPost-7468543000747601920-Lp1r/

___
✉️ 𝗦𝗲 𝘃𝘂𝗼𝗶 𝗿𝗶𝗺𝗮𝗻𝗲𝗿𝗲 𝗮𝗴𝗴𝗶𝗼𝗿𝗻𝗮𝘁𝗼/𝗮 𝘀𝘂 𝗾𝘂𝗲𝘀𝘁𝗲 𝘁𝗲𝗺𝗮𝘁𝗶𝗰𝗵𝗲, 𝗶𝘀𝗰𝗿𝗶𝘃𝗶𝘁𝗶 𝗮𝗹𝗹𝗮 𝗺𝗶𝗮 𝗻𝗲𝘄𝘀𝗹𝗲𝘁𝘁𝗲𝗿: https://bit.ly/newsletter-alessiopomaro

#AI #GenAI #GenerativeAI #IntelligenzaArtificiale #LLM

Zdroják 15h ago

Google DeepMind představil nový model Gemma 4 12B – a jeho největší předností je, že výkon na úrovni blízké většímu 26B modelu nabídne ve výrazně menší paměťové stopě, takže ho lze spustit lokálně na běžném laptopu s 16 GB RAM nebo unifikované paměti.

Co dělá Gemma 4 12B zajímavým?

Model přichází s unikátní „encoder-free“ architekturou, místo […]

https://zdrojak.cz/zpravicky/google-predstavil-gemma-4-12b-vykonny-ai-model-ktery-pobezi-i-na-vasem-laptopu/

Andrii Kuznietsov 18h ago

🤖💻 #Google представила відкриту ШІ-модель #Gemma 4 12B з 11,95 млрд параметрів, яка здатна працювати локально на ноутбуках із 16 ГБ відео- або уніфікованої пам'яті.

https://blog.google/innovation-and-ai/technology/developers-tools/introducing-gemma-4-12B/

Introducing Gemma 4 12B: a unified, encoder-free multimodal model

An overview of Gemma 4 12B, a model designed to bring high-performance multimodal intelligence directly to your laptop.

Google

AI_BEAR_NEWS 21h ago

📰 Google lancia Gemma 4 12B open source — multimodale e 100% locale

Il nuovo modello open source di Google analizza testo, audio e video interamente su un laptop enterprise da 16GB di RAM. Zero cloud, zero API key, zero costi di inferenza. Gemma 4 12B porta le capacità multimodali dei grandi modelli direttamente sul device, democratizzando l'AI locale per sviluppatori e aziende.

https://venturebeat.com/technology/googles-new-open-source-gemma-4-12b-analyzes-audio-video-and-runs-entirely-locally-on-a-typical-16gb-enterprise-laptop

#AI #OpenSource #Google #Gemma #AILocale #Multimodale

Firethering 21h ago

Google Built Gemma 4 12B Without Multimodal Encoders

https://firethering.com/google-built-gemma-4-12b-without-multimodal-encoders/

#gemma #google #opensource #ai #technews #gemma12B

Google Built Gemma 4 12B Without Multimodal Encoders - Firethering

Every multimodal model you've used has the same basic system. Text goes in one way, images go through a vision encoder first, audio goes through an audio encoder first, and then everything gets handed off to the language model in a form it can work with. The encoders are load-bearing and you don't just remove them.Google actually removed them.Gemma 4 12B takes raw image patches and raw audio waveforms and projects them directly into the same embedding space as text tokens. There is no vision encoder or audio encoder. One decoder handling everything.

Firethering

بجاد الأثري 21h ago

أطلقت شركة Google طراز الذكاء الاصطناعي Gemma 4 12B مفتوح المصدر، المصمم لتشغيل المهام متعددة الوسائط كالنصوص والصور والصوت محلياً على الحواسيب المحمولة العادية بذاكرة 16 جيجابايت. يتميز النموذج الجديد بنصف حجم ذاكرة طراز 26B مع تقديم أداء مماثل تقريباً، وهو أول طراز متوسط الحجم يدعم معالجة الصوت الأصلية. يتيح النموذج عمليات الاستدلال المعقدة، وهو متاح للاستخدام التجاري عبر منصات مثل Hugging Face وOllama بموجب ترخيص Apache 2.0 المفتوح.

#Google #Gemma #AI