✨ 1:1 text-audio token alignment
✨ Precise prosody & timing control
✨ Multilingual (DE/EN/FR/ES/JA/AR)
✨ Built on Llama 3.2 (1B/3B)
🔗 https://github.com/HumeAI/tada
| Website | https://www.aime.info |
| https://www.linkedin.com/company/a-i-m-e/ | |
| Blog | https://www.aime.info/blog/ |
| Location | Berlin |
Instant LLM adaptation via text prompts? 🧠⚡️
SakanaAI's new Text-to-LoRA (T2L) uses a hypernetwork to generate task-specific LoRAs from simple text descriptions—no expensive fine-tuning required.
✅ Compresses 100s of adapters
✅ Generalizes to unseen tasks
✅ ICML 2025 Paper & Code: https://github.com/SakanaAI/text-to-lora
DeepSeek OCR 2 is a 3B VLM that reads documents like humans do. "Visual Causal Flow" dynamically reorders tokens by semantic meaning, not left-to-right, unlocking 91.09% accuracy on complex layouts.
Invoice parsing • contract analysis • archival digitization • form extraction
Fully open source (Apache 2.0).
Alibaba released Qwen3-TTS, a new text-to-speech model with discrete multi-codebook LM architecture under Apache license. Features 97ms synthesis latency, 3-second voice cloning, and 10-language support including German. Available on Hugging Face and ModelScope.

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice...
NVIDIA just dropped PersonaPlex - a speech-to-speech model that lets you control AI personas through text prompts AND voice conditioning! 🎙️✨
🔥 Real-time, full-duplex conversations with consistent character
🔥 Natural latency + multiple voice embeddings (NAT/VAR)
🔥 Perfect for customer service, assistants & immersive experiences
Z.AI just released GLM-4.7-Flash - a 30B-A3B MoE model that dominates the 30B parameter class!
🔥 Key benchmarks:
✅ 91.6% on AIME 25 (beats GPT-OSS-20B)
✅ 75.2% on GPQA
✅ 59.2% on SWE-bench Verified (3x better than Qwen3!)
Perfect balance of power & efficiency for enterprise deployment. Supports vLLM, SGLang & native tool integration.
Z.AI released GLM-Image, an innovative image generation model that establishes new benchmarks in specific application areas through its hybrid architecture.