https://www.together.ai/blog/mamba-3 #techbuzzwords #acceleratedcompute #innovation #Mamba3 #HackerNews #ngated
Sebastian Raschka (@rasbt)
Mamba-3 has been released, and the author sees Mamba and similar models as an interesting fit for transformer attention hybrid architectures (Qwen3.5, Kimi Linear, etc.). He suggests an experiment for next-generation hybrids: replacing Gated DeltaNet with Mamba-3, which now adds RoPE.

Oh wow, Mamba-3 is here! For me, the most interesting use case of Mamba and Mamba-likes is the recent transformer attention hybrid architectures (Qwen3.5, Kimi Linear, etc.). Would be interesting to swap Gated DeltaNet with Mamba-3 (which now also has RoPE) in next-gen hybrids.
https://winbuzzer.com/2026/03/18/open-source-mamba-3-arrives-to-surpass-transformer-xcxwbn/
New Mamba-3 AI Model Beats Transformers by 4%, Runs 7x Faster
#AI #AIModels #AIResearch #DeepLearning #MachineLearning #OpenSourceAI #AIInference #AIBenchmarks #Mamba3 #TogetherAI #StateSpaceModels