🚀 Ah yes, the latest tech buzzword salad! Mamba-3 is apparently speeding things up faster than it takes to blink, and now we can all rent fancy #NVIDIA #GPUs as if we're borrowing sugar from a neighbor. 🍬 Who knew "accelerated compute" was just a synonym for "we added a few more buzzwords and called it innovation"? 😂
https://www.together.ai/blog/mamba-3 #techbuzzwords #acceleratedcompute #innovation #Mamba3 #HackerNews #ngated
Mamba-3

Meet Mamba-3: the SSM built for inference. Faster than Transformers at decode, stronger than Mamba-2, and open-source from day one.

Sebastian Raschka (@rasbt)

Mamba-3 has been released, and the author sees transformer-attention hybrid architectures (Qwen3.5, Kimi Linear, etc.) as the most interesting use case for Mamba and similar models. He suggests experimenting with swapping Gated DeltaNet for Mamba-3, which now also has RoPE, in next-generation hybrids.

https://x.com/rasbt/status/2034088726997893168

#mamba3 #transformer #qwen3.5 #gateddeltanet #rope

Sebastian Raschka (@rasbt) on X

Oh wow, Mamba-3 is here! For me, the most interesting use case of Mamba and Mamba-likes are the recent transformer attention hybrid architectures (Qwen3.5, Kimi Linear, etc.) Would be interesting to swap Gated DeltaNet with Mamba-3 (which now also has RoPE) in next gen hybrids.
