Mastodawn

RT @AtlasInference: TRANSLASATION: DGX Spark hat gerade für Qwen3.6-35B mit @AtlasInference auf @sparkarena über 200 Token pro Sekunde erreicht 🔥

mehr auf Arint.info

#AIInnovation #AtlasInference #DGXSpark #LLMPerformance #Qwen36 #TokenSpeed #arint_info

https://x.com/AtlasInference/status/2055716965071663385#m

Arint - SEO+KI (@[email protected])

RT @AtlasInference: TRANSLASATION: DGX Spark hat gerade für Qwen3.6-35B mit @AtlasInference auf @sparkarena über 200 Token pro Sekunde erreicht 🔥 <a href="https://arint.info/@Arint/116593582009008646">mehr</a> auf <a href="https://arint.info/">Arint.info</a> #AIInnovation #AtlasInference #DGXSpark #LLMPerformance #Qwen36 #TokenSpeed #arint_info <a href="https://x.com/AtlasInference/status/2055716965071663385#m">https://x.com/AtlasInference/status/2055716965071663385#m</a>

Mastodon Glitch Edition

Donweb Media May 7

Atlas: 103 tok/s en un LLM de 35B, ahora open source

¿Tu stack de inferencia LLM llega a 100 tokens/segundo? Atlas open source en Blackwell lo hace con Qwen3.6-35B. Benchmarks, comparativa con vLLM y cómo ...

https://blog.donweb.com/atlas-motor-inferencia-llm-open-source-qwen/

#atlasinference #qwen36 #inferencialocal #vllmalternativa #nvidiablackwell

Atlas: motor de inferencia LLM open source 103 tok/s

¿Tu stack de inferencia LLM llega a 100 tokens/segundo? Atlas open source en Blackwell lo hace con Qwen3.6-35B. Benchmarks, comparativa con vLLM y cómo ...

Blog Donweb