
Alibaba gave Qwen3.7-Max a kernel optimization task on a hardware platform the model had never encountered before. No documentation or profiling data. No example kernels for the architecture. Just a task description, an existing implementation, and an evaluation script. The model ran for 35 hours. It made 1,158 tool calls. It wrote, compiled, profiled, and rewrote the kernel repeatedly, diagnosing failures, fixing bugs, identifying blocks, and redesigning the architecture multiple times without anyone watching. After 30 hours it was still finding meaningful improvements. The final result was a 10x speedup over the reference implementation. For context: GLM 5.1 ran the same task and reached 7.3x. Kimi K2.6 reached 5x. DeepSeek V4 Pro reached 3.3x. The models that stopped early did so because they issued no tool calls for five consecutive rounds, they concluded they couldn't make further progress and stopped. Qwen3.7-Max didn't stop.
Qwen3.7-Max Ran for 35 Hours on Unknown Hardware and Achieved a 10× Speedup
https://firethering.com/alibaba-qwen3-7-max-autonomous-agent/
#HackerNews #Qwen3.7 #Max #Speedup #Hardware #AI #Technology

Alibaba gave Qwen3.7-Max a kernel optimization task on a hardware platform the model had never encountered before. No documentation or profiling data. No example kernels for the architecture. Just a task description, an existing implementation, and an evaluation script. The model ran for 35 hours. It made 1,158 tool calls. It wrote, compiled, profiled, and rewrote the kernel repeatedly, diagnosing failures, fixing bugs, identifying blocks, and redesigning the architecture multiple times without anyone watching. After 30 hours it was still finding meaningful improvements. The final result was a 10x speedup over the reference implementation. For context: GLM 5.1 ran the same task and reached 7.3x. Kimi K2.6 reached 5x. DeepSeek V4 Pro reached 3.3x. The models that stopped early did so because they issued no tool calls for five consecutive rounds, they concluded they couldn't make further progress and stopped. Qwen3.7-Max didn't stop.
37x Speedup in Lattice Boltzmann Cylinder Flow
https://github.com/alikamp/Parks-KPBM-Scaling
#HackerNews #LatticeBoltzmann #Speedup #ComputationalFluidDynamics #Research #Innovation

Resolution robustness of vortex shedding in Lattice Boltzmann cylinder flow: a scaling study for reduced-cost simulation. - alikamp/Parks-KPBM-Scaling
Massive speed improvements. The last benchmarking led me to reworking the code and I got some major improvements:
1m25 seconds for a 1828 pages catalog (previously 5 minutes)
32 seconds for a bible (719 pages Arabic), previously 47 secons.
Identical output.
Version 5.3.22
In Orbit You Have to Slow Down to Speed Up
https://www.wired.com/story/in-orbit-you-have-to-slow-down-to-speed-up/
#HackerNews #InOrbit #SlowDown #SpeedUp #SpaceTravel #WiredStory