Mastodawn

Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching

Outperforms all other open- and closed-source spatial reasoning models, but still inferior to human accuracy. Uses #LMDB

CVPR 2026 · ReasonMatch-Bench (2,810 image pairs) and DCRL reach 70.5 F1, outperforming evaluated open- and closed-source MLLM baselines.