Last week, the 1st GeoAI conference took place at #UGent! I had the opportunity to attend interesting talks, engage in discussions about the future of the domain, host a workshop about the #DarkSideOfGeoAI, and, last but not least, present our work on #VISQAM, funded by #NFDI4Earth and made 100% in-house at IFGI, Uni Münster.
Check out our proceedings paper about VISQAM, an open dataset that comprises 1200 thematic geographic maps annotated with 4500+ QA pairs: https://doi.org/10.5281/zenodo.20273244.
Introducing the VISQAM Dataset: Toward Automated Map Interpretation
We introduce VISQAM, an open dataset designed specifically for visual question-answering (VQA) on thematic geographic maps. Comprising 1200 annotated images and 4594 QA pairs from four permissive-license sources, VISQAM enables the development of models capable of interpreting and understanding the complex spatial and thematic information encoded in maps. We fine-tuned Qwen3-VL-2B-Instruct on this dataset, achieving substantial performance improvements: BERTScore-F1 increased from 0.43 (base model) to 0.72 (fine-tuned), with exact match rising from 0.0 to 0.24. Our experiments highlight the challenges of spatial and relational reasoning in map interpretation. The fine-tuned model performs best on object-related questions, while relation questions prove most difficult, indicating the need for more examples of this question type in future expansions of the dataset. As automated map interpretation can improve access to spatial information, e.g. for visually impaired users, and facilitate knowledge extraction from cartographic products, VISQAM lays the groundwork for developing more advanced VQA systems.








