Michael Szell

@mszll@datasci.social
2.2K Followers
1,094 Following
1.3K Posts
Targeted cooling of urban cycling networks for heat-resilient mobility
https://arxiv.org/abs/2512.11753
Cool 😬 paper on how targeted tree planting can substantially cool streets for #cycling!

Can we infer citywide traffic speeds without sensors or proprietary data?

In our #research, we explore an environment-driven approach to #traffic speed classification using #OpenStreetMap road context and Street View Imagery. By focusing on speed classes, the framework supports spatial completion across an entire urban network, including data-sparse areas.

Tested on #Berlin with OSM data quality assessed via the ohsome dashboard.

👉 https://heigit.org/new-paper-estimating-road-speed-classes-integrating-openstreetmap-and-street-view-imagery-for-missing-data-imputation/

#opendata #GIS

With #openalex being mature enough, I have finally completed #degoogling my website 🥳
https://github.com/search?q=repo%3Amszell%2Fhomepage_mszell+google&type=commits
Build software better, together

GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

GitHub
This Christmas is a great opportunity to switch together with the whole family from #WhatsApp to #Signal. Also moving them from Google photos to the new #ente family account, and doing some more #techhygiene.

Anna's Archive backed up Spotify. They got 99.9% of metadata, and 300TB of music representing 86 million tracks - original 160kbps OGG for tracks with popularity>0, and re-encoded 75kbps for popularity=0. absolutely wild project.

the metadata in particular is a hugely useful data source. MusicBrainz catalogues 5 million unique ISRCs (like ISBNs but for music releases), whereas this archive has a whopping 186 million.

https://annas-archive.li/blog/backing-up-spotify.html

Backing up Spotify

We backed up Spotify (metadata and music files). It’s distributed in bulk torrents (~300TB). It’s the world’s first “preservation archive” for music which is fully open (meaning it can easily be mirrored by anyone with enough disk space), with 86 million music files, representing around 99.6% of listens.

We tracked like 17 million train arrivals last year to see where delays happen, and this is the result 🗺️

Find out the best and worst stations, routes and times of day in our 2025 Wrapped overview: https://chuuchuu.com/2025wrapped

(on that note, we have a new website so check that out too)

When the study confirms intuition:

"We find that the number of papers cited at least as well as those appearing in high-impact factor journals vastly exceeds the number of papers published in such venues."

https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3003532

Decades on, academic journals are still useless as indicators of much of anything.

#publishing #academicchatter

Most researchers would receive more recognition if assessed by article-level metrics than by journal-level metrics

Are authors fairly judged by assessment of the prestige of the journals in which their work is published? This study compares article level metrics with journal level metrics, finding that the vast majority of influential papers are published in lower tier journals, and that more authors, regardless of demographics, would be better recognized with article level data.

Just published in JOSS: 'dython: A Set of Analysis and Visualization Tools for Data and Variables in Python' https://doi.org/10.21105/joss.09174
dython: A Set of Analysis and Visualization Tools for Data and Variables in Python

Zychlinski, S., (2025). dython: A Set of Analysis and Visualization Tools for Data and Variables in Python. Journal of Open Source Software, 10(116), 9174, https://doi.org/10.21105/joss.09174

Journal of Open Source Software
Trump has declared #fentanyl to be a "Weapon of mass destruction" because it kills ten thousands of people in the US every year. Guess what else kills so many people? #cars

How Is Public Space Shared in Your Neighborhood? 🚗🌳🛝
Ever wondered how much of your neighborhood’s public space is taken up by car parking and how that compares to green and play areas?

This interactive map lets you explore exactly that. Recently updated, it visualizes how public space is distributed between cars, greenery, and playgrounds across ALL of Berlin, offering insight into how the city allocates its shared space.

👇 Explore the map below
https://www.hanshack.com/parking/

Parkplatz oder Spielplatz

Mit dieser Karte kannst Du die Größe von Parkplätzen, Spielplätzen und Grünanlagen in deinem Kiez vergleichen.

Parking or Playground