🦙 The @CSVCONF is getting bigger and better every year. This time there was lots of interest, conversations and exchanges in a magnificent venue in Bologna, Italy. 🧵

#CSVCONF #CommaLlama

What a great time in Bologna for #csvconf. Reconnecting with digital friends in person, meeting new people and great ideas, all within this beautiful city and eating delicious food. Very grateful to the organizers and everyone who made it possible

Also attending the same conference in Bologna, Italy, is The Carpentries Director of Workshops and Training, SherAaron Hurt.

She is a member of the #csvconf 2025 organising team, working alongside other amazing open and data science colleagues to put together this year's programme: csvconf.com/schedule.html.

Well done SherAaron and your team! 👏

This has been a busy week of public community engagement for various members of The Carpentries Core Team!

First off, our Executive Director @drkariljordan is at #CSVCONF, sharing how The Carpentries is collaborating with GREI (Generalist Repository Ecosystem Initiative) to strengthen researcher & librarian skills in data/software management, ensuring high-quality, shareable deposits across repositories.

#csvconf talk "A Toolkit for Community-Driven Data Governance" by Jennifer Ding (https://jending.com/). Let's make "Data Governance" a verb. Working with choirs in the UK to teach them that they're data subjects and their voices are data objects and to hear their thoughts about what that means for them individually and collectively. h/t to two Moz projects: https://commonvoice.mozilla.org/ and https://datacollective.mozillafoundation.org/ (launching soon!)
Jennifer Ding

Web site created using create-react-app

#csvconf Talk by Philip Meyer of Rent Brigade on Tracking Rent Gouging During a Climate Disaster. Crowd-sourced spreadsheet of rent spikes during and after the recent fires in Los Angeles moved to automated web scraping (now open source for detecting rent gouging in your own neighbourhood: https://github.com/rent-brigade/rent-scraper) then tips on rallying the community, working with politicians, and getting heard. Epic stuff.
GitHub - rent-brigade/rent-scraper: Scrape data from rental listings

Scrape data from rental listings. Contribute to rent-brigade/rent-scraper development by creating an account on GitHub.

GitHub
The slides from yesterday's talks at #csvconf are now available on #Zenodo
📌Arcangelo Massari's "Mapping the unmapped citation landscape: how crowdsourcing will fill the citation gap": https://zenodo.org/records/17098600
📌@essepuntato "How did we get to OpenCitations: a brief history of open scholarly citations": https://zenodo.org/records/17093870
Mapping the unmapped citation landscape: how crowdsourcing will fill the citation gap

This presentation examines the critical issue of citation invisibility in scholarly publishing, particularly affecting journals in the Global South. The research reveals that among 52,000+ Open Journal Systems (OJS) journals worldwide, 79.9% are located in the Global South, with 98.8% remaining invisible to Web of Science and 94.3% invisible to Scopus. Despite this invisibility, the vast majority of these journals are legitimate scholarly publications rather than predatory journals. The presentation introduces a collaborative solution developed through a partnership between OpenCitations, the Public Knowledge Project (PKP), and the Leibniz Information Centre for Science and Technology. This initiative implements a seamless citation submission workflow integrated into OJS, enabling automatic submission of structured bibliographic metadata and citation relationships to OpenCitations through GitHub issues. The system employs a validation pipeline encompassing structural, semantic, and closure validation processes. The technical implementation utilizes CSV formats for bibliographic metadata and citation relationships, with monthly data ingestion cycles. Unlike existing predatory journal lists, the project emphasizes transparency and accountability through board governance and published evaluation criteria. The infrastructure maintains independence from proprietary platforms, utilizing OpenCitations APIs, Zenodo for permanent archival, and GitHub for workflow visibility. The potential global impact includes connecting 41,500 Global South journals and introducing millions of new citations into the global citations network, effectively democratizing the entire citation landscape. Presented at csv,conf,v9 in Bologna, Italy, on September 10, 2025.  

Zenodo
#csvconf Afternoon Keynote "Fifteen years into the open data movement" with Giorgia Lodi and Andrea Borruso. Using examples from Italy's government-published open data standards and varying levels of adherence to them to show that the regs don't always get followed, communities can sometimes fill in the gaps, but even when the regs -are- followed they exclude the role of the human (the individual nerd, the lay person, the community) by missing descriptions and explanations for codes/acronyms.
#csvconf Daniela Popova and Monika Popova showing off datahub.io, a way to go from a github repo to a data-rich website with just a few clicks; and portaljs.com, a way to build something a little deeper. Looks pretty neat and sits for me in that corner of my brain that fondly remembers Mozilla's now-abandoned Iodide project and @wlach's Irydium project.
#csvconf Dr Kari Jordan from The Carpentries talking about Bridging Communities. Oh dear, a shared doc (lol). I'd only heard about The Carpentries yesterday, so I'm excited to learn more about how they've integrated open publishing and citation into their curricula. Now I wonder if https://public-data.telemetry.mozilla.org/all-datasets.json should have DOIs...