The Record: Spotify disables accounts after open-source group scrapes 86 million songs from platform . “The spokesperson added that Anna’s Archive did not contact them before publishing the files. They also said it did not consider the incident a ‘hack’ of Spotify. The people behind the leaked database systematically violated Spotify’s terms by stream-ripping some of the music from the […]

https://rbfirehose.com/2025/12/25/the-record-spotify-disables-accounts-after-open-source-group-scrapes-86-million-songs-from-platform/
The Record: Spotify disables accounts after open-source group scrapes 86 million songs from platform | ResearchBuzz: Firehose

ResearchBuzz: Firehose | Individual posts from ResearchBuzz

#Spotify #music #metadata #DataRescue

'It’s the world’s first “preservation archive” for music which is fully open (meaning it can easily be mirrored by anyone with enough disk space), with 86 million music files, representing around 99.6% of listens.'

https://annas-archive.li/blog/backing-up-spotify.html

Backing up Spotify

We backed up Spotify (metadata and music files). It’s distributed in bulk torrents (~300TB). It’s the world’s first “preservation archive” for music which is fully open (meaning it can easily be mirrored by anyone with enough disk space), with 86 million music files, representing around 99.6% of listens.

Anna’s Archive: Backing up Spotify. “Anna’s Archive normally focuses on text (e.g. books and papers). We explained in ‘The critical window of shadow libraries’ that we do this because text has the highest information density. But our mission (preserving humanity’s knowledge and culture) doesn’t distinguish among media types. Sometimes an opportunity comes along outside of text. This is […]

https://rbfirehose.com/2025/12/21/annas-archive-backing-up-spotify/
Anna’s Archive: Backing up Spotify | ResearchBuzz: Firehose

ResearchBuzz: Firehose | Individual posts from ResearchBuzz

Flickr Blog: Building Flickr Archives with Data Lifeboat. “With Data Lifeboat, you can create an archive to document a specific time and place, share memories of an event, or curate a collection of perspectives from around the globe. Simply put, conscious archiving with Data Lifeboat can allow you to create and share your own slice of history with future viewers from this vast collection. Here […]

https://rbfirehose.com/2025/12/20/flickr-blog-building-flickr-archives-with-data-lifeboat/
Flickr Blog: Building Flickr Archives with Data Lifeboat | ResearchBuzz: Firehose

ResearchBuzz: Firehose | Individual posts from ResearchBuzz

St. Louis Magazine: Moonlighting librarians save the RFT’s online archive from its post-porn purge. “A newly available digital archive that encompasses much of the recent history of the Riverfront Times went live yesterday. It is the brainchild of Joshua Lawrence and Jaclyn Crow, two St. Louisans with a passion for local history…. The database currently has about 2,000 articles from the […]

https://rbfirehose.com/2025/12/06/st-louis-magazine-moonlighting-librarians-save-the-rfts-online-archive-from-its-post-porn-purge/

St. Louis Magazine: Moonlighting librarians save the RFT’s online archive from its post-porn purge | ResearchBuzz: Firehose

ResearchBuzz: Firehose | Individual posts from ResearchBuzz

Slaw: The Data Rescue Project: Preserving Government Data Is a Tech & Community Issue. “Precursors to the Data Rescue Project such as the End of Term Web Archive, which captures federal government data after presidential administration transitions, the 2017 Data Refuge Project, and the Environmental Data & Governance Initiative (EDGI), laid the groundwork for 2025 preservation efforts, but […]

https://rbfirehose.com/2025/12/01/the-data-rescue-project-preserving-government-data-is-a-tech-community-issue-slaw/

The Data Rescue Project: Preserving Government Data Is a Tech & Community Issue (Slaw) | ResearchBuzz: Firehose

ResearchBuzz: Firehose | Individual posts from ResearchBuzz

Making 10M government PDF documents searchable https://flowingdata.com/2025/11/26/making-10m-government-pdf-documents-searchable/

"The code for GovScape is open source and available on GitHub."

#OpenData #OpenGov #OCR #DataRescue #GovDocs

Making 10M government PDF documents searchable

Government organizations love to distribute documents as PDF files. They are easy to forward and to print. The problem is when you want to find and access them later among millions of other files. …

FlowingData
We often discuss how public data influences our everyday lives whether we acknowledge it or not. This week's guest article highlights your daily interactions with public data: www.datarescueproject.org/guest-post-a... #PublicData #DataRescue

Guest Post: A Day in the Life ...
Guest Post: A Day in the Life with Federal Government Data

Today, we have the fourth post in the series from Claire McKay Bowen and Aaron R. Williams to help diverse audiences understand and support the federal statistical system. Everyone living in the United States is part of this vast statistical ecosystem and benefits from it—both directly and indirectly. Check

Data Rescue Project
Förderinitiative zum Sichern gefährdeter Datenbestände und zur Datenresilienz 2025 bis 2027