@thomaspadilla at #dariah2023 pointed to us the great effort put in enhancing the accessbility of data from libraries, archives and museums for #ML on #huggingface with

#BigLAM

https://huggingface.co/biglam

it's of course #openaccess and anyone can participate!

biglam (BigLAM: BigScience Libraries, Archives and Museums)

🤗 Hugging Face x 🌸 BigScience initiative to create open source community resources for LAMs.

I'm still working on (slowly 😅)adding more datasets to the
Hugging Face hub as part of #BigLAM
📕 Early Printed Books font detection: https://huggingface.co/datasets/biglam/early_printed_books_font_detection
🖼️ V4Design Europeana style dataset: https://huggingface.co/datasets/biglam/v4design_europeana_style_dataset
🔎 More datasets: https://huggingface.co/biglam
biglam/early_printed_books_font_detection · Datasets at Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.