152 Followers
184 Following
86 Posts
Tech Lead at (but not speaking for) the V&A
Web & Data Wrangler, IIIF & Linked Art Herder.
Favourite Biscuits: Mikado (Irish) - Anytime, Lebkuchen (German) - Christmas
Some #BFINationalArchive work profiled for World Digital Preservation Day #wdpd2024 - first our Netflix #digipres featuring important acronyms like IMF, IMP, SMPTE, MXF… https://www.bfi.org.uk/features/archiving-netflix-bfi-national-archive
Archiving Netflix: how and what we preserve from the streamer’s programming

In 2022, the BFI National Archive announced a partnership with Netflix for a selection of their shows to be preserved as part of the national collection. Two years on, we look at highlights from the selections so far, and the technology behind preserving them.

BFI

News from the British Newspaper Archive on twitter:

'We’re delighted to announce that in partnership with the @britishlibrary, we have released over 1 million more free to view newspapers pages on The Archive, bringing our total of free to view pages to over 4 million. Find out more: https://blog.britishnewspaperarchive.co.uk/2024/09/19/one-million-new-free-to-view-newspaper-pages/ ' #FreeToViewNewspapers

Explore Over One Million New Free to View Newspaper Pages

In March 2025, the RUNIP project @ruhr-uni-bochum.de will host the conference “Words in Numbers – Data-Driven Approaches to Texts in the Humanities and Social Sciences.” Keynote speakers include @jerielizabeth and Jo Guldi 🙌. The call for papers is now open, inviting especially early career researchers to submit proposals for short talks or posters. Check it out and join us! https://runip-projekt.ruhr-uni-bochum.de/words_in_numbers.html (English CfP is in the linked PDF at the end) #TextAsData #DigitalHumanities
@awinkler I think that would be for the institutions, as it's so open to interpretation as to what the definition is. I'm doubtful enough on my site assigning any figures (open to correction!) but it's hopefully clearly my view, not official stats.
@awinkler Thanks Alexander! It was a random idea (I have a vague sense of different institutions scale of collection size, but have never seen it depicted). An API would be great, but I'd also just take a standard stats score card for the number of 'artefacts' or records at each institution, as even finding that on a website proved hard to do for some institutions.
Some other obvious issues I need to fix on colours changing between charts, consistency of naming, treemaps not interactive, archives stats approach, cataloguing progress more clear, donut charts purpose. Apart from all that, hopefully interesting!
I think it broadly follows the top level numbers set out in the Towards a National Collection digital collections audit by Gosling K., McKenna G. and Cooper A., although as they note, the size of some collections does overwhelm the stats. https://museumdata.uk/wp-content/uploads/2023/11/Digital-Audit.pdf
The numbers are from my personal interpretation of an institutions website (where available) or annual report (where available), and the classifications likewise are my opinions, undoubtedly it will have errors so happy to update if anyone wants to get in touch.
Totally unofficially and very work in progress (don't read too much into any of the 'spurious accuracy' numbers), some notebooks attempting to visualise what a UK National Collection (of heritage artefacts) looks like in numbers (and cataloguing progress): https://atiro.github.io/national-collection-visualisation/intro.html
UK National Collections Dashboard — Cultural Heritage Collections Data Infrastructure

The Access and mediation working group of the Society of Swiss Archivists has just published a whitepaper on the benefits and opportunities of machine learning for improving access to archives.
It's available in English here: https://vsa-aas.ch/wp-content/uploads/2024/08/MachineLearning_im_Archiv_Whitepaper_2024-08-08_en.pdf