Issue 12 - focus 🔎

Simon Gabay, Ariane Pinche, Peter Nahon, Alix Chagué, Pauline Jacsont, Élodie Paupe, Jean-Claude Rebetez, Maxime Humeau, Christine Payot, Thibault Maillard, Yvan Jauregui, Elina Leblanc and Loraine Chappuis:

Lire avant de faire lire. Réflexions philologiques sur la reconnaissance automatique de texte pour les manuscrits modernes français
https://doi.org/10.4000/15ick

In the French-language domain, manuscripts written after the Middle Ages remain the last type of document that automatic text recognition tools do not handle correctly. Although models have already been published, their performance and documentation remain unsatisfactory, largely because of the difficulties raised by the considerable evolution of the documents themselves over the centuries, and hence the diversity of forms to be handled. After describing the problem from a philological point of view, we offer some preliminary reflections on the transcription of modern documents, along with a new model intended to improve researchers' working conditions until a fully satisfactory solution can be designed.

#HumanitésNumériques #transcription #OCR #HTR


Fact check: #SMR are not the same as #HTR! High-temperature reactors have failed & have produced the largest volume of #Atommüll (nuclear waste) in Germany #THTR #Hamm & #AVR #Jülich. 152 AVR #Castor casks are to be moved to #Ahaus, where the permit also expires in 2036 de.wikipedia.org/wiki/Hochtem...

RE: https://bsky.app/profile/did:plc:l4rhb3lspoglq2xo2pmqffpx/post/3mgq6lwwobk26


We’re looking for a Research Data Engineer (m/f/x) (3.5 years). If you have a #DigitalHumanities profile with experience in #TEI encoding, #OCR / #HTR, and #IIIF (or any of those and are willing to learn the rest), get in touch! The full-time position can be split, so if (for whatever reason) you’re interested in part-time work, we’re happy to discuss this. Boosts welcome.
https://jobs.ruhr-uni-bochum.de/jobposting/a605b2652e32bf86489d18e09c0709d084d759f41

Interesting phenomenon: rule-breaking letter decorations in Torah scrolls. Sure, even back then people were thinking kindly of the poor folks with the #HTR workflows and the situation with non-Latin scripts. #MultilingualDH #DHd2026

you go on and off for years to improve your reading skills of #Kurrent / #Sütterlin, even trying to adopt the latter as your "secret" handwriting and then: first comes #AI and seems to make all of it redundant as #HTR improves incredibly; then you go to an exhibition at #DHMBerlin and discover that to read #OttovonBismarck (or his secretary) it was all useless in the first place as he seems to have written in a nice, simple #Lateinschrift

#archives #primarySources #histodons #HistodonsDE

Here, a notebook from the excavations at the Agora in the '30s (agathe.gr). The first image shows the llama.cpp server with the screenshot of a notebook. Second and third shots are side-by-side comparisons of the page and the resulting transcription (using Jason's prompt). #dh #handwriting #htr #ocr #gemma #histodons #archaeology

Today is *definitely* one of those days where I feel a strong kinship with crime drama authors as a pen & paper RPG storyteller.
Because my last couple search engine queries would *definitely* seem suspicious without context. 🥴

"What's the all-cause mortality rate in Northern Ireland?"
"How long after death does a body get cold?"
"How long after death do people still bleed?"
"Can you buy blood in the UK?"
"Can you sell blood in the UK?"
...

#VtM #HtR #WorldOfDarkness

Bookmarked: CoMMA: thousands of medieval manuscripts finally transcribed | Inria https://www.inria.fr/en/comma-medieval-manuscripts-transcribed #HTR

Transcribing thousands of medieval manuscripts by hand would be a monumental undertaking. Fortunately, researchers in computational humanities at the Inria Paris Centre have been able to automate the task through the use of generative AI. Their creation is CoMMA, a giant, one-of-a-kind corpus that will now be available to specialists in the humanities, revolutionising the exploration of writing from the Middle Ages.

Does anyone know good introductory tutorials on HTR? I would like to develop a model for HTR of Polish and German handwriting for my own research purposes, but I am completely lost in the Tesseract documentation.

Is the e-Scriptorium project still alive? Can it be installed and tried out on one's own computer? #HTR #OCR #eScriptorium #Tesseractocr