In a methods / #DigitalHumamities class next semester, I want to cover basic corpus creation. Especially, I’ll probably focus on #OCR/#HTR/#ATR and #WebScraping. I find it incredibly hard to find good papers that can serve as a general introduction into these topics. All I find are either practical tutorials, or very specialized papers about specific approaches. Do you have any favorite readings about how to get to a text corpus in DH in the first place? Please share!
📣Have you seen our new #OpenMethodd blogpost from the #dhd2024 Workshop?
Linked Data from TEI (LIFT): A Teaching Tool for TEI to Linked Data Transformation
With "Introduction by OpenMethods guest editors Cristian Santini and Sebastian Still (DHd2024, Passau)” #DigitalHumamities #wisskomm #tei
➡️ https://openmethods.dariah.eu/2024/03/04/lift/
Follow us and never miss an update 😍