@brian @standardebooks

Awesome initiative! Made me want to bring some people together and start producing nice ebooks from the #ELTeC collections. Alas, they have an English-only policy... 😮‍💨 https://standardebooks.org/contribute/collections-policy

I guess the best way forward would be to copy the idea for multilingual literary texts... but of course that is harder than joining a group with established structures and workflows and experience...

Great to be at the "Comparative Literature Goes Digital" session at #DH2025!

Session info here: https://www.conftool.pro/dh2025/index.php?page=browseSessions&form_session=300&presentations=show

Full programme here: https://dls.hypotheses.org/1952

Including a talk by Evgeniia Filveva, with Julia Havrylash, myself and Artjoms Šeļa, on "#Multilingual #Stylometry: The influence of corpus composition and language on the performance of authorship attribution using corpora from the European Literary Text Collection (#ELTeC)".

#ICLA #ADHO #SIG_DLS #CLS @tcdh

Time for a lunch break at #DH2025

Afterwards we will continue with the following talk as part of the mini-conference "Comparative Literature Goes Digital": "Multilingual Stylometry: The influence of corpus composition and language on the performance of authorship attribution using corpora from the European Literary Text Collection (#ELTeC)"

Currently on my first long-distance trip in a very long time, and it feels very special and exciting. The destination, South #Korea, is certainly a key factor!

And the program is packed, with four events – and associated dinners ;-) – in the next four days.

So cool to be speaking about our work in #CLS from several projects at @tcdh, including on #Zeta, #MiMoText, #ELTeC and @CLSinfra. I'm also very eager to learn more about Korean DH research!

Details: https://christof-schoech.de/activities/?_sf_s=Korea

@tcdh – Apart from the pleasure of seeing literary studies, history and stylometry come together very nicely today, the discovery of the day (so far!) is Artjoms Šeļa's plugin for #stylo called "seetrees". https://github.com/perechen/seetrees/ – It lets you see which words are associated with clusters in the dendrogram at different levels. Basically, it answers the question of *why* a cluster has been created, rather than just showing the cluster. A first test on #ELTeC-fra makes a lot of sense!

Here's the repo with the nascent #Gaelic #ELTeC corpus: https://github.com/COST-ELTeC/ELTeC-gle

Yay to #bilingual: here, English-Gaelic slides in a talk by Justin Tonra at #DH2023. And now he even mentions the #multilingual #ELTeC!
It's done: you can now find 1365 #novels in 15 languages from the #ELTeC (European Literary Text Collection) in #TextGrid: https://textgridrep.org/project/TGPR-99d098e9-b60f-98fd-cda3-6448e07e619d - if you want to know more about the advantages of re-publishing ELTeC in TextGrid, come to our poster in Graz at #DH2023 soon: https://zenodo.org/record/8093218
Thanks @christof @eumanismo Stefan Funk, Ubbo Veentjer and Carolin Odebrecht