CorpusExplorer (Update Q1 2026)

Das Release Q1 2026 enthält keine neuen Features – nur kleinere und größere Fehlerkorrekturen (insbesondere durch Updates von Drittanbieter-Komponenten).
Aktuell teste ich, ob HDF5 eine Alternative zum CEC6-Format sein kann. Es laufen auch größere Umbauarbeiten und Refactorings am Code – die in Q2 2026 einfließen werden.

#2026 #Bugfixes #CorpusExplorer #Q1

Reading of #VoyantTools recently (👋🏻 @felwert ), would you prefer to have a corpus with un-normalized historical spelling variants or rather one with only the lemmatized tokens? We have a mechanism for lemmatizing, but not for "just" normalizing, so this option is not viable for us in the salamanca.school project.

Perhaps @dta_cthomas can you share some experiences with offering both?

Second question: do you know of alternative "distant reading" visualization tools/libraries/platforms to integrate into a (headless) corpus/collection website? (Without trying, I suppose this excludes some visualization-capable corpus analysis apps like #TXM or #CorpusExplorer, but I'd be happy to be proven wrong.)