#CLS #CCLS24 #Journal #JCLS https://jcls.io
This paper presents BookNLP-fr: the adaptation to French of BookNLP, an existing NLP pipeline tailored to literary texts in English. We provide an overview of the challenges involved in adapting such a pipeline to a new language, from data annotation to the development of specialized modules for entity recognition and coreference resolution. Moving beyond the technical aspects, we explore practical applications of BookNLP-fr with a canonical task for computational literary studies: subgenre classification. We show that BookNLP-fr provides more relevant and, even more importantly, more interpretable features for automatic subgenre classification than the traditional bag-of-words approach. BookNLP-fr makes NLP techniques available to a wider public and constitutes a new toolkit for processing large numbers of digitized books in French, allowing the field to gain a deeper literary understanding through the practice of distant reading.
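To make the comparison concrete, the bag-of-words baseline mentioned above can be sketched in pure Python: each text becomes a vector of word counts, and a new text is assigned to the subgenre whose summed-count centroid it most resembles. All texts and labels below are invented toy examples, not the paper's corpus or method.

```python
# Minimal bag-of-words subgenre classifier: word-count vectors plus
# cosine similarity to per-class centroids. Toy data, illustrative only.
import math
from collections import Counter

def bow(text):
    """Bag-of-words vector as a Counter of lowercased tokens."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

training = {
    "crime":   ["le détective examina la scène du crime",
                "l'inspecteur suivit la piste du meurtrier"],
    "romance": ["elle rêvait de son premier amour",
                "leurs coeurs battaient à l'unisson"],
}

# One centroid (summed word counts) per subgenre.
centroids = {label: sum((bow(t) for t in texts), Counter())
             for label, texts in training.items()}

def classify(text):
    v = bow(text)
    return max(centroids, key=lambda label: cosine(v, centroids[label]))

print(classify("le meurtrier quitta la scène du crime"))  # → crime
```

The paper's point is that such surface-frequency features, while effective, are hard to interpret; pipeline-derived features (entities, coreference chains) name linguistic phenomena directly.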
Seneca's authorship of Octavia and Hercules Oetaeus is disputed. This study employs established computational stylometry methods based on character n-gram frequencies to investigate the case. In a Principal Component Analysis (PCA) of stylistic similarities within the Senecan corpus, Octavia and Phoenissae emerge as outliers, while Hercules Oetaeus stands out only when the text is split in half. Subsequently, applying PCA and Bootstrap Consensus Trees (BCT) to a corpus of distractor texts, both disputed plays align with the Senecan cluster/branch. The General Impostors method confidently reports Seneca as the author of the disputed plays under various scenarios. However, closer examination of text segments yields indications of mixed authorship. On the basis of computational stylometry, it appears that the disputed plays were in large part, but not wholly, written by Seneca.
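The core technique named in the abstract can be sketched as follows: build character n-gram frequency profiles for each text and project them into two dimensions with PCA (computed here via SVD with NumPy). The texts are short Latin stand-ins, not the Senecan corpus, and the parameters (n=3, top-50 trigrams) are illustrative choices rather than the study's settings.

```python
# Character trigram profiles + PCA-by-SVD: a minimal stylometry sketch.
# Texts are toy stand-ins for two hypothetical authorial groups A and B.
from collections import Counter
import numpy as np

def char_ngrams(text, n=3):
    """Counter of overlapping character n-grams."""
    text = text.lower()
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))

texts = {
    "A1": "arma virumque cano troiae qui primus ab oris",
    "A2": "italiam fato profugus laviniaque venit litora",
    "B1": "gallia est omnis divisa in partes tres quarum",
    "B2": "unam incolunt belgae aliam aquitani tertiam",
}

# Shared vocabulary: the most frequent trigrams across the whole corpus.
total, profiles = Counter(), {}
for name, t in texts.items():
    profiles[name] = char_ngrams(t)
    total += profiles[name]
vocab = [g for g, _ in total.most_common(50)]

# Relative-frequency matrix: one row per text, one column per trigram.
X = np.array([[profiles[n][g] / sum(profiles[n].values()) for g in vocab]
              for n in texts])

# PCA via SVD of the mean-centered matrix; keep the first two components.
Xc = X - X.mean(axis=0)
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
coords = U[:, :2] * S[:2]
for name, (pc1, pc2) in zip(texts, coords):
    print(f"{name}: PC1={pc1:+.3f} PC2={pc2:+.3f}")
```

In a real stylometric setting, outliers in this PC1/PC2 plane (as Octavia and Phoenissae are reported to be) are texts whose n-gram profiles diverge from the rest of the corpus.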
Computational studies of literature use proxies such as sales numbers, human judgments, or canonicity to estimate literary quality. However, many quantitative studies use one such measure as a gold standard without fully reflecting on what it represents. We examine the interrelation of 14 proxies of literary quality in novels published in the US from the late 19th to the 20th century, distinguishing between expert-based judgments (e.g., syllabi, anthologies) and crowd-based ones (e.g., GoodReads ratings). We show that works favored in expert-based judgments often score lower on GoodReads, while award-nominated works tend to circulate more widely in libraries. As we map the literary judgment landscape, two main kinds of 'quality perception' emerge: one associated with canonical literature and one with more popular literature. Additionally, prestige in genre literature, reflected in awards like the Hugo Award, forms a distinct category, more aligned with popular than with canonical proxies.
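Measuring the interrelation of quality proxies amounts to correlating rankings of the same novels under different measures. A minimal sketch, using Spearman rank correlation implemented in pure Python, with invented scores (the proxy names and numbers are illustrative, not the study's data):

```python
# Spearman rank correlation between two hypothetical quality proxies
# for the same five novels. All values are invented for illustration.
def ranks(values):
    """Rank positions (1 = smallest), averaging ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    r = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            r[order[k]] = avg_rank
        i = j + 1
    return r

def spearman(x, y):
    rx, ry = ranks(x), ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)

expert = [9.1, 8.7, 6.2, 5.0, 7.8]   # e.g. expert-based score (invented)
crowd  = [3.4, 3.6, 4.5, 4.8, 3.9]   # e.g. crowd-based rating (invented)
print(spearman(expert, crowd))  # → -1.0 (perfect inverse ranking)
```

The invented numbers here are chosen to produce a perfectly inverted ranking, echoing the abstract's finding that expert-favored works often score lower with crowds; real proxy correlations would of course be weaker and computed over many novels.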