Mastodawn

Teddy Roland Apr 23, 2023

Interdisciplinary q: are there disciplines that conventionally model non-linguistic behaviors (in humans or otherwise) using CFGs?

I’m thinking some theory of the firm in Econ or of root structures in plant biology

Teddy Roland Mar 13, 2023

Hi friends, I'm looking for leads on fellowships or dissertation support, for the coming academic year. My weirdo Lit Studies project that has me training and testing LLMs during AY 2023-24.

If you know of any DH or DH-flavored linguistics/CS fellowships, HMU! Thanks!

Teddy Roland Nov 30, 2022

Alex Saum Nov 30, 2022

The #ucstrike continues and here's some interesting data> The highest paid UC faculty (from http://ucop.edu) are the least likely to support the graduate student strike; measured by petition signing [from @BeccaRoskill and @snaidunl, from that place called twitter] #fairUCnow #facultysolidarity

University of California | Office of The President

Teddy Roland Nov 30, 2022

Max Woolf Nov 30, 2022

Coding with GPT-3 text-davinci-003

Teddy Roland Nov 27, 2022

Ted Underwood Nov 27, 2022

The painful part of my disciplinary transition from #English Lit to #InformationScience had nothing to do with technology; it was the gradual recognition that now I need to propose research ideas that could ... somehow ... actually help people. Like, people alive today! Ugh.

Teddy Roland Nov 25, 2022

Jeffrey M. Binder Nov 25, 2022

My first book, Language and the Rise of the Algorithm, is out now with U. Chicago Press!

Bringing together the histories of #mathematics, #computerscience, and #linguistics, Language and the Rise of the Algorithm offers a wide-ranging tour of the intellectual developments that produced the modern idea of the #algorithm, from Renaissance #algebra to recent advances in #machinelearning.

#histtech #histodons #histstm #Leibniz #algorithms #language

https://www.amazon.com/dp/0226822532
https://www.barnesandnoble.com/w/language-and-the-rise-of-the-algorithm-jeffrey-m-binder-phd/1141114947?ean=9780226822532

Amazon.com

Teddy Roland Nov 25, 2022

LRT (LB?) there’s a movement afoot in #DeepLearning to constrain model-space using prior knowledge of solutions’ structure. In mature fields like #NLP this makes a lot of sense, since it blends paradigms developed over decades.

But, over here, #Computational #LitStudies is young and born-statistical. Building up a deterministic paradigm — just identifying relationships of logical necessity, never mind a generative grammar! — would make a powerful complement.

Teddy Roland Nov 24, 2022

JCLS Nov 24, 2022

Just out in JCL! "‘This book makes me happy and sad and I love it’. A Rule-based Model for Extracting Reading Impact from English Book Reviews", by Marijn Koolen, Julia Neugarten and Peter Boot. They distinguish four types of impact: affective, narrative, stylistic, reflective. The authors report on adapting their "reading impact model" from Dutch to English. They also look at correlations between impact and ratings (see screenshot). https://doi.org/10.48694/jcls.104

‘This book makes me happy and sad and I love it’. A Rule-based Model for Extracting Reading Impact from English Book Reviews

Being able to identify and analyse reading impact expressed in online book reviews allows us to investigate how people read books and how books affect their readers. In this paper we investigate the feasibility of creating an English translation of a rule-based reading impact model for reviews of Dutch fiction. We extend the model with additional rules and categories to measure reading impact in terms of positive and negative feeling, narrative and stylistic impact, humour, surprise, attention, and reflection. We created ground truth annotations to evaluate the model and found that the translated rules and new impact categories are effective in identifying certain types of reading impact expressed in English book reviews. However, for some types of impact the rules are inaccurate, and for most categories they are incomplete. Additional rules are needed to improve recall, which could potentially be enhanced by incorporating Machine Learning. At the same time, we conclude that some impact aspects are hard to extract with a rule-based model. When applying the model to a large set of reviews, lists of the top-scoring books in the impact categories show the model's prima-facie validity. Correlations among the categories include some that make sense and others that require further research. Overall, the evidence suggests that for investigating the impact of books, manually formulated rules are partially successful, and are probably best used in a hybrid approach.

Journal of Computational Literary Studies

Teddy Roland Nov 21, 2022

Euan Cochrane Nov 21, 2022

"Slide decks as government publications: exploring two decades of PowerPoint files archived from US government websites" by @tjowens @Tjowens and Jonah Estess https://link.springer.com/article/10.1007/s10502-022-09406-2

Slide decks as government publications: exploring two decades of PowerPoint files archived from US government websites - Archival Science

Over the last three decades, US government agencies have published hundreds of thousands, if not millions, of PowerPoint files on the web. Hundreds of thousands of these files have been captured and preserved in web archives. With that noted, it remains difficult to find and interact with these files. This paper analyzes a public dataset of 1,000 PowerPoints from US government websites in the Library of Congress web archives to explore the properties of these kinds of files. This publicly available dataset published in 2019 includes a random sample of a thousand files from the more than 300,000 files that purport a PowerPoint media type in the Library of Congress web archives. The study focuses on characterizing the nature of these publications, the extent to which embedded metadata in these documents could be used to improve access to them and exploring what properties of these files are likely to be important to future users. Exploring these data provides a means to begin to understand the value and nature of PowerPoint files as a format of government publishing and government records.

SpringerLink

Teddy Roland Nov 21, 2022

Ryan Cordell Nov 21, 2022

This CFP for a new volume, *TextGenEd: Teaching with Text Generation Technologies*, looks especially salient in this precise moment of anxiety around pedagogy & AI https://docs.google.com/document/d/1vjjablPUReOPH-ky_WAAXyAAFLIVIbZ6BRbG-8AXoms/edit

CFP: TextGenEd: Teaching with Text Generation Technologies

CALL FOR PROPOSALS: TextGenEd: Teaching with Text Generation Technologies Editors: Annette Vee, Assoc. Prof. of English and Dir. of Composition, University of Pittsburgh Tim Laquintano, Assoc. Prof. of English and Dir. of College Writing Program, Lafayette College Carly Schnitzler, Ph.D. Candi...

Google Docs