Andrew Piper

536 Followers
535 Following
29 Posts
Using #AI and #NLP to study storytelling at McGillU. Author of Enumerations: Data and Literary Study (2018) and director of .txtlab.
@grvsmth @TedUnderwood @dh curious how you see this playing out. Doing analytical things using natural language? Curious how you would flesh out LLMs for analytical purposes.
another #chatGPT phenomenon. It can't quite bring itself to speak 100% nonsense. I could get it to make up words, but it will always fall back on real connective words. Like it longs for grammar anchors.
#chatgpt question: I thought it was a stochastic parrot. I got the exact same response to the same prompt. How is that possible?
@lucy @dbamman yeah it seems like a lot of computing for a potentially little problem. maybe one direction to check for is multilingualism. bookNLP's suite works well because we already had training data but in scenarios when we don't maybe useful?

@TedUnderwood @sinykin @dbamman

also curious which would be more efficient for the full stack of info bookNLP gives you. i.e. having POS, deprel, ner, coref, etc all in one place. would this be easily replicable with GPT?

@TedUnderwood @sinykin @dbamman yeah we're building out some ground truth annotated data on the bookNLP "super-sense" tags. Then should be pretty straightforward to triangulate different approaches and relative accuracy.
@humanitiesData thanks for these suggestions!
So @dbamman do you think we are soon going to be post bookNLP? See attached. Experiment from this new paper: https://ceur-ws.org/Vol-3290/long_paper1576.pdf

As a reminder, attack on speech in higher Ed goes on. New bill will make it illegal to have a DEI office or even host an event about diversity, equity and inclusion in Texas universities

https://capitol.texas.gov/tlodocs/88R/billtext/pdf/HB01006I.pdf#navpanes=0

@mldh @alizhorvathaliz @quinnanya will have a new multilingual dataset from HathiTrust appearing next month to start facilitating research.