I'm teaching "NLP for Cultural Analytics" at UW Linguistics this quarter, and I thought I'd share my reading list for the course.

https://maria-antoniak.github.io/teaching/2023-nlp-ca

Lots of usual suspects from the Journal of Cultural Analytics, but also pieces from NLP and computational social science. Thinking back to an earlier thread on "what even is cultural analytics," you can see from this list my perception of the field (or maybe just what I'm in the mood to read right now).

#DigitalHumanities #CulturalAnalytics

Maria Antoniak

My academic website / portfolio.

(Thank you @emilymbender for the invitation to revisit UW Linguistics!)
@maria_antoniak Thank you! Our students are very fortunate :)
It's my first time crafting my own syllabus, and I'm sure this list is imperfect. Certainly it reflects my own biases in good ways (my personal interests, what makes the course "mine") and ways that I can work on improving. Lots to reflect on and learn from if I teach again in the future. But I'm happy that the list includes a broad range of topics and a decent amount of data ethics, evaluation, reproducibility, and critical studies from different fields, along with reusable technical methods.
@maria_antoniak Thanks! I don't have the technical/programming chops to take on NLP at the graduate level, but the Santa Fe Institute is running Foundations & Applications of Humanities Analytics through their Complexity Explorer portal starting on January 17. It should be just about my speed with some optional Python/Pandas assignments thrown in.
@curtisfrye The Santa Fe Institute is great, and I'd also recommend this wonderful (free! online!) book by @mellymeldubs for those new to digital humanities, cultural analytics, and NLP:
https://melaniewalsh.github.io/Intro-Cultural-Analytics/features/welcome.html
@maria_antoniak this looks amazing!
@Shugars Thank you! Excited to re-read and share some of your work with these students!

@maria_antoniak More a developer than a data scientist, but my current work is pushing me in that direction so haven't been exposed to much of the literature here. The paper on information manipulation in wartime Russia is just the thing I didn't know I was missing until your post here. Thank you!

For reference: https://arxiv.org/abs/2205.12382

Challenges and Opportunities in Information Manipulation Detection: An Examination of Wartime Russian Media

NLP research on public opinion manipulation campaigns has primarily focused on detecting overt strategies such as fake news and disinformation. However, information manipulation in the ongoing Russia-Ukraine war exemplifies how governments and media also employ more nuanced strategies. We release a new dataset, VoynaSlov, containing 38M+ posts from Russian media outlets on Twitter and VKontakte, as well as public activity and responses, immediately preceding and during the 2022 Russia-Ukraine war. We apply standard and recently-developed NLP models on VoynaSlov to examine agenda setting, framing, and priming, several strategies underlying information manipulation, and reveal variation across media outlet control, social media platform, and time. Our examination of these media effects and extensive discussion of current approaches' limitations encourage further development of NLP models for understanding information manipulation in emerging crises, as well as other real-world and interdisciplinary tasks.

arXiv.org
@maria_antoniak Looks wonderful. Thank you so much for posting this.
@maria_antoniak thanks so much for sharing 😊
I saved for reading the papers later. Have a great semester!
@itovvmk Thank you, glad that it's useful for you!
@maria_antoniak thanks for sharing this is very interesting.