Poking around with sentiment analysis on the public domain copy of Pride and Prejudice by Jane Austen.

I extracted the speech, did a strict attribution, and ran sentiment analysis for different speakers based off chunks sampled from the text.

Elizabeth is neutral with a 28% confidence level, Jane is joyful at a 57% confidence. Darcy is sad with 94% confidence and Mrs Bennet is joyful at 95% confidence.

Those aren't the emotions I get from reading the text. Again, I'm learning more about the sentiment analysis than the text.
https://www.kaggle.com/code/alisonhawke/pride-and-prejudice

#DataScience #Python #Literature #TextAnalysis #SentimentAnalysis

Pride and Prejudice

Explore and run AI code with Kaggle Notebooks | Using data from multiple data sources

Spent some time doing data analysis on the Project Gutenberg text of Pride and Prejudice.

Pulling out all the speech, the library I used said it was "emotionally neutral" in sentiment. Which is interesting because when you read it, the speech is absolutely not that. There's a lot in the subtleties of the speech that makes it very pointed.

The confidence on the emotional rating was 57%, which seems low to me. Doing analysis on a book I'm familiar with and recently read is telling me as much about the means of evaluating the text as the text itself.
#DataScience #TextAnalysis #SentimentAnalysis

Metatextual Literacy

Okay, I want to bitch about a small thing that bugs me a little. To begin, let's do a close reading of Jeff Kinney's Diary of a Wimpy Kid[^1] that it realist...

Jenneral HQ ๐ŸŒ 

ๆ–ฐๆธ…ๅฃซ@(็”ŸๆˆAI)ใ‚คใƒณใƒ‡ใ‚ฃใ‚ฒใƒผใƒ ้–‹็™บ่€… (@kiyoshi_shin)

์ผ๋ณธ์–ด ํŠธ์œ—์œผ๋กœ, 'Opus'๋ผ๋Š” ๋ชจ๋ธ์ด AI ์ƒ์„ฑ ๋ฌธ์žฅ์„ ํƒ์ง€ํ•˜๊ณ  ์ธ๊ฐ„์ด ์ˆ˜์ •ํ•œ ๋ถ€๋ถ„๊ณผ ๊ทธ๋Œ€๋กœ ๋‚จ๊ธด ๋ถ€๋ถ„์˜ ์ฐจ์ด๋ฅผ ์‹๋ณ„ํ•  ์ˆ˜ ์žˆ๋‹ค๊ณ  ์„ค๋ช…ํ•จ. ์ด ๋ชจ๋ธ์€ ์–ธ์–ด ์Šคํƒ€์ผ ๋ถ„์„ ๋ฐ AI ํƒ์ง€ ๋Šฅ๋ ฅ์„ ๊ฐ–์ถ˜ ์ƒˆ๋กœ์šด ํ…์ŠคํŠธ ๋ถ„์„ ๊ธฐ์ˆ ๋กœ ๋ณด์ž„.

https://x.com/kiyoshi_shin/status/2025339820688769277

#ai #nlp #textanalysis #opus #aigeneration

ๆ–ฐๆธ…ๅฃซ@(็”ŸๆˆAI)ใ‚คใƒณใƒ‡ใ‚ฃใ‚ฒใƒผใƒ ้–‹็™บ่€… (@kiyoshi_shin) on X

ๅพŒใ€้ข็™ฝใ„ใจๆ€ใฃใŸใฎใฏOpusใฏAIใฝใ„ๆ–‡็ซ ใ‚’่ฆ‹ๆŠœใ„ใฆใใ‚‹ใ€‚ๆ‰‹ใ‚’ๅ…ฅใ‚ŒใŸ้ƒจๅˆ†ใจใ€ไปปใ›ใŸๆ–‡็ซ ใใฎใพใพใฎ้•ใ„ใ‚’ๆŒ‡ๆ‘˜ใ—ใฆใใ‚‹ใ€‚Opusใฏ่‡ชๅˆ†ใฎๆ–‡็ซ ใฎ็™–ใ‚’ใ‚ˆใ็ŸฅใฃใฆใŠใ‚Šใ€ไบบ้–“ใ‚‚ใพใŸใ™ใๆ…ฃใ‚Œใฆใใฆ่ฆ‹ๆŠœใ„ใฆใใใ†ใ€‚

X (formerly Twitter)

Why do politicians always talk about "middle class," "immigrants," or "families"?

New research funded by @fwf and @dfg_public led by Dr. Lena Maria Huber (https://lenamariahuber.eu/, MZES, University of Mannheim) and Dr. Hauke Licht (University of Innsbruck), explores how politicians talk about social groups in campaign platforms and parliamentary speeches across 8 Western European countries.

๐Ÿ”—https://haukelicht.github.io/projects/gaepd/

#PoliticalCommunication #ComputationalSocialScience #Democracy #TextAnalysis

Can #AI reasoning models infer people's underlying reasons in unstructured chat data from group decisions?

Across multiple prompting steps, #GTP5 usually did NOT select the same underlying reason as a human rater: https://doi.org/10.48550/arXiv.2601.05582

#AI #cogSci #textAnalysis #psychometrics

Fast Concordance

Ive been digging around for text analysis OS apps and found AntConc via a Reddit thread. This app is very good from what I can see in early quick testing. Im looking at term frequency across relevant papers, and some 'concordance' context but AntConc will do a lot more. Together with Taguette you have all you need for a lot of analysis.

Im running portable on Windows but Mac and Linux also work.
https://www.laurenceanthony.net/software/antconc/

#AntConc #textanalysis #research #academia #academicchatter #linguistics

Recs for text analysis tools, without any or only minimal genai - Taguette, QDA Miner, what else? Bulk document (around 50 papers) common word analysis is what Im mainly looking for, as well as individual document labelling. Open source, free, Windows 10.
#QualitativeData #textanalysis #software #research #academia #academicchatter #opensource

Discover google/langextract, a game-changing Python library for extracting insights from unstructured text with LLMs! #LLM #NLP #TextAnalysis

The google/langextract library leverages Large Language Models (LLMs) to extract structured information from unstructured text, enabling precise source grounding and interactive visualization capabilities. By utilizing Python, developers can integrate this...

#google/langextract #LargeLanguageModels #NaturalLanguageProcessing #PythonLibraries