Stanford CS336 | Language Modeling from Scratch

Official course website for Stanford CS336: Language Modeling from Scratch (Spring 2026), including logistics, schedule, assignments, and course materials.

Stanford CS336
Mind Your Tone: Investigating How Prompt Politeness Affects LLM Accuracy (short paper)

The wording of natural language prompts has been shown to influence the performance of large language models (LLMs), yet the role of politeness and tone remains underexplored. In this study, we investigate how varying levels of prompt politeness affect model accuracy on multiple-choice questions. We created a dataset of 50 base questions spanning mathematics, science, and history, each rewritten into five tone variants: Very Polite, Polite, Neutral, Rude, and Very Rude, yielding 250 unique prompts. Using ChatGPT 4o, we evaluated responses across these conditions and applied paired sample t-tests to assess statistical significance. Contrary to expectations, impolite prompts consistently outperformed polite ones, with accuracy ranging from 80.8% for Very Polite prompts to 84.8% for Very Rude prompts. These findings differ from earlier studies that associated rudeness with poorer outcomes, suggesting that newer LLMs may respond differently to tonal variation. Our results highlight the importance of studying pragmatic aspects of prompting and raise broader questions about the social dimensions of human-AI interaction.

arXiv.org

How Can We Prevent AI Models From Cannibalizing Themselves When Human-Generated Data Runs Out? 

Getty Images While the evolution of artificial intelligence (AI) systems has shown no sign of slowing, there's a growing concern that large language models (LLMs) will soon run out of human-made data to ingest and learn from. Once this happens, scientists say, AI models will increasingly rely on synthetic AI-made information, which will lead to an effect called "model collapse."......Continue reading... By:  Roland Moore-Colyer Source:  Live Science . Critics: A backdoor in a […]

https://onlinemarketingscoops.com/2026/05/22/how-can-we-prevent-ai-models-from-cannibalizing-themselves-when-human-generated-data-runs-out/

How Can We Prevent AI Models From Cannibalizing Themselves When Human-Generated Data Runs Out? 

Getty Images While the evolution of artificial intelligence (AI) systems has shown no sign of slowing, there’s a growing concern that large language models (LLMs) will soon run out of human-m…

Online Marketing Scoops

I'd like to introduce #Emily, an #OpenSource #InformationRetrieval system I've been working on in my spare time.

https://petebleackley.github.io/Emily/

#Python #NaturalLanguageProcessing @IRRJ

Emily

Dr Peter Bleackley’s Portfolio

Pete Bleackley

Harnessing Amazon Kinesis in Machine Learning and Artificial Intelligence

Amazon Kinesis, a suite of services offered by AWS, allows the collection, processing, and analysis of real-time streaming data, proving integral to advances in machine learning and artificial intelligence. The services support real-time ingestions, predictions, anomaly detection, personalized user experiences, predictive maintenance, fraud detection, and natural language processing. The tool's scalability, data quality, cost management, and security presents challenges, which can be mitigated with proper configuration, data validation, and robust monitoring.

https://atozofsoftwareengineering.blog/2023/10/30/harnessing-amazon-kinesis-in-machine-learning-and-artificial-intelligence/

NATURAL LANGUAGE PROCESSING IN ARTIFICIAL INTELLIGENCE (b)

(being continued from 12/06/24) CHAPTER 1A Survey on Social BusinessIntelligence: A Case Study of Application of Dynamic Social Networks for Decision MakingSUBRATA PAUL,1CHANDAN KONER,2and ANIRBAN MITRA3Research Scholar, MAKAUT, and Annex College, Kolkata, West Bengal,India, E-mail: [email protected]. B C Roy Engineering College, Durgapur, West Bengal, India3Amity University, Kolkata, West Bengal, India ABSTRACTOver the years, the popularity of social network platforms has […]

https://spacezilotes.wordpress.com/2026/05/15/natural-language-processing-in-artificial-intelligence-b/

ARTIFICIAL INTELLIGENCE (44) – Natural Language Processing (22) RAG (Retrieval‑Augmented Generation)

Retrieval‑Augmented Generation (RAG) is an architecture that combines: Information retrieval (searching relevant documents), and Text generation (using a Large Language Model). Instead of relying only on what the model learned during training, RAG allows the model to: Look up external, up‑to‑date, or private knowledge,  and Use that retrieved information as context when generating an answer. This drastically improves accuracy, grounding, and trustworthiness. Overview of the Main […]

https://yolandamuriel.com/2026/05/10/artificial-intelligence-44-natural-language-processing-22-rag-retrieval-augmented-generation/

ARTIFICIAL INTELLIGENCE (44) – Natural Language Processing (22) RAG (Retrieval‑Augmented Generation)

Retrieval‑Augmented Generation (RAG) is an architecture that combines: Information retrieval (searching relevant documents), and Text generation (using a Large Language Model). Instead of relying o…

Natural Language Autoencoders

Turning Claude's thoughts into text

Let’s talk about LLMs

Everybody seems to agree we’re in the middle of _something_, though what, exactly, seems to be up for debate. It …

James Bennett

ARTIFICIAL INTELLIGENCE (40) – Natural Language Processing (18) Few-Shot Prompting

Few-shot prompting is a technique used with large language models where the model is given a small number of examples inside the prompt itself to show what kind of task it should perform and how the output should look. Importantly, the model is not retrained or fine-tuned. Instead, it learns on the fly from the examples provided in the context. This behavior is often called in-context learning. Emergent few-shot learning highlights that this ability of modelsl was not explicitly programmed, […]

https://yolandamuriel.com/2026/04/30/artificial-intelligence-40-natural-language-processing-18-few-shot-prompting/

ARTIFICIAL INTELLIGENCE (40) – Natural Language Processing (18) Few-Shot Prompting

Few-shot prompting is a technique used with large language models where the model is given a small number of examples inside the prompt itself to show what kind of task it should perform and how th…