Is this real though? Does ChatGPT just literally take whole snippets of text like that? I thought it used some aggregate or probability based on the whole corpus of text it was trained on.
I remember doing some crude text generation in my NLP (Natural Language Processing) class, and the logic was basically this:
This is a rough explanation of a statistical language model, which I think is the basic idea behind LLMs (as far as I know, actual LLMs are transformer neural networks rather than Bayesian nets, but they still predict the next token from learned probabilities). We used a very simple n-gram model, where the previous n words are considered for the statistics (e.g. “to my math” is much more likely to generate “class” than “homework”), but they’re probably doing much fancier things to generate more relevant text.
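For anyone curious, here is a minimal sketch of that kind of n-gram generator in Python. It is not what ChatGPT does and not the exact code from my class; the toy corpus and the function names (`train_ngram`, `next_word_probs`, `generate`) are made up for illustration. It just counts which word follows each run of n words and samples from those counts.

```python
import random
from collections import Counter, defaultdict

def train_ngram(text, n=3):
    """Count which word follows each run of n consecutive words."""
    words = text.lower().split()
    counts = defaultdict(Counter)
    for i in range(len(words) - n):
        context = tuple(words[i:i + n])
        counts[context][words[i + n]] += 1
    return counts

def next_word_probs(counts, context):
    """Turn the raw follow-up counts for a context into probabilities."""
    followers = counts.get(tuple(context), Counter())
    total = sum(followers.values())
    return {w: c / total for w, c in followers.items()} if total else {}

def generate(counts, seed, length=10, rng=None):
    """Extend the seed by sampling one word at a time from the counts."""
    rng = rng or random.Random()
    n = len(seed)  # assumes the seed is as long as the training context
    out = list(seed)
    for _ in range(length):
        probs = next_word_probs(counts, out[-n:])
        if not probs:  # context never seen in training, so stop
            break
        words, weights = zip(*probs.items())
        out.append(rng.choices(words, weights=weights)[0])
    return " ".join(out)

# Hypothetical toy corpus; a real model would be trained on far more text.
corpus = (
    "i walked to my math class today . "
    "she ran to my math class early . "
    "he went to my math class late . "
    "i went back to my math homework again ."
)

counts = train_ngram(corpus, n=3)
probs = next_word_probs(counts, ["to", "my", "math"])
print(probs)  # {'class': 0.75, 'homework': 0.25}
print(generate(counts, ["to", "my", "math"], length=5))
```

In this toy corpus “to my math” is followed by “class” three times and “homework” once, so “class” gets picked about 75% of the time. With a corpus this small the output will often reproduce whole training sentences, simply because most contexts only have one continuation.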
The LLM isn’t really “thinking” here; it’s just associating the input text with patterns in its training data to generate output text.