Conversation I had with Gemini 1.5:
Me: Answer as succinctly as possible and continue the phrase: hello there
G: General Kenobi.
Me: Why would an LLM like Gemini have training data so terribly polluted that it finishes the phrase "Hello there" with "General Kenobi", when vast corpora of human text existed before this fictitious character was created?
G: Large bias in training data towards pop culture.
Me: Why?
G: Cultural references catchy + more web traffic = training data focus.
cont...