New post: Claude can make mistakes β A LLM hallucination example that I was thinking about for my Dad. ποΈββοΈ
https://noumenal.es/posts/claude-can-make-mistakes/3Ay/ #LLMs
New post: Claude can make mistakes β A LLM hallucination example that I was thinking about for my Dad. ποΈββοΈ
https://noumenal.es/posts/claude-can-make-mistakes/3Ay/ #LLMs
@webology JeffBot please summarize π
(It's not a short episode)
@carlton Start about ~10 or 12 minutes from the end.
The tl;dr is that we are getting confused by LLM's willingness to always provide an answer because of how they are tuned. Which is annoying, and that's why I prefer the results I get from the APIs directly, because I have more options available.
Strangely, I'm not sure why my results were better other than Claude hit the web to find them.
@webology I suspect your prompt was more demanding?
(That's also a point I have planned to discussβ¦)
@carlton I copy and pasted your question. I did capitalize the "US Masters" though, but everything were your words. https://claude.ai/share/a464d9e4-9efe-4e2b-bc21-561698c8baa6
Without the ability to do a web search, I don't find LLMs to be good for grabbing random facts.
A fun one to try is give it a list from a requirements.in file and ask it to give you the pypi and GitHub or whatever repo for each one.
@webology π
> I don't find LLMs to be good for grabbing random facts.
Exactly with you! But that's precisely how I see normies using them. I'm not advocating it, rather just preparing an example to demonstrate the point, to my Dad.
I repeatedly bump into bewilderment about how the output could be anything other than correct. The idea that they're fact retrieval engines seems deeply embedded with folks in the wild. π€· (Others have more to say on this than I do.)
@carlton came to read whatever Carlton writes. Stayed for the golf.
Love the AI as teenagers analogy: "The trouble I have with them is that they're not experts at anything. (They're teenagers so they often think they are, but that's not the same.) It's hard to get them to see, and feel, the issues here in the same way."