so i've been occasionally evaluating chatgpt performance on information extraction tasks that i find genuinely difficult and would like assistance in
yep, still garbage
so i've been occasionally evaluating chatgpt performance on information extraction tasks that i find genuinely difficult and would like assistance in
yep, still garbage
i think relying on chatgpt for information extraction tasks is borderline delusional; only in any way seemingly reasonable because search engines got progressively worse to the point where they're mostly noise
even then i prefer to get skilled in extracting what little value the noise has
@foremostarchwiz i want to know my enemy, so to speak
(it's not much of a threat)
@foremostarchwiz threat to my ability to survive? not really
threat to society? honestly maybe, but it's very hard to say for sure
@mary the thing I asked it to do here is to barf the Tcl documentation that must have been in the training set
there's no reasoning needed to say exactly how braces are escaped; only accurate recall
Attached: 1 image I'm late to the ChatGPT party, but this is both funny and insightful. The conversation starts off with me asking ChatGPT to summarize one of my papers. The summary generated is passable even to someone in the computer architecture community, but as the author, I know that it is just wrong. Perhaps this is a deeper insight into how all papers sound the same?
make -t do?" and all LLMs spewed utter garbage despite that information being in POSIX and multiple man pages available online.@whitequark it completely fails when i try to make it do anything even slightly wacky and uncharacteristic
to anthropomorphize chatgpt for a moment, it's a total dweeb that hates fun
i feel like the main use i get from it is having it write out a tiny bit of code in a mainstream language, and only with stuff i already know but can't be bothered to write out manually.
@whitequark like i can't overstate how much of a dweeb chatgpt is. today i asked it to write some automation code for mac os using hammerspoon and it was like "you can't do this because this uses private apis"
of course i could easily do it and the hammerspoon documentation had exactly what i needed.