random human. I admit these are elephants.
I boost usually by mistake.
Any peer-reviewed work on the phenomenon that LLM+RLHF architectures like ChatGPT readily apologize when prompted, but will merrily make the exact same error next time (in a new interaction)?
I understand how and why this happens; I'm looking for a citable reference that makes this point (for use in a paper where we argue this is a major difference from how apologies & accountability work in human interaction).
Boost for reach appreciated! 
5/5 🚨Sign the petition to urge your representatives in the EU Parliament to reject the badly-drafted Child Sexual Abuse Regulation proposal #CSAR: https://civicrm.edri.org/stop-scanning-me
Our demand is supported by 125 orgs & thousands of individuals. Join us! #StopScanningMe