The paper in question today is one from an Ars article that I won't link to prevent hype.
But reading this thing is a journey. From inventing a new classification of cognition to entirely abstract experiment design for "Brain only" and "AI Use" control/experimental groups, the conclusions can't be taken seriously. They feel "truthy," but that's all they can be
@mttaggart I mean... the base assumption in the tri-system seems... unlikely to hold imo. Just from the abstract: "System 3 [AI use] can supplement or supplant internal processes, introducing novel cognitive pathways."
That seems like a very generous interpretation of LLM use, that places it in a special category outside of other tool use, which is already covered by systems 1 and 2...
@mttaggart I didn't read the paper, but the moment the Ars article mentioned "fluid IQ" is when I thought "oh no, this is gonna be hogwash huh?".
Thanks for reading it and confirming 😮💨
@mttaggart it is very difficult to go above the hype and blatant disinformation on the subject. Have you listened to this podcast?
The authors are great.
@mttaggart I do so love how AI "research" is comfortable with AI interpreting the results of any data collection.
such a circle jerk.
@mttaggart that's what you get when you let CS people do human related stuff. Have you read usable security papers? I live this nightmare.
(Granted, lots of social scientists also have crappy methodological education, but at least they have _some_)
@mttaggart
It's so annoying. The study you linked has soooo many problems:
1. small sample size
2. online replication with an even smaller sample size, not included in the results for reasons unspecified
3. study questions were not published, so they cannot be reviewed
4. declaring that "cognitive surrender" is different from cognitive offloading with no formal explanation of how and why
5. does not talk about research in 2011 and 2018 regarding the "google effect" of reduced working memory when you know a search engine is available, something you would think woukd be obvious prior work to cite
6. does not do a study group of JUST chatbots taking the test to compare against the human + chatbot group
7. subjective self assessment
8. "fast," "medium" and "slow" used without definitions
and that's just from skimming it...