Mastodawn

Carl T. Bergstrom Aug 13, 2024

Taylorism is a management philosophy based on using scientific optimization to maximize labor productivity and economic efficiency.

Here's the result of making the false Taylorist assumption that the output of scientific research is scientific papers—the more, faster, and cheaper, the better.

Show thread

Carl T. Bergstrom Aug 13, 2024

Papers are not the output of scientific research in the way that cars are the output of automobile manufacturing.

Papers are merely a vehicle through which a portion of the output of research is shared.

We confuse the two at our peril.
‪
The entire idea of outsourcing the scientific ecosystem to LLMs — as described below — is a concept error that I can scarcely begin to get my head around.

sakana.ai/ai-scientist/

Show thread

Carl T. Bergstrom Aug 13, 2024

"While there are still occasional flaws in the papers produced by this first version..."

Meanwhile the authors note that the output itself fails to meet standards of scientific rigor, but treat this as a minor wrinkle, not a fundamental barrier imposed by using the wrong tool for the wrong job.

Show thread

Carl T. Bergstrom Aug 13, 2024

This system literally fabricates its methods section — an act which goes beyond bad science into the realm of serious scientific misconduct. This is more than a wrinkle to be ironed out.

Show thread

Carl T. Bergstrom Aug 13, 2024

Scientists: We need to slow down the publication race and produce higher quality papers at a slower rate to make the literature manageable again.

Engineers: We hear you. Now every lab in the world will be able to produce hundreds of medium-quality papers (with a few mistakes in each) every week.

Show thread

Carl T. Bergstrom Aug 13, 2024

I do appreciate the authors' candor in detailing failure modes.

A system that makes difficult-to-catch mistakes in implementation, fails to compare quantitative data appropriately, and fabricates entire results—maybe I have high standards but I don't see this as writing "medium-quality" papers.

Show thread

Carl T. Bergstrom Aug 13, 2024

Here's the weird Taylorism again. The system produces work at the level of an early trainee requiring substantive supervision. This is not good ROI for producing papers.

The primary output of time invested in trainee research is the development of independent scientists—not the research papers.

Show thread

Carl T. Bergstrom Aug 13, 2024

In the end, how one judges this paper probably comes down to how one assesses the claim that is always used to justify this kind of work.

The authors "believe" that future versions will be
greatly improved.

Given what I know of fundamental limits to what LLMs can do, I see no reason to agree.

When I fail to do something, I either don't publish or very occasionally I publish describing that failure. When I do so, I don't pretend it was a success and promise that it'll magically get better.

Show thread

Coprolite9000 Aug 14, 2024

@ct_bergstrom
> 'The authors "believe" that future versions will be greatly improved.'

I'm wondering if there's a name for this effect - where a machine does something resembling a human activity, and people ascribe it further human qualities - then assume it will advance in ability in a similar manner to a human.

See also: self-driving cars. Shuffling mindlessly around a car park? Great! It'll soon be driving perfectly.

Or not.

Show thread

Jon Spring

@coprolite9000 @ct_bergstrom I agree skepticism is in order about the pace and extent of progress. But this expectation seems reasonable to me in the abstract if we are iterating and can always use the best model to date.
For example, I can’t imagine how computers could ever get worse at playing chess than they are now, and as long as we iterate new approaches, there’s a possibility of improvement.

Show thread

FeralRobots Aug 15, 2024

@Spring @coprolite9000 @ct_bergstrom
FWIW it's quite common for performance of machine learning systems to degrade rather than improve. Ensuring that performance DOES improve typically takes a lot of human work.

Show thread

Bornach Aug 15, 2024

@FeralRobots @Spring @coprolite9000 @ct_bergstrom
And then there is the problem of an AI that has defeated the top-level human players is somehow still beatable using a strategy that only an intermediate level player would consider deploying

https://youtu.be/l7tWoPk25yU

https://arstechnica.com/information-technology/2023/02/man-beats-machine-at-go-in-human-victory-over-ai/

- YouTube

Enjoy the videos and music you love, upload original content, and share it all with friends, family, and the world on YouTube.