People trying to train AIs are now complaining that all of the AI data on the internet are making it hard for them to get quality training sets of natural language and images.

*bitter snickering*

@futurebird One thing that's pretty clear is that LLMs don't learn very efficiently. None of us inhaled that much data to learn to speak one (or more) languages. None of us inhaled that much data to learn to recognize dog breeds, or plants, or ants, etc. The thing that the LLMs seem to have learned better than (most of) us is multi-subject "man on the Internet" confidence.

OTOH, perhaps our human ability to "learn efficiently" makes us vulnerable to learning conspiracy theories from bullshit.

@futurebird pretty sure there's trained professionals working on this problem, the advantage gained by solving it would be huge.