Haven't read this one yet, but I'm itching to:

https://mastodon.world/@Mer__edith/113197090927589168

Hype, Sustainability, and the Price of the Bigger-is-Better Paradigm in AI
With the growing attention and investment in recent AI approaches such as large language models, the narrative that the larger the AI system the more valuable, powerful, and interesting it is, is increasingly seen as common sense. But what is this assumption based on, and how are we measuring value, power, and performance? And what are the collateral consequences of this race to ever-increasing scale? Here, we scrutinize the current scaling trends and trade-offs across multiple axes and refute two common assumptions underlying the 'bigger-is-better' AI paradigm: 1) that improved performance is a product of increased scale, and 2) that all interesting problems addressed by AI require large-scale models. Rather, we argue that this approach is not only fragile scientifically, but comes with undesirable consequences. First, it is not sustainable, as its compute demands increase faster than model performance, leading to unreasonable economic requirements and a disproportionate environmental footprint. Second, it implies focusing on certain problems at the expense of others, leaving aside important applications, e.g. health, education, or the climate. Finally, it exacerbates a concentration of power, which centralizes decision-making in the hands of a few actors while threatening to disempower others in the context of shaping both AI research and its applications throughout society.
Currently this is on #arXiv, which, if you've read any of my critiques, you'll know I consider a dubious source. I'd love to see this article appear in a peer-reviewed or otherwise vetted venue, given the importance of its subject.

I've heard through the grapevine that US federal grantmaking agencies like the #NSF (National Science Foundation) are also consolidating around generative AI. The trend is evident if you follow directorates like CISE (Computer and Information Science and Engineering). A friend told me there are several NSF programs that tacitly demand that LLMs of some form be used in project proposals, even when doing so is not obviously appropriate. And a friend of a friend, a university professor, has said "if you're not doing LLMs, you're not doing machine learning".

This is an absolutely devastating mindset. While it might be true at a certain cynical, pragmatic level, it's clearly indefensible at an intellectual, scholarly, scientific, and research level. Willingly throwing away the diversity of your own discipline is bizarre, foolish, and dangerous.

#AI #GenAI #GenerativeAI #LLM #ML
Meredith Whittaker (@[email protected]) announced the paper on Mastodon:

"📣NEW paper! Don’t believe the hype: bigger AI ≠ better AI. @SashaMTL, @GaelVaroquaux and me on how the race to bigger and bigger AI has bad consequences and isn't necessary. 1. Smaller AI models often perform better than big models in context. And 2. Obsession with bigness has severe collateral consequences, from climate costs, to concentrated power, to more surveillance, to the capture of AI research. All of this, and what we can do instead 👇 https://arxiv.org/abs/2409.14160"
Speaking of machine learning, I once had a paper rejected from #ICML (International Conference on Machine Learning) in the early 2000s because it "wasn't about machine learning" (a minor paraphrase of comments in 2 of the 3 reviews, if I recall correctly). That field was consolidating--in a bad way, in my view--around a very small set of ideas even back then. My co-author and I wrote a rebuttal to the rejection, which we had the opportunity to do, arguing that our work was well within the scope of machine learning as set out by Arthur Samuel's pioneering work in the late 1950s and early 1960s--work that literally gave the field its name (Samuel 1959, "Some Studies in Machine Learning Using the Game of Checkers"). Their retort was that machine learning consisted of: learning probability distributions of data (unsupervised learning); learning discriminative or generative probabilistic models from data (supervised learning); or reinforcement learning. Nothing else. OK, maybe I'm missing one, but you get the idea.

We later expanded this work and landed it as a chapter in the 2008 book Multiobjective Problem Solving from Nature, which is downloadable from https://link.springer.com/book/10.1007/978-3-540-72964-8. The chapter starts on page 357 of the book (page 361 in the PDF's own pagination). We applied a technique from the theory of coevolutionary algorithms to examine small instances of the game of Nim, and were able to make several interesting statements about that game. Arthur Samuel's original papers on checkers were about learning by self-play, a particularly simple form of coevolutionary algorithm, as I argue in the introductory chapter of my PhD dissertation. Our technique is applicable to Samuel's work and to any other work in that class--in other words, it's squarely "machine learning" in the sense Samuel meant the term.
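
If you've never seen Nim or Samuel-style self-play, here's a minimal, purely illustrative sketch--emphatically not the chapter's coevolutionary technique, just my own toy reconstruction of the kind of learner Samuel pioneered, with all names invented for this post. It learns a tabular value function for a tiny Nim instance by playing against itself, then checks the result against exhaustive game-tree search (which, for Nim, agrees with the classic XOR rule):

```python
# Toy Samuel-style self-play on normal-play Nim (take >= 1 objects from one
# heap per turn; whoever takes the last object wins). Illustrative sketch
# only--NOT the coevolutionary technique from the book chapter.
import random
from functools import lru_cache

def successors(state):
    """All states reachable in one move: take >= 1 objects from one heap."""
    return [tuple(h - t if j == i else h for j, h in enumerate(state))
            for i, h in enumerate(state) for t in range(1, h + 1)]

@lru_cache(maxsize=None)
def is_win(state):
    """Exact game-tree search: True iff the player to move can force a win."""
    return any(not is_win(s) for s in successors(state))

def self_play(start=(1, 3, 5), episodes=50_000, eps=0.2, alpha=0.1):
    """Learn V[s], an estimate of how winnable s is for the player to move."""
    V = {}
    for _ in range(episodes):
        s, path = start, []
        while any(s):  # play one game against ourselves
            path.append(s)
            opts = successors(s)
            # Epsilon-greedy: usually pick the move that looks worst for
            # the opponent, occasionally explore at random.
            s = (random.choice(opts) if random.random() < eps
                 else min(opts, key=lambda nxt: V.get(nxt, 0.5)))
        value_of_next = 0.0  # all heaps empty: the player now to move has lost
        for s in reversed(path):  # back up values along the game just played
            target = 1.0 - value_of_next  # mover wins iff the opponent loses
            V[s] = V.get(s, 0.5) + alpha * (target - V.get(s, 0.5))
            value_of_next = V[s]
    return V

if __name__ == "__main__":
    start = (1, 3, 5)
    V = self_play(start)
    # Nim theory: the player to move wins iff the XOR of heap sizes is nonzero.
    print("search says the first player wins:", is_win(start), "(1^3^5 =", 1 ^ 3 ^ 5, ")")
    print("self-play estimate of first player's win chance:", round(V[start], 2))
```

The point of the sketch is just the shape of the thing: a program that improves by playing itself, which is exactly the lineage Samuel's checkers work established and the lineage our Nim analysis sits in.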

Whatever you may think of this particular work of mine, it's bad news when a field forgets and rejects its own historical origins and throws away the early, fruitful lines of work that led to its own birth. #GenerativeAI threatens to have a similar wilting effect on artificial intelligence, and possibly on computer science more generally. The marketplace of ideas is monopolizing; the ecosystem of ideas is collapsing. Not good.

#MachineLearning #ML #AI #ComputerScience #Coevolution #CoevolutionaryAlgorithm #Checkers #Nim #BoardGames