Mastodawn

Arvind Narayanan Mar 29, 2023

The AI moratorium letter only fuels AI hype. It repeatedly presents speculative, futuristic risks, ignoring the version of the problems that are already harming people. It distracts from the real issues and makes it harder to address them. The letter has a containment mindset analogous to nuclear risk, but that’s a poor fit for AI. It plays right into the hands of the companies it seeks to regulate. By @sayashk and me. https://aisnakeoil.substack.com/p/a-misleading-open-letter-about-sci

A misleading open letter about sci-fi AI dangers ignores the real risks

Misinformation, labor impact, and safety are all risks. But not in the way the letter implies.

AI Snake Oil

Show thread

Arvind Narayanan Mar 21, 2023

Instead of standalone benchmark exams, we should study how well language models can do any of the real-world tasks that professionals must do. But it's not human vs bot. Study professionals doing their jobs with the help of AI tools — ideally qualitatively and not just quantitatively. https://aisnakeoil.substack.com/p/gpt-4-and-professional-benchmarks

GPT-4 and professional benchmarks: the wrong answer to the wrong question

OpenAI may have tested on the training data. Besides, human benchmarks are meaningless for bots.

AI Snake Oil

Show thread

Arvind Narayanan Feb 21, 2023

We looked at > 100 papers on flaws of specific predictive optimization applications. Seven recurring clusters emerged. Given the technical architecture of predictive optimization (vs other forms of automated decision making) we show that this is exactly what we should expect. That made us ask: does *every* predictive optimization application have these flaws? We picked 8 case studies based on prevalence and impact on people. Among these, the answer seems to be yes. https://predictive-optimization.cs.princeton.edu/

Against Predictive Optimization

Show thread

Arvind Narayanan Feb 21, 2023

The key move in our paper is defining the category of predictive optimization and showing what's wrong with it. Contesting these applications one by one (criminal risk prediction, resume screening, ...) leave us playing whack-a-mole as they proliferate (we counted about 50, probably missed many more). On the other hand, critiques that apply to *all* automated decision making are too broad to effectively challenge any particular system. We offer a middle ground. https://predictive-optimization.cs.princeton.edu/

Against Predictive Optimization

Show thread

Arvind Narayanan Jan 5, 2023

Here's a nice article explaining why there are many ways of measuring the size of Mastodon and why there's no single right way to measure it. https://absolutelymaybe.plos.org/2022/12/05/mastodon-growth-numbers-might-not-mean-what-you-think-they-mean/

The main reason why the total user count of 6 million in my chart is different from the 9 million often reported by sources such as @mastodonusercount is that the latter includes the part of Mastodon that is blocked by most instances for hate speech and other such content.

Mastodon Growth Numbers Might Not Mean What You Think They Mean - Absolutely Maybe

Mastodon’s growth in the last month has been extraordinarily fast – but just how fast? Did the number of users jump up…

Absolutely Maybe

Arvind Narayanan Jan 4, 2023

The number of active Mastodon users — those who've logged in in the last 30 days — has fallen to 1.8 million from a high of 2.6 million. This reflects the fact that while there's a wave of new users after each Musk tantrum, the majority aren't sticking around. Even the 1.8M figure is inflated by new accounts, so I expect a further drop.

Still, the number of active users is over 4 times what it used to be before the Twitter takeover.

Source: https://api.joinmastodon.org/statistics (and Wayback machine).

Show thread

Arvind Narayanan Dec 22, 2022

Here's a fascinating chart showing Mastodon adoption among a few scientific communities by @TrueSciPhi. https://mastodon.truesciphi.org/@TrueSciPhi/109554915418774658

I'd love to see charts like this for other communities.

One thing the chart shows is that Dear Leader's recent decree about Mastodon had a small but measurable effect: 5-10% of those who had their Mastodon info in their bios removed it.

Kelly Truelove (@[email protected])

Attached: 1 image Mastodon lists based on Twitter lists: Astronomers: https://truesciphi.org/ast_mas.html Physicists: https://truesciphi.org/phy_mas.html Professional Science Writers: https://truesciphi.org/psw_mas.html Mathematicians: https://truesciphi.org/mat_mas.html Philosophers: https://truesciphi.org/phi_mas.html

TrueSciPhi Mastodon

Arvind Narayanan Nov 22, 2022

This is an amazing chart https://sigmoid.social/@hal/109389647341556477

It shows that the Lake Wobegon effect shows up even in a population that's presumably fluent in statistics (NeurIPS authors). When submitting papers, authors wildly overestimate their probability of acceptance — the median prediction was 70% — even after being told that the base rate in prior years was 21%.

This is the opposite of the Dunning-Kruger effect: those with *lower* acceptance rates are well calibrated!

Hal Daumé III (@[email protected])

Attached: 1 image New blog post on the NeurIPS'21 experiment re authors' perceptions of their own papers! https://blog.ml.cmu.edu/2022/11/22/neurips2021-author-perception-experiment/ Key findings: 1) Authors significantly overestimate their papers' chances of acceptance. By like a LOT. >

Sigmoid Social

Show thread

Arvind Narayanan Nov 16, 2022

Princeton is a gorgeous small town to live in, but many of us also live in NYC, Philadelphia, or other nearby areas. I'd be happy to answer any questions about any of our positions.

Here's our website: https://citp.princeton.edu/

Home - Center for Information Technology Policy

Center for Information Technology Policy

Website	https://www.cs.princeton.edu/~arvindn/
Substack: AI Snake Oil	https://aisnakeoil.com/
Book: Fairness and machine learning	https://fairmlbook.org/