#NeurIPS paper by Merlijn Krale, Eline Bovy, Maris Galesloot, Thiago Simรฃo, and Nils Jansen: On Evaluating Policies for Robust Partially Observable Markov Decision Processes #POMDPs

https://repository.ubn.ru.nl/bitstream/handle/2066/324616/324616.pdf

#TIL about #GPTZero that has built a hallucination check tool which found 100 hallucinated references in #NeurIPS 2025 accepted papers https://gptzero.me/news/neurips/

... and NeurIPS organizers just brushed it off! (sorry for the substack link) https://stevensalzberg.substack.com/p/an-embarrassing-scandal-for-ai-research

#AI #AIslop

GPTZero finds 100 new hallucinations in NeurIPS 2025 accepted papers

GPTZero's analysis 4841 papers accepted by NeurIPS 2025 show there are at least 100 with confirmed hallucinations

AI Detection Resources | GPTZero

Googleโ€™s new PaperBanana system strings together five AI agents to automatically draft scientific diagramsโ€”right down to Matplotlibโ€‘style plots. It works well, but still drops missing icons, a reminder that AIโ€‘generated figures need human polish. See how this openโ€‘sourceโ€‘friendly approach performed at NeurIPS and what it means for academic publishing. #PaperBanana #AIAgents #DiagramGeneration #NeurIPS

๐Ÿ”— https://aidailypost.com/news/googles-paperbanana-uses-five-ai-agents-auto-generate-diagrams

You know, I thought it would be the clankers (LLMs, gAI) turning out more and more bogus information and ultimately eating their tails. But this article as about humans, under pressure to advance in hierarchical society, are turning out the slop that is clogging our knowledge bases using clankers as a tool.

#AI
#NeurIPS
#GIGO
#LLM
#enshittification
#PublishOrPerish

https://www.theguardian.com/technology/2025/dec/06/ai-research-papers

Artificial intelligence research has a slop problem, academics say: โ€˜Itโ€™s a messโ€™

AI research in question as author claims to have written over 100 papers on AI that one expert calls a โ€˜disasterโ€™

The Guardian

alphaXiv (@askalphaxiv)

NeurIPS์—์„œ ์—ด๋ฆฐ Latent Space ํŒŸ์บ์ŠคํŠธ์—์„œ @swyx์™€ ๋Œ€ํ™”ํ•œ ๋‚ด์šฉ ์š”์•ฝ. ๋ฐœ์–ธ์ž๋Š” ๋Œ“๊ธ€ ๊ธฐ๋ฐ˜์—์„œ ์‹ฌ์ธต ์—ฐ๊ตฌ์™€ ML ์ƒŒ๋“œ๋ฐ•์Šค ํ™˜๊ฒฝ์œผ๋กœ์˜ ์ง„ํ™”, ๊ทธ๋ฆฌ๊ณ  alphaXiv๋ฅผ AI ์—ฐ๊ตฌ์šฉ GitHub์œผ๋กœ ๋งŒ๋“ค๊ฒ ๋‹ค๋Š” ๋น„์ „์„ ์„ค๋ช…ํ•จ. ํŒŸ์บ์ŠคํŠธ ๋งํฌ๋กœ ์ž์„ธํ•œ ์ด์•ผ๊ธฐ์™€ ํ–ฅํ›„ ๋กœ๋“œ๋งต ํ™•์ธ ๊ฐ€๋Šฅ.

https://x.com/askalphaxiv/status/2016941098699280474

#podcast #neurips #alphaxiv #ai #research

alphaXiv (@askalphaxiv) on X

Really enjoyed chatting with @swyx โ€‹on the @latentspacepod at NeurIPS! Walked through our origin story, evolution from comments towards deep research and ML sandbox environments, and vision for the future of alphaXiv as the GitHub for AI research ๐Ÿš€ Check out the podcast below!

X (formerly Twitter)

Interesting opinion piece from andrew gelmans blog (Jessica Hullman wrote it) regarding use of LLMs in academic publication. My position is essentially the same as his:

I don't care if you use LLMs do write, but I care a lot whether you accept full responsibility for the truth in both your findings and the argumentation to get there. Using LLMs and letting their hallucinations slip through indicates you do not care about truth.

https://statmodeling.stat.columbia.edu/2026/01/26/machine-learning-research-is-not-serious-research-and-therefore-hallucinated-references-are-not-necessarily-a-big-deal-agrees-a-prestigious-group-of-machine-learning-researchers/

#neurips #ai #hallucinations #academia

Peer Review Missed Fake AI Citations in Conference Papers

#ArtificialIntelligence #AcademicPublishing #NeurIPS #ResearchIntegrity

Chinese-born AI scholars in the US are building bridges at NeurIPS, linking academic networks with peers back home. Their crossโ€‘border collaborations are boosting machineโ€‘learning breakthroughs and tech transfer, deepening USโ€‘China research ties. Discover how these networks are reshaping the field. #AIResearch #USChinaCollab #NeurIPS #CrossBorderResearch

๐Ÿ”— https://aidailypost.com/news/chinese-born-ai-scholars-us-forge-ties-deepening-us-china

mark bissell (@MarkMBissell)

2025๋…„์€ AI ์ „๋ฐ˜, ํŠนํžˆ ํ•ด์„๊ฐ€๋Šฅ์„ฑ(interp) ๋ถ„์•ผ์—์„œ ํฐ ์ „ํ™˜์˜ ํ•ด์˜€์œผ๋ฉฐ, ์ž‘์„ฑ์ž๋Š” NeurIPS์—์„œ Latent Space ํŒŸ์บ์ŠคํŠธ(@latentspacepod)์™€์˜ ๋Œ€ํ™”์—์„œ ๋ช‡ ๊ฐ€์ง€ ์ฃผ์š” ํ…Œ๋งˆ(ํ•ด์„์„ฑ ๋“ฑ)๋ฅผ ๋‹ค๋ฃฌ ์ ์„ ์–ธ๊ธ‰ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

https://x.com/MarkMBissell/status/2011147204623528433

#ai #interpretability #neurips #podcast

mark bissell (@MarkMBissell) on X

2025 was a big year for all of AI but especially for interp great chatting on @latentspacepod at neurips to touch on a few big themes

X (formerly Twitter)

Ashvin Nair (@ashvinair)

NeurIPS์—์„œ @swyx์™€ ๋งŒ๋‚˜ ๊ฐ•ํ™”ํ•™์Šต(RL)์˜ ๊ณผ๊ฑฐ์™€ ํ˜„์žฌ ์—ฐ๊ตฌ์— ๋Œ€ํ•ด ๋Œ€ํ™”ํ–ˆ๋‹ค๋Š” ๊ฐ„๋‹จํ•œ ์–ธ๊ธ‰์ž…๋‹ˆ๋‹ค. ํ–‰์‚ฌ์—์„œ์˜ RL ์—ฐ๊ตฌ ๋…ผ์˜๊ฐ€ ์žˆ์—ˆ์Œ์„ ์•Œ๋ฆฌ๋Š” ํŠธ์œ—์œผ๋กœ, ์ตœ์‹  ์—ฐ๊ตฌ ๋™ํ–ฅ์ด๋‚˜ ํ† ๋ก ์ด ์ด๋ฃจ์–ด์ง„ ์ž๋ฆฌ์˜€์Œ์„ ์‹œ์‚ฌํ•ฉ๋‹ˆ๋‹ค.

https://x.com/ashvinair/status/2009061569548959902

#reinforcementlearning #neurips #research

Ashvin Nair (@ashvinair) on X

Had a great chat with @swyx at NeurIPS about RL research past and present https://t.co/HJWZT5XSur

X (formerly Twitter)