Mastodawn

Staircase Studios AI And Made By Us Partner On New Animated Series ‘Ollie The Octopus’ Featuring Digital Creators Adam W, Hannah Stocking And Anwar Jibawi
#News #AdamW #MadeByUs #OllietheOctupus

https://deadline.com/2026/05/ollie-the-octpus-adam-w-hannah-stocking-and-anwar-jibawi-1236927962/

Staircase Studios AI And Made By Us Partner On New Animated Series ‘Ollie The Octopus’ Featuring Digital Creators Adam W, Hannah Stocking And Anwar Jibawi

Adam W, Hannah Stocking and Anwar Jibawi to star in animated series 'Ollie the Octopus' from Made By Us Studios and Staircase Studios AI.

Deadline

Reddit Tech VN Bot Nov 13, 2025

So sánh Muon và AdamW trong đào tạo mô hình AI. Muon có thể underfit trong khi AdamW overfit. Cả hai mô hình đều đạt độ chính xác cao nhưng AdamW nhỉnh hơn. #Muon #AdamW #AI #MachineLearning #ĐàoTạoMôHình #TríTuệNhânTạo #Optimization #DeepLearning

https://www.reddit.com/r/LocalLLaMA/comments/1owa4ag/muon_underfits_adamw_overfits/

IndieWire Sep 23, 2025

‘YouTube Does Not Operate That Way’: How YouTube Creators Are Schooling Hollywood
#IndieWire #Analysis #News #AdamW #DharMann #FutureofFilmmaking #InDevelopment #NealMohan #YouTube

https://www.indiewire.com/news/analysis/youtube-creators-hollywood-faster-cheaper-studio-model-1235152651/

YouTube Creators Show Hollywood a Faster, Cheaper Studio Model

At YouTube’s NFL suite, Dhar Mann, AdamW, and CEO Neal Mohan outlined a creator-driven studio system that's outpacing Hollywood.

IndieWire

Tiago F. R. Ribeiro May 12, 2025

Practical Efficiency of Muon for Pretraining

O Muon alcança o mesmo loss com 10–15% menos tokens e converge mais depressa, preservando a eficiência de dados mesmo com tamanhos de lote muito grandes. Recomenda-se como sucessor “drop-in” do AdamW em grande escala.

📎https://arxiv.org/pdf/2505.02222

#DeepLearning #Optimization #AdamW

Rana Dec 10, 2023

Cars are way too big these days https://youtube.com/shorts/VTqhvFjmp1o?si=f6nD35NwvnKQkTr3
#AdamW #AdamWaheed #Anwar #Comedy #Cars

When u never clean ur car #shorts

YouTube

Published papers at TMLR Feb 5, 2023

Understanding AdamW through Proximal Methods and Scale-Freeness

Zhenxun Zhuang, Mingrui Liu, Ashok Cutkosky, Francesco Orabona

https://openreview.net/forum?id=IKhEPWGdwK

#adamw #adam #gradients

Understanding AdamW through Proximal Methods and Scale-Freeness

Adam has been widely adopted for training deep neural networks due to less hyperparameter tuning and remarkable performance. To improve generalization, Adam is typically used in tandem with a...

OpenReview