gok

@doctorgoktor
@shac local or not, it would be good if it were easier to get access to models with confidence that your use is secure
@stroughtonsmith it seems to be hard for people to separate how impressive the technology is in principle from the particular models shown in a demo
My first parade
@caseyliss how much battery life would you have been willing to give up?
@siracusa efficiency cores are still efficiency cores, some previously performance cores are now super cores, all-new performance cores are a secret third thing
@bigzaphod is `items` objc types?
Bi-modal masked diffusion? no man you need to be tri-modalmaxxing https://arxiv.org/abs/2602.21472
The Design Space of Tri-Modal Masked Diffusion Models

Discrete diffusion models have emerged as strong alternatives to autoregressive language models, with recent work initializing and fine-tuning a base unimodal model for bimodal generation. Diverging from previous approaches, we introduce the first tri-modal masked diffusion model pretrained from scratch on text, image-text, and audio-text data. We systematically analyze multimodal scaling laws, modality mixing ratios, noise schedules, and batch-size effects, and we provide optimized inference sampling defaults. Our batch-size analysis yields a novel stochastic differential equation (SDE)-based reparameterization that eliminates the need for tuning the optimal batch size as reported in recent work. This reparameterization decouples the physical batch size, often chosen based on compute constraints (GPU saturation, FLOP efficiency, wall-clock time), from the logical batch size, chosen to balance gradient variance during stochastic optimization. Finally, we pretrain a preliminary 3B-parameter tri-modal model on 6.4T tokens, demonstrating the capabilities of a unified design and achieving strong results in text generation, text-to-image tasks, and text-to-speech tasks. Our work represents the largest-scale systematic open study of multimodal discrete diffusion models conducted to date, providing insights into scaling behaviors across multiple modalities.

@marcoarment it keeps going if you hit "cancel", it just cancels the next action you had queued up (like Run)
@carnage4life This has been widely reported but it's wrong. Gen Z scores better than their parents (Gen X). They don't do as well as Millennials.
@madcoder @shac you have to harvest them later in the year when the sun is more dimm