ZDNet: OpenAI’s GPT-5.4 mini and nano launch – with near flagship performance at much lower cost. “The latest GPT-5.4 mini model delivers benchmark results surprisingly close to the full GPT-5.4 model while running much faster, signaling a shift toward smaller AI models powering real-world applications.”

https://rbfirehose.com/2026/03/20/zdnet-openais-gpt-5-4-mini-and-nano-launch-with-near-flagship-performance-at-much-lower-cost/

ResearchBuzz: Firehose

University of Waterloo: Top AI coding tools make mistakes one in four times. “Even the most advanced models achieved only about 75 per cent accuracy in the tests, while open-source models performed closer to 65 per cent. The study evaluated 11 LLM models across 18 structured output formats and 44 tasks designed to assess how reliably the systems followed structured rules.”

https://rbfirehose.com/2026/03/20/university-of-waterloo-top-ai-coding-tools-make-mistakes-one-in-four-times/

MakeUseOf: I switched to a local LLM for these 5 tasks and the cloud version hasn’t been worth it since. “Local LLMs have also come a long way, to the point where you can run lightweight AI models on just about every device. They’re not good at everything, but they do some tasks so well you’d want to cancel that cloud AI subscription right away.”

https://rbfirehose.com/2026/03/19/makeuseof-i-switched-to-a-local-llm-for-these-5-tasks-and-the-cloud-version-hasnt-been-worth-it-since/

Spotted in my RSS feeds: CanIRun.AI. From the Why page: “CanIRun.ai runs entirely in your browser. When you visit the site, we use browser APIs to detect your GPU, CPU, and memory — then we calculate which AI models can run on your hardware and how fast. No data is sent to any server. Everything is computed client-side.”

https://rbfirehose.com/2026/03/17/canirun-ai/

MakeUseOf: I use Linux for local LLMs and everything is easier than Windows. “With the right tools and a bit of restraint, you can now run a genuinely useful ChatGPT-style setup locally on Linux Mint without turning your laptop into a space heater. I know because I just did exactly that on a Ryzen 5 machine with 8 GB of RAM and integrated graphics. Not a powerhouse, or a lab rig. Just a very […]”

https://rbfirehose.com/2026/03/15/makeuseof-i-use-linux-for-local-llms-and-everything-is-easier-than-windows/

New York Institute of Technology: Not All AI is Built to Diagnose. “The researchers provided each AI model with the same CT brain scan showing clear intracranial pathology. Then, they asked the models to analyze the image like a radiologist—identifying the imaging technique used, the location of the pathology in the brain, primary diagnosis, key features, and potential alternative diagnoses. […]”

https://rbfirehose.com/2026/03/14/new-york-institute-of-technology-not-all-ai-is-built-to-diagnose/

PetaPixel: He Tried to Stop Adobe From Training its AI on His Photo Library – He Lost. “Gerald Carter tells PetaPixel that Adobe fed every single image from Diversity Photos into its Firefly AI image model. After he protested, Adobe offered him a paltry fee for the AI training, which Carter rejected. Adobe then relied on its legal resources to successfully thwart Carter’s legal challenge […]”

https://rbfirehose.com/2026/03/12/petapixel-he-tried-to-stop-adobe-from-training-its-ai-on-his-photo-library-he-lost/

ZDNet: I tried GPT-5.4, and most answers were really good – but a few had me concerned. “Every answer I got back from GPT-5.4 Thinking was quite good in its own right. But in half my tests, the AI didn’t answer the question it was asked. You can get it to give you good responses, but you have to fairly relentlessly correct the AI to keep it on point. That gets old.”

https://rbfirehose.com/2026/03/11/zdnet-i-tried-gpt-5-4-and-most-answers-were-really-good-but-a-few-had-me-concerned/

PsyPost: New research: AI models tend to reflect the political ideologies of their creators. “A new study suggests that large language models tend to adopt the ideological perspectives of the companies and countries that build them. These findings were published in the journal npj Artificial Intelligence.”

https://rbfirehose.com/2026/03/03/new-research-ai-models-tend-to-reflect-the-political-ideologies-of-their-creators-psypost/

Ars Technica: Microsoft deletes blog telling users to train AI on pirated Harry Potter books. “Following backlash in a Hacker News thread, Microsoft deleted a blog post that critics said encouraged developers to pirate Harry Potter books to train AI models that could then be used to create AI slop.”

https://rbfirehose.com/2026/02/24/ars-technica-microsoft-deletes-blog-telling-users-to-train-ai-on-pirated-harry-potter-books/