🤔 "AI models are like that friend who seems super reliable until you actually need them to help you move a couch. 😂 Spoiler alert: they drop it." 🛋️🔍
https://arxiv.org/abs/2510.22371 #AImodels #CouchMoving #Reliability #Humor #HackerNews #ngated
Reasoning Models Reason Well, Until They Don't

Large language models (LLMs) have shown significant progress in reasoning tasks. However, recent studies show that transformers and LLMs fail catastrophically once reasoning problems exceed modest complexity. We revisit these findings through the lens of large reasoning models (LRMs) -- LLMs fine-tuned with incentives for step-by-step argumentation and self-verification. LRM performance on graph and reasoning benchmarks such as NLGraph seems extraordinary, with some even claiming they are capable of generalized reasoning and innovation in reasoning-intensive fields such as mathematics, physics, medicine, and law. However, by more carefully scaling the complexity of reasoning problems, we show existing benchmarks actually have limited complexity. We develop a new dataset, the Deep Reasoning Dataset (DeepRD), along with a generative process for producing unlimited examples of scalable complexity. We use this dataset to evaluate model performance on graph connectivity and natural language proof planning. We find that the performance of LRMs drops abruptly at sufficient complexity and does not generalize. We also relate our LRM results to the distributions of the complexities of large, real-world knowledge graphs, interaction graphs, and proof datasets. We find the majority of real-world examples fall inside the LRMs' success regime, yet the long tails expose substantial failure potential. Our analysis highlights the near-term utility of LRMs while underscoring the need for new methods that generalize beyond the complexity of examples in the training distribution.

arXiv.org

Here’s the official one 🫣:

“Summary Of The Amazon DynamoDB Service Disruption In The Northern Virginia (US-EAST-1) Region”, AWS (https://aws.amazon.com/message/101925/).

On HN: https://news.ycombinator.com/item?id=45677139

On Lobsters: https://lobste.rs/s/mw0pus/summary_amazon_dynamodb_service

#Amazon #AWS #DynamoDB #Reliability #Outage

Summary of the Amazon DynamoDB Service Disruption in the Northern Virginia (US-EAST-1) Region

Amazon Web Services, Inc.

👇🏽 is *much better* than the official #Amazon postmortem on the recent #AWS outage 👌🏽:

“More Than DNS: The 14 Hour AWS us-east-1 outage”, Jonathon Belotti (https://thundergolfer.com/blog/aws-us-east-1-outage-oct20).

Via HN: https://news.ycombinator.com/item?id=45722471

On Lobsters: https://lobste.rs/s/gti2pe/more_than_dns_14_hour_aws_us_east_1_outage

#Reliability #RCA

More Than DNS: The 14 hour AWS us-east-1 outage

A thorough review of a major cloud outage.

Jonathon Belotti [thundergolfer]
UDP Isn't Unreliable, It's a Convertible - Proxylity Blog

Explore why UDP isn't truly unreliable and how you can build custom reliability layers that give you the best of both UDP and TCP worlds.

Proxylity Blog
Website Reliability: The Latest Generation Guide for Web Monitoring https://visualmodo.com/website-reliability-the-latest-generation-guide-for-web-monitoring/ 🖥🧑‍💻💡📸 #Reliability #Website #Guide #Monitoring
Website Reliability: The Latest Generation Guide for Web Monitoring

In this article, we'll explore website reliability and the latest generation guide and tips for web monitoring for high-quality performance

Visualmodo
🎉 #AWS is more reliable than a toddler with a crayon, until it's not! All it took was a 14-hour hiccup to turn seasoned techies into nail-biting, hotel-bound philosophers questioning cloud infallibility 🤔. Apparently, #SLAs are just good suggestions when the cloud gods decide to nap. 💥
https://thundergolfer.com/blog/aws-us-east-1-outage-oct20 #Reliability #CloudComputing #TechPhilosophy #ToddlerMetaphor #HackerNews #ngated
More Than DNS: The 14 hour AWS us-east-1 outage

A thorough review of a major cloud outage.

Jonathon Belotti [thundergolfer]

https://communitymedia.video/w/h1jdbHm8xTj5VEDuCWAbBW
#lispyGopherClimate #technology #podcast #weekly

featuring @kentpitman @ramin_hal9001 @jns
Long-term #reliability, the #climateCrisis, technology and knowledge

Kent has found the source code for his cross-referenced editing facility from the Open University #lisp #lispm example.

#scifi #books with this theme? The ending of Cat's Cradle?

On the #art side, check out @prahou's #unix_surrealism: https://photronic.art

Happy #lambdaMOO year! 35 years! https://lambda.moo.mud.org as always.

How Complex Systems Fail

📰'There’s a reason electricity prices are rising.
And it’s not data centers.' 🫢

WaPo claims that it is NOT the datacenters raising costs! Instead:

"... a new study from researchers at Lawrence Berkeley National Laboratory and the consulting group Brattle suggests that, counterintuitively, more electricity demand can actually lower prices." 🧐

Also, they say it's the "... price of transformers and wires [which have] far outpaced inflation over the past five years." 🤔

But there's nothing in the article about Amazon's plans to build datacenters... like the plan to spend $20B in Pennsylvania by taking advantage of government tax breaks. 💰💰💰 (1)

Heck, do they even mention that their paper is owned by the same folks building these datacenters?

Or do they discuss "[all] of the lingering questions about Amazon’s [plans which are creating] big concerns for lawmakers, as well as for advocates of clean energy and grid reliability." 🕵️‍♂️ (1)

It seems the article from WaPo is biased. Do you agree?

🌐
https://www.washingtonpost.com/climate-environment/2025/10/25/data-centers-electricity-prices-rise/

(archive) https://archive.ph/WwIQh

(1) https://archive.ph/o94zx

#environment #datacenters #media #news #bias #AWS #Amazon #greenwashing #tech #cleanenergy #reliability #infrastructure

There’s a reason electricity prices are rising. And it’s not data centers.

It’s not data centers or AI, it’s something else.

The Washington Post