Inference cost at scale with napkin math

InJuly

You can't scale yourself. You can only scale the system around you.

Anything that depends on one person's memory stops the day they're out โ€” and never grows past what they can hold. Stop being the hard drive.

#Systems #Scaling #Leadership
https://apps.apple.com/us/app/decisio-ai/id6759219287

Capacity planning is becoming critical ๐Ÿ“Š

Waiting until teams are overloaded is already too late.

๐Ÿ“‰ Teams under pressure
โณ Delays increase
๐Ÿ“Š Backlogs grow

MSPs are planning ahead to maintain balance and avoid disruption ๐ŸŒ

โœ… Better workload planning
โœ… Balanced capacity
โœ… Consistent delivery

A smarter way to stay in control as you grow.

#CapacityPlanning #MSP #ITOps #Scaling

From Christmas Outage to #1 App Store Ranking: An Aura Frames Postgres Scaling Retrospective https://lobste.rs/s/emkdck #databases #scaling
https://andyatkinson.com/postgresql-rds-scaling-aws-christmas-day-peak
From Christmas Outage to #1 App Store Ranking: An Aura Frames Postgres Scaling Retrospective

0 comments

Lobsters

From Christmas Outage to #1 App Store Ranking: An Aura Frames Postgres Scaling Retrospective

https://andyatkinson.com/postgresql-rds-scaling-aws-christmas-day-peak

#Postgres #Scaling #Databases

From Christmas Outage to #1 App Store Ranking: An Aura Frames Postgres Scaling Retrospective

๐Ÿ“Œ Overview On Christmas Day 2024, Postgres infrastructure powering the Aura Frames API had problems under peak load, being unavailable for three hours and disrupting the experience for new customers. The team knew it would need improvements to handle the surge for Christmas 2025 and beyond. One year later, much of the resource intensive data access was reworked, the Postgres infrastructure was upsized, and this approach not only survived, but thrived, providing reliable service through the holiday season. The sum of Transactions Per Second (TPS) across the DBs peaked at 226,000, with more than 100K TPS sustained for 10 hours and repeating on multiple days after Christmas, with an average query time of 25 microseconds. The improved reliability meant customers could smoothly set up new frames and add photos, and they did it more than ever, with the Aura Frames app reaching #1 in U.S. and Canadian Apple and Android App Stores on Christmas Day. In this post weโ€™ll look back at the months of planning and execution that went into achieving that outcome! A second post in this series will dig into the Ruby on Rails side, while this one will focus on Postgres.

Software Engineer, Author, High Performance PostgreSQL for Rails

Local vs global hiring ๐ŸŒ

Relying only on local talent can slow down growth.

๐Ÿ“‰ Limited talent pool
โณ Slower hiring
๐Ÿ“Š Restricted capacity

Global hiring opens up new possibilities ๐ŸŒ

โœ… Faster access to talent
โœ… Quick team expansion
โœ… Better scalability

The future of growth is not limited by location.

#GlobalTalent #MSP #Scaling #ITStaffing

How We Moved Discord Voice to the Edge

0 comments

Lobsters
How We Moved Discord Voice to the Edge

Moving Discordโ€™s voice and video onto Cloudflare's edge network. Closer servers, lower ping in most regions, and a few real bugs getting there.

The rapid #advancement of #AI, driven by #exponential #scaling laws, poses a significant challenge to slow-moving political institutions. While AIโ€™s potential risks and benefits are becoming undeniable, policymakers are still catching up. This essay proposes a comprehensive approach to AI policy, focusing on #regulation, #macroeconomics, #scientificinnovation, and #geopolitics, with a particular emphasis on robust #AIsafety regulations. https://darioamodei.com/post/policy-on-the-ai-exponential?AIagents.at #AIagent #AI #ML #NLP #LLM #GenAI
Dario Amodei โ€” Policy on the AI Exponential