Inference cost at scale with napkin math
https://injuly.in/blog/napkin-inference-cost/index.html
#HackerNews #inference #cost #napkin #math #scaling #machine #learning #data #science
You can't scale yourself. You can only scale the system around you.
Anything that depends on one person's memory stops the day they're out, and never grows past what they can hold. Stop being the hard drive.
#Systems #Scaling #Leadership
https://apps.apple.com/us/app/decisio-ai/id6759219287
Capacity planning is becoming critical
Waiting until teams are overloaded is already too late.
Teams under pressure
⏳ Delays increase
Backlogs grow
MSPs are planning ahead to maintain balance and avoid disruption
✅ Better workload planning
✅ Balanced capacity
✅ Consistent delivery
A smarter way to stay in control as you grow.
From Christmas Outage to #1 App Store Ranking: An Aura Frames Postgres Scaling Retrospective
https://andyatkinson.com/postgresql-rds-scaling-aws-christmas-day-peak

Overview: On Christmas Day 2024, the Postgres infrastructure powering the Aura Frames API struggled under peak load and was unavailable for three hours, disrupting the experience for new customers. The team knew it would need improvements to handle the surge for Christmas 2025 and beyond. One year later, much of the resource-intensive data access had been reworked and the Postgres infrastructure upsized, and this approach not only survived but thrived, providing reliable service through the holiday season. The sum of Transactions Per Second (TPS) across the DBs peaked at 226,000, with more than 100K TPS sustained for 10 hours and repeating on multiple days after Christmas, with an average query time of 25 microseconds. The improved reliability meant customers could smoothly set up new frames and add photos, and they did it more than ever, with the Aura Frames app reaching #1 in U.S. and Canadian Apple and Android App Stores on Christmas Day. In this post we'll look back at the months of planning and execution that went into achieving that outcome! A second post in this series will dig into the Ruby on Rails side, while this one will focus on Postgres.
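For a sense of scale, here is a rough napkin calculation using only the figures quoted above. It is a sketch, not something from the linked post: it assumes the rate held at exactly 100K TPS for the full 10 hours (the post says "more than"), so the total is a lower bound, and the names `transactions_over`, `sustained_total`, and `peak_to_sustained` are made up for illustration.

```python
# Back-of-the-envelope check using only the figures quoted in the excerpt.
SUSTAINED_TPS = 100_000   # "more than 100K TPS sustained" (treated as a floor)
SUSTAINED_HOURS = 10      # "for 10 hours"
PEAK_TPS = 226_000        # peak summed across the DBs

def transactions_over(tps: int, hours: float) -> int:
    """Total transactions at a constant rate over the given number of hours."""
    return int(tps * hours * 3600)

# Assumption: a flat 100K TPS for the whole window, so this undercounts.
sustained_total = transactions_over(SUSTAINED_TPS, SUSTAINED_HOURS)
peak_to_sustained = PEAK_TPS / SUSTAINED_TPS

print(f"{sustained_total:,} transactions in the sustained window")
print(f"peak is {peak_to_sustained:.2f}x the sustained floor")
```

Under that assumption, the 10-hour window alone comes to at least 3.6 billion transactions, with the peak a little over twice the sustained floor.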
Local vs global hiring
Relying only on local talent can slow down growth.
Limited talent pool
⏳ Slower hiring
Restricted capacity
Global hiring opens up new possibilities
✅ Faster access to talent
✅ Quick team expansion
✅ Better scalability
The future of growth is not limited by location.
How We Moved Discord Voice to the Edge
https://discord.com/blog/how-we-moved-discord-voice-to-the-edge