https://www.linkedin.com/pulse/demand-meets-design-aligning-supply-real-capacity-sanjay-k-mohindroo--gy2ec
Grow or Die
For its whole life, Amazon faced a classic capacity planning problem that everyone has. You need to have many machines at the busy time of the year, but they will lie idle the rest of the year.
While growing rapidly, your answer is easy: arrange to buy the needed machines as late as possible, so they came on-line just before Black Friday. The rest of the year they don’t exist, so they don’t cost you anything.
Something changed
But what if you’re not growing at 30% a year? Now you do have idle machines, for 11 out of 12 months a year.
You could arrange to start growing again. Amazon did just that, when they went from running a few data centres just for themselves and started AWS, Amazon web services. Now they were getting business from tons of new customers, and jumped back to their previous solution.
In it, you add as many machines as you can afford, until you are up to the predicted number you needed for the next “Black Friday”. You’re growing like crazy again. You can buy machines and entire data centres again … until business starts to flatten out.
What now?
You’re back to not having to buy extra every year, but you still have to pay for running everything you bought. And your profits have to keep paying back the capital investment you made when you bought those machines.
For a little while you can just coast, profit-taking on your investments.
But that won’t last. Your smaller competitors are still growing as fast as they can get customers., including by taking them from you. When that stops, you’re all in the same boat. Microsoft, Google, Alibaba and all the little guys.
Choose any Two
There are three classic approaches to staying alive in business: maximize growth, maximize revenue, or minimize cost. You did the first, now you need to do the others.
While you’re doing that, let’s pretend you’re Dr. Evil so your solutions don’t have to be ethical (:-))
Start with cost minimization: extend the effective life of your servers. That works because Moore’s Law is coming to an end, and clock speeds have stagnated. That in turn means a three-year-old machine is no longer hugely slower that a new machine. You can keep using your old machines without a cost-benefit crash. Amazon did just that, extending the life of its servers to six years. They claimed a $900m benefit in the first 90 days [Cantrill, 2024].
Then take advantage of some of the smaller advantages:
Finally, keep creeping the price upwards. Your cost may be slowly decreasing, but you should always be increasing your prices, now you’ve locked in your customers.
Does this Sound Familiar?
Yup!
Cory Doctorow defines enshittification as
That’s a good tactic for Dr. Evil. But maybe not for everyone.
What does this do to me?
If you’re in the same pickle as Amazon, leave the Dr. Evil solutions out, and see how many you can apply to your version of the problem.
If your year-end business isn’t hugely greater that your regular business, the capacity planning problem is easier. A customer I know was once was in just that situation. They were able to use the longer life of recent servers and the price advantage of large purchases to solve their problem.
–dave
[Cantrill 2024] Bryan Cantrill, Moore’s Scofflaws, Oxide blog, https://oxide.computer/blog/moores-scofflaws
[Wright 2025] Wikipedia, Experience curve effects, https://en.wikipedia.org/wiki/Experience_curve_effects
Does any one use asana for project and team management?
Need to find a tool to propose to our bosses to manage our projects (long term planning / forecast), and the workload of the team (forecasts based on the project but also weekly planning overview)
New blog post:
Risk analysis is a crucial component of resource forecasting for datacenters. By identifying and evaluating potential risks, datacenter managers can ensure that their operations are prepared to handle any potential challenges that may arise, allowing them to deliver reliable, high-quality service to their customers. This blog post explores some types of risks that affect resource demand forecasting in a datacenter.
Risk analysis is a crucial component of resource forecasting for datacenters. By identifying and evaluating potential risks, datacenter managers can ensure that their operations are prepared to handle any potential challenges that may arise, allowing them to deliver reliable, high-quality service to their customers. In this blog post, we will explore the different types of risks that can affect resource forecasting in a datacenter. We will also discuss how to use risk analysis to inform resource allocation and capacity planning decisions.
An excellent reading about reliability, automation, and a bit of capacity planning at #Twitter Article tries to answer the question "Why Twitter didn't go down".
https://matthewtejo.substack.com/p/why-twitter-didnt-go-down-from-a #reliability #sre #automation #CapacityPlanning
Twitter supposedly lost around 80% of its work force. What ever the real number is, there are whole teams with out engineers on it now. Yet, the website goes on and the tweets keep coming. This left a lot wondering what exactly was going on with all those engineers and made it seem like it was all just bloat. I’d like to explain my little corner of Twitter (though it wasn’t