General Compute bets on inference-focused AI infrastructure using SambaNova chips

📰 Original title: Has the hunt for AI compute uncovered the next Cerebras?

🤖 IA: It's clickbait ⚠️
👥 Users: It's clickbait ⚠️

View full AI summary https://en.killbait.com/general-compute-bets-on-inference-focused-ai-infrastructure-using-sambanova-chips.html?utm_source=mastodon_world&utm_medium=social&utm_campaign=killbait.mastodon_world

#artificialintelligence #aicompute #inference #neocloud

General Compute bets on inference-focused AI infrastructure using SambaNova chips

The article explores how the surging demand for AI compute, especially for inference workloads, is reshaping the infrastructure landscape and creating opportunities for new players. A startup called General Compute is positioning itself as an “inference neocloud,” focusing on providing optimized compute for AI models during their deployment phase rather than training. The company recently raised a $15 million seed round at a $60 million post-money valuation, led by FUSE VC with participation from Carya Venture Partners and Village Global Ventures. Instead of relying primarily on GPUs, General Compute is turning to specialized inference chips developed by SambaNova, an Intel-backed chipmaker. These chips are designed to improve performance during inference by using higher memory capacity and more efficient architectures for handling context-heavy workloads. The company claims these chips can deliver between 600 and 700 tokens per second, compared to roughly 250 tokens per second on traditional GPUs. General Compute has reportedly placed $300 million in orders for SambaNova’s SN50 chips and plans to be the first neocloud deploying them at scale. A key differentiator is infrastructure flexibility: the chips are air-cooled and consume less power, allowing deployment in existing data centers without costly upgrades. This enables colocation strategies, including partnerships with traditional data centers and even crypto mining facilities repurposing infrastructure. The broader industry context includes rising competition in AI inference, with companies like Groq and Cerebras shaping expectations for specialized hardware. The article also references major funding activity, such as OpenRouter’s $113 million Series B, highlighting a shift toward multi-model AI ecosystems where speed and cost of inference are critical. Investors see parallels between General Compute and earlier infrastructure plays like CoreWeave’s partnership with Nvidia or Groq’s vertical integration approach. The core question is whether inference-optimized architectures will become the dominant layer of AI computing as agents and real-time applications demand faster, cheaper model responses.

KillBait

General Compute bets on inference-focused AI infrastructure using SambaNova chips

📰 Original title: Has the hunt for AI compute uncovered the next Cerebras?

🤖 IA: It's clickbait ⚠️
👥 Users: It's clickbait ⚠️

View full AI summary https://en.killbait.com/general-compute-bets-on-inference-focused-ai-infrastructure-using-sambanova-chips.html?utm_source=mastodon_social&utm_medium=social&utm_campaign=killbait.mastodon_social

#artificialintelligence #aicompute #inference #neocloud

General Compute bets on inference-focused AI infrastructure using SambaNova chips

The article explores how the surging demand for AI compute, especially for inference workloads, is reshaping the infrastructure landscape and creating opportunities for new players. A startup called General Compute is positioning itself as an “inference neocloud,” focusing on providing optimized compute for AI models during their deployment phase rather than training. The company recently raised a $15 million seed round at a $60 million post-money valuation, led by FUSE VC with participation from Carya Venture Partners and Village Global Ventures. Instead of relying primarily on GPUs, General Compute is turning to specialized inference chips developed by SambaNova, an Intel-backed chipmaker. These chips are designed to improve performance during inference by using higher memory capacity and more efficient architectures for handling context-heavy workloads. The company claims these chips can deliver between 600 and 700 tokens per second, compared to roughly 250 tokens per second on traditional GPUs. General Compute has reportedly placed $300 million in orders for SambaNova’s SN50 chips and plans to be the first neocloud deploying them at scale. A key differentiator is infrastructure flexibility: the chips are air-cooled and consume less power, allowing deployment in existing data centers without costly upgrades. This enables colocation strategies, including partnerships with traditional data centers and even crypto mining facilities repurposing infrastructure. The broader industry context includes rising competition in AI inference, with companies like Groq and Cerebras shaping expectations for specialized hardware. The article also references major funding activity, such as OpenRouter’s $113 million Series B, highlighting a shift toward multi-model AI ecosystems where speed and cost of inference are critical. Investors see parallels between General Compute and earlier infrastructure plays like CoreWeave’s partnership with Nvidia or Groq’s vertical integration approach. The core question is whether inference-optimized architectures will become the dominant layer of AI computing as agents and real-time applications demand faster, cheaper model responses.

KillBait
The sustainability mirage at the heart of neocloud

Article three in our AI cloud sustainability research series examines the neocloud challengers which present a more values-driven message than their hyperscale competitors. Do the ...

xAI Partners with Anthropic to Monetize Data Center Capacity

📰 Original title: Is xAI a neocloud now?

🤖 IA: It's not clickbait ✅
👥 Usuarios: It's not clickbait ✅

View full AI summary: https://killbait.com/en/xai-partners-with-anthropic-to-monetize-data-center-capacity/?redirpost=a4c34c3a-f65c-43e7-aef6-d9c7fba96a17&utm_source=mastodon_world&utm_medium=social&utm_campaign=killbait

#artificialintelligence #xai #neocloud #datacenter

xAI Partners with Anthropic to Monetize Data Center Capacity

xAI and Anthropic have announced a surprising partnership, with Anthropic acquiring all available compute capacity at xAI’s Colossus 1 data center, totaling approximately 300MW.

KillBait Archive

xAI Partners with Anthropic to Monetize Data Center Capacity

📰 Original title: Is xAI a neocloud now?

🤖 IA: It's not clickbait ✅
👥 Usuarios: It's not clickbait ✅

View full AI summary: https://killbait.com/en/xai-partners-with-anthropic-to-monetize-data-center-capacity/?redirpost=a4c34c3a-f65c-43e7-aef6-d9c7fba96a17&utm_source=mastodon_social&utm_medium=social&utm_campaign=killbait

#artificialintelligence #xai #neocloud #datacenter

xAI Partners with Anthropic to Monetize Data Center Capacity

xAI and Anthropic have announced a surprising partnership, with Anthropic acquiring all available compute capacity at xAI’s Colossus 1 data center, totaling approximately 300MW.

KillBait Archive

Corey Sanders, Senior Vice President of Product at #neocloud provider CoreWeave, leads product strategy and execution for the company. His mission: Gain enterprises' trust for #CoreWeave's #AIcloud services. The challenge: slower-than-expected #enterpriseAI adoption so far and skyrocketing demand for #AIinfrastructure, including data center power and water resources.

In today’s episode, we’ll cover…

-- The shift from model building to #AIinference

-- The potential effect of reinforcement learning on #AIaccuracy

-- CoreWeave's new ARENA AI lab

-- NeoCloud architectures take on "#RAMageddon"

and more!

https://www.youtube.com/watch?v=eY3d5yFpKr8

IT Ops Query: CoreWeave neocloud makes AI pitch to enterprises

YouTube

Erwan Menard spent six years at #Google, where he most recently worked as director of product management for Google's Cloud #AI services. Six months ago, he left to join #CrusoeCloud as senior vice president of product management. Crusoe Cloud, which offers everything from hosted #cloudcomputing services to renewable power sources, is best known for operating the #Stargate 1.2 gigawatt #datacenter campus in Abilene, Texas.

In this interview, Menard discusses the advantages of designing #neocloud data centers from the ground up to support AI workloads, and how the industry can meet exploding demand for #datacenter resources, including power, as technology evolves.

In today’s episode, we’ll cover…

- Challenges and opportunities in #AIcloud services
- Technical differences in neocloud and public cloud networks
- Gaining enterprises' trust in neoclouds
- Keeping #AIdatacenters sustainable

and more!

https://youtube.com/watch?v=gJVN5v9CH6I&si=thGpQblR5BcrEzpI

IT Ops Query: Ex-Googler joins Crusoe Cloud, neocloud of Stargate fame

YouTube

Upstart #cloudprovider Railway says it has won thousands of paying customers from startups to Fortune 500 companies -- which is not to mention this week's $100 million Series B funding round -- with its own take on efficient hardware design and built-in automation for infrastructure that doesn't include either #GPUs or #Kubernetes.

My writeup on the #neocloud forging its own path in the age of #AI, including an interview with founder and CEO Jake Cooper: https://www.techtarget.com/searchcloudcomputing/news/366637659/Upstart-cloud-provider-Railway-turns-heads-with-speed #cloudcomputing #rackscalehardware #enterprisetech #ITinfrastructure

Upstart cloud provider Railway turns heads with speed

Railway began as a PaaS provider but now looks to disrupt hyperscalers with a bespoke back-end hardware design and built-in infrastructure best practices for developers.

TechTarget

Neocloud Economics: CoreWeave vs Nebius – Vertical AI Infra Crushes Hyperscalers (60-70% Margins) ⚡

Neoclouds own stack (chips→racks), dodge AWS debt/leasing. Nebius edges CoreWeave on costs; $10B+ ARR potential. AI training explodes demand

Why vertical? 2-3x cheaper GPUs vs cloud giants.

VCs: Next hyperscalers? Founders: Build atop. GPU wars incoming. 📈

#Neocloud #CoreWeave #Nebius #AIInfra #GPUEconomics

https://buff.ly/XJsjWqp

#Oracle’s #Cloud Infrastructure business is thriving, driven by partnerships with #ByteDance and #OpenAI: Their #hybrid infrastructure strategy, combining attributes of a traditional #hyperscaler and an #AInative #Neocloud, has proven successful. https://semianalysis.com/2025/06/30/how-oracle-is-winning-the-ai-compute-market/?eicker.news #tech #media #news
How Oracle Is Winning the AI Compute Market

Oracle’s Cloud Infrastructure business is firing on all cylinders and is greatly outpacing expectations. All eyes are on the high-profile Stargate JV and the massive Abilene, Texas datacenter…

SemiAnalysis