Mastodawn

OpenAI (@OpenAI)

OpenAI가 기업들이 AI를 구축하고 배포하도록 돕는 'OpenAI Deployment Company'를 출범했습니다. OpenAI가 과반 지분과 통제권을 가지며, 19개 투자사·컨설팅사·시스템 통합업체와 함께 프런티어 AI의 프로덕션 배포를 지원합니다.

https://x.com/OpenAI/status/2053824997777457651

#openai #deployment #aiinfra #enterpriseai #ai

OpenAI (@OpenAI) on X

Today we’re launching the OpenAI Deployment Company to help businesses build and deploy AI. It's majority-owned and controlled by OpenAI. It brings together 19 leading investment firms, consultancies, and system integrators to help organizations deploy frontier AI to production

X (formerly Twitter)

AIEONYX 3d ago

We are thrilled to Introduce to the world, the first AI-native sovereign systems language.
"AXON"
• It is a Compiler that formally verifies your declared intent at compile time
• It Targets seL4 formally verified microkernel
• It has 190M ops/sec native performance
• it has Zero cloud dependency. All local AI inference.

The Axon verify monitor.axon → ✓ classify() verified on all paths

Above all, it is truly Open source. Built in Rust.
github.com/aieonyx/AXON

#AI #Opensource #AIinfra #SeL4

Sudhanshu Shekhar May 3

The silent battle for AI dominance is fought in data centers. Compute power isn't just about speed; it's about efficient, sustainable infrastructure. Choose your cloud wisely. #AIInfra #SustainableAI #CloudComputing #AI

sayzard Apr 26

ecohash.eth (@ecohash_co)

애플 실리콘 M3 Ultra 512GB 두 대를 확보했고, 향후 M5 애플 실리콘에 대한 기대를 언급했다. 256GB 최대 구성 루머가 사실이라도, 4노드 @exolabs 클러스터에서 두 대면 충분한 성능을 낼 수 있을 것이라고 전망했다.

https://x.com/ecohash_co/status/2048474681171443888

#apple #m3ultra #m5 #server #aiinfra

ecohash.eth (@ecohash_co) on X

@arpeyton @LottoLabs I was able to get two m3 ultra 512s before they were gone. Very excited about potential for m5 Apple silicon. Even if rumors of 256GB max configs are true, a pair of those will go a long way on a 4-node @exolabs cluster

X (formerly Twitter)

AIntelligenceHub Apr 22

Google introduced TPU 8t for training and TPU 8i for inference at Cloud Next 2026. We map the practical impact on latency, utilization, and AI infrastructure budgets. https://go.aintelligencehub.com/ma-googlesplitaichipstra #GoogleCloud #AIInfra #AIAgents #DataCenter

Google Split Its New AI Chips by Job, One for Training and One for Inference

At Cloud Next 2026, Google introduced TPU 8t for training and TPU 8i for inference. The split points to a new infrastructure playbook for AI teams that need speed in model development and lower latency in production.

sayzard Apr 20

0xSero (@0xSero)

Hugging Face 제품 책임자 Victor와의 인터뷰를 통해, Hugging Face가 AI 학습과 인프라, 실제 모델 코드까지 연결해주는 핵심 플랫폼이라고 강조한다. AI 개발자에게 모델, 도구, 생태계를 한곳에서 접할 수 있는 중요한 오픈 플랫폼임을 부각한다.

https://x.com/0xSero/status/2045798559702749457

#huggingface #aiinfra #opensource #platform

0xSero (@0xSero) on X

I interviewed Huggingface's head of product Victor. Huggingface is AI's core I think this is the most important platform if you're interested in learning AI, it introduced me to everything I know about infra, and actual model code. Very grateful teams like this exist. (:

X (formerly Twitter)

Solomon Apr 19

🧠 Qwen releases Qwen3Guard: streaming and offline moderation, 3-tier severity labels, 119-language coverage. Useful for multilingual guardrails in production. solomonneas.dev/intel

#AI #ML #LLMOps #AIInfra

sayzard Apr 11

Base Camp Bernie (@basecampbernie)

동시성 에이전트를 높은 대역폭으로 서빙한 사례가 공유되며, 멀티 에이전트 추론/서빙 최적화가 인상적으로 작동하고 있음을 시사한다.

https://x.com/basecampbernie/status/2042661495864177074

#agents #serving #multitasking #aiinfra

Base Camp Bernie (@basecampbernie) on X

@AiXsatoshi Yes, concurrent agents served with that bandwidth. It is wonderful to see.

X (formerly Twitter)

Mike Watson 🇨🇦Apr 10

Tool calling quality is noisy in a way LLM text generation isn't. The difference between "works" and "explodes" is tiny, and traditional benchmarks miss it. We need tool-specific evaluation frameworks. It would almost immediately become one of the most sought-after metrics.

#AgenticAI #ToolCalling #LLM #MLevaluation #AIinfra #machineLearning #hermesAgent #openclaw #claudecode

Solomon Mar 29

Ollama v0.19.0-rc1 dropped.

New warning when local server context is below 64K tokens. If you run Ollama for agent workflows, this prerelease will surface misconfigured deployments that were silently truncating on longer tasks. Also includes VS Code path handling fixes and hides the Cline integration.

Test in non-production before upgrading anything OpenClaw-adjacent.

Source: https://github.com/ollama/ollama/releases/tag/v0.19.0-rc1

Full intel feed: solomonneas.dev/intel

#Ollama #LocalAI #DevTools #AIInfra

Release v0.19.0-rc1 · ollama/ollama

mlx: fix vision capability + min version (#15106)

GitHub