Mastodawn

AI at the edge is an infrastructure puzzle. Red Hat is helping solve it by contributing llm-d to the #CNCF, establishing "well-lit paths" for AI-RAN orchestration with SoftBank. 🐧

This is about optimization—making inference a first-class citizen alongside traditional containers.

Proud to see Red Hat continuing our legacy of open-source leadership, from #Kubernetes and #etcd to #KEDA and now #llmd.

#RedHat #AI #OpenSource #KubeCon #CloudNative

How llm-d brings critical resource optimization with SoftBank’s AI-RAN orchestrator

In Red Hat’s latest collaboration with SoftBank Corp., we have integrated llm-d into SoftBank’s AI-RAN orchestrator, AITRAS.

Chuck Mattern Mar 26

Red Hat is contributing llm-d to the #CNCF, turning fragmented AI into modular, interoperable microservices. 🐧

The goal? Make AI inference a first-class citizen in the same cloud-native environment as your traditional apps.

I love how Red Hat continues to fuel the #OpenSource ecosystem. From our roots in #Kubernetes and #etcd to newer projects like #KEDA and #CRI-O, we’re committed to building "well-lit paths" for everyone.

#RedHat #KubeCon #CloudNativeCon #AI #llmd

https://www.redhat.com/en/blog/why-were-contributing-llm-d-cncf-standardizing-future-ai?sc_cid=701f2000000txokAAA&utm_source=bambu&utm_medium=organic_social

Why we’re contributing llm-d to the CNCF: Standardizing the future of AI

Red Hat is contributing llm-d to the Cloud Native Computing Foundation (CNCF) as a Sandbox project to standardize high-performance, distributed AI inference serving within the cloud-native stack. This contribution aims to bridge the capabilities gap between AI experimentation and production by providing a specialized data-plane orchestration layer that maximizes infrastructure efficiency and enables flexible deployment on any choice of hardware.

Rost Glukhov Mar 14

Learn the critical failure points when running LLM inference on Kubernetes, including resource constraints, operator compatibility, security, scalability, and monitoring best practices for production workloads.

#Kubernetes #LLM Inference #Dynatrace #GPU Resource Allocation #Service Mesh #Network Policies #KEDA #Triton Inference Server #Redis #Prometheus

https://dasroot.net/posts/2026/02/running-llm-inference-on-kubernetes-what-breaks-first/

Running LLM Inference on Kubernetes: What Breaks First

Technical news about AI, coding and all

ragingHungryPanda Jan 30

Blog - I had a new bit of learning with Kubernetes Event Driven Autoscaling today

Changing Piefed Worker Scaling to be Based on Queue Size in Kubernetes with KEDA
I recently caused myself a bit of a minor issue by installing some updates on the Keyboard Vagabond cluster. It wasn’t a big deal, just s…

Changing Piefed Worker Scaling to be Based on Queue Size in Kubernetes with KEDA

I recently caused myself a bit of a minor issue by installing some updates on the Keyboard Vagabond cluster. It wasn't a big deal, just s...

Software and Tech

ragingHungryPanda Jan 30

Blog - I had a new bit of learning with Kubernetes Event Driven Autoscaling today

https://piefed.keyboardvagabond.com/c/programming/p/226531/blog-i-had-a-new-bit-of-learning-with-kubernetes-event-driver-autoscaling

Blog - I had a new bit of learning with Kubernetes Event Driven Autoscaling today

_Changing Piefed Worker Scaling to be Based on Queue Size in Kubernetes with KEDA_ I recently caused myself a bit of a minor issue by installing…

Habr Dec 7

Реальный кейс настройки Pod Autoscaling в k8s с точки зрения разработчика

На носу 2026 год, а я хочу поделиться своим путешествием по переводу приложения на инфраструктуру Kubernetes. И самая сложная и интересная часть, как раз, настройка автоскейлинга. Не слишком ли заезженная тема? Думаю нет, потому что я буду рассказывать именно с позиции разработчика приложения, а не девопса. Мне повезло, я без понятия как это всё настраивается. Я буду рассказывать как это всё работает. Конфигов кубера будет минимум, рассуждений и погружений в метрики максимум. В конце оставил TL;DR. Поехали?

https://habr.com/ru/articles/973936/

#kubernetes #hpa #horizontal_pod_autoscaler #keda #ec2 #cadvisor #k8s

Реальный кейс настройки Pod Autoscaling в k8s с точки зрения разработчика

Что я знаю о бриллиантах? Я устраиваю боксерские бои. Всего неделю назад я устраивал боксерские бои и радовался жизни, и вдруг... Что я знаю о бриллиантах? На носу 2026 год, а я хочу поделиться своим...

Хабр

Dotan Horovits #CNCFAmbassador Sep 29, 2025

From event-driven architectures to autoscaling, from #cloudnative #microservices to agentic AI, from corporate to #opensource and startups - the latest episode of OpenObservability Talks has it all!

I invited co-creator of #Dapr & #KEDA @yaronschneider to give us us the grand tour:
https://medium.com/p/eb2f4013d9a1

Habr Sep 18, 2025

Автомасштабируем узлы кластера Kubernetes. Часть 2

Всем привет! Это вновь Илья Смирнов, архитектор решений из

https://habr.com/ru/companies/cloud_ru/articles/948140/

#keda #мультиклауд #масштабирование #eventdriven

Автомасштабируем узлы кластера Kubernetes. Часть 2

Всем привет! Это вновь Илья Смирнов, архитектор решений из Cloud.ru . В прошлой статье мы рассмотрели традиционные подходы к масштабированию подов и узлов кластера Kubernetes. Но остался нерешенным...

Хабр

Show thread

Cees-Jan Kiewiet

Sep 15, 2025

Hah score! Managed to push data from #HomeAssistant to #MQTT using the MQTT Publish action in HA, then https://github.com/hikhvar/mqtt2prometheus picks it up and services it up to #Prometheus for use in queries. Will need to switch my #Keda scale object from a #RabbitMQ one to a Prometheus query. But will first let this metric sit there for a few days to make sure it behaves as expected.

Habr Aug 29, 2025

Автомасштабируем узлы кластера Kubernetes. Часть 1

Автомасштабирование узлов кластера Kubernetes и горизонтальное масштабирование подов позволяют быстро расширить ресурсы при пиковых нагрузках. Но сложные приложения могут не нагружать поды или узлы максимально, но требовать дополнительных ресурсов, например, для параллельной обработки нескольких объектов в очереди. Триггером масштабирования кластера может быть не утилизация, а события от внешних систем — например, очереди сообщений Kafka, системы мониторинга Prometheus или от платформы CI/CD. Всем привет! Меня зовут Илья Смирнов, я архитектор решений в

https://habr.com/ru/companies/cloud_ru/articles/941976/

#keda #k8s #kubernetes #автомасштабирование #managed_kubernetes

Автомасштабируем узлы кластера Kubernetes. Часть 1

Хабр