Whatโ€™s going on in your OpenSearch cluster right now? ๐Ÿ‘€

Live queries in OpenSearch 3.0 let you monitor searches in real time, troubleshoot faster, and stay ahead of performance issues.

Learn more โ†’ https://opensearch.org/blog/real-time-query-monitoring-with-live-queries-in-opensearch-3-0/

#OpenSearch #RealTime #Observability

Featured today only on the @DZoneInc main page, ๐ŸŒถ๏ธ Mastering Fluent Bit: Controlling Logs With @fluentbit on Kubernetes (Part 4)! @chronosphereio #cloudnative #o11y #observability https://dzone.com/articles/controlling-logs-with-fluent-bit-kubernetes

๐Ÿฆซ ๐Ÿ Overhead for distributed tracing can be costly, but #eBPF can help reduce resource strain on CPUs, memory, and network traffic: https://t.ly/QBRYt

#Golang #Go #observability #kubernetes #tech #IT #Linux #Datadog #SigNoz #Valkey #Redis #opentelemetry #OTEL

If you've tried to get started with @grafana Loki, there's a little thing you'll have to contend with first... LogQL. Here's what LogQL is, how to ease into using it, and how to write your first queries in it. https://youtu.be/57dQwcmqkpQ
#observability #grafana #loki
How to query Grafana Loki with LogQL

YouTube

#AmazonEKSDashboard is now Generally Available!

Get unified visibility across your #Kubernetes clusters deployed in multiple AWS regions and accounts.

Learn more: https://bit.ly/45nExJU

#InfoQ #AWS #EKS #Observability

Log management in large Kubernetes environments poses challenges with thousands of pods generating vast data volumes, increasing CPU, RAM, and disk costs. Aliaksandr will discuss strategies using #Elasticsearch, #Grafana #loki and #VictoriaLogs to cut infrastructure costs by 10x or more.
This talk is essential for anyone building or running large-scale #Kubernetes clusters.

https://hubs.li/Q03rRvjC0

#Observability #DevOps #OpenSource #Logging

How to Efficiently Manage Logs in Large-Scale Kubernetes Clusters DevConf.CZ 2025

Large Kubernetes clusters can generate significant volumes of logs, especially when housing thousands of running pods. This may demand substantial CPU, RAM, disk IO, and disk space for storing and querying large log volumes. In this talk, we will look into different strategies of storing those logs in ElasticSearch, Grafana Loki and VictoriaLogs and examine how we can save 10x or more on infrastructure costs. This talk presents real-world examples of efficient log management in large-scale Kubernetes clusters. It includes setup demonstrations, numerical data, and graphical representations from reproducible benchmarks.

More from #Datadog #DASH2025 - As #AIagents mature, new #observability and #AIsecurity tools aim to bolster their reliability and security with fresh visibility into automation workflows and more detailed troubleshooting. https://www.techtarget.com/searchitoperations/news/366625992/Datadog-AI-agent-observability-security-seek-to-boost-trust
Datadog AI agent observability, security seek to boost trust

As AI agents mature, new tools aim to bolster their reliability and security with fresh visibility into automation workflows and more detailed troubleshooting.

TechTarget
2025-06-12 Freelance Inventor's Notebook
Structured a new decision-making flow (Do / Study / Defer) for investigating lab anomalies, and used it to investigate Prometheus load spikes on Pi nodes. Created a new experiment Kanban template in Obsidian. Progress feels real. Logs (Loki?) maybe next. #Freelance #Inventor #Laboratory #Observability #RaspberryPi

My obsession with #MCP servers continues. And like many of my obsessions, I love finding ways in which they intersect with other obsessions (hello, #Observability). I decided to leverage the #Dynatrace MCP server to use natural language to query #OpenTelemetry data in Dynatrace. End result? Well, youโ€™ll need to read the blog post!

https://dt-url.net/dt-mcp-medium

Querying OpenTelemetry Data with the Dynatrace MCP Server

Learn how to configure the Dynatrace MCP server to use natural language to query OpenTelemetry data in Dynatrace.

Medium

Speaking of #Datadog, I recently sat down with the company's new chief scientist, Ameet Talwalkar, to discuss its latest venture into #AIresearch. Talwalkar joined the company in February to lead its new #AI research lab, and its first project, Toto, a time-series foundational model for #observability, was just unveiled last month.

Talwalkar highlighted how specialized foundational models like Toto could enhance AI responses for #o11y tools such as Watchdog and potentially lead to cost reductions compared with general-purpose frontier models.

Our dialogue also delved into future-oriented topics, including #agenticAI, observability for AI, AI for observability, the evolution of human-computer interaction, and the persistent challenge of AI explainability.

Check out our discussion on the #ITOps Query podcast: https://www.podbean.com/ew/pb-hqa2e-18d36d7 #DASH2025

Datadog AI researcher teaches time-series models new tricks

Ameet Talwalkar joined observability vendor Datadog in February to lead a new AI research lab at the company. He is also an Associate Professor at Carnegie Mellon University. One of the first projects from the lab was released to open source in May โ€“ Toto, a foundation model trained on real-world observability data. Talwalkar discusses how Toto could improve observability forecasting tools such as Datadog's Watchdog, and does some forecasting of his own about how AI agents will alter observability and human-computer interactions.