Many people think of "Distributed Tracing" as being valuable only when MANY apps/services in a system participate in it.
But even if you enable it for just a SINGLE service in your system, it can save you hours...
Here is WHY (based on a real issue):
We have one service that uses an open-source SDK to achieve a part of its functionality.
One day, we saw an issue where the calls made to this SDK are taking several seconds to complete.
Our service itself is instrumented using #OpenTelemetry Tracing APIs. But how do we diagnose this problem happening in the library it uses?
(Thread 1/3)