What is Telemetry in Kyma?

Fundamentally, "Observability" is a measure of how well the application's external outputs can reflect the internal states of single components. The insights that an application and the surrounding infrastructure expose are displayed in the form of metrics, traces, and logs - collectively, that's called "telemetry" or "signals". These can be exposed by employing modern instrumentation.

Stages of Observability

  1. In order to implement Day-2 operations for a distributed application running in a container runtime, the single components of an application must expose these signals by employing modern instrumentation.
  2. Furthermore, the signals must be collected and enriched with the infrastructural metadata in order to ship them to a target system.
  3. Instead of providing a one-size-for-all backend solution, Kyma supports you with instrumenting and shipping your telemetry data in a vendor-neutral way.
  4. This way, you can conveniently enable observability for your application by integrating it into your existing or desired backends. Pick your favorite among many observability backends, available either as a service or as a self-manageable solution, that focus on different aspects and scenarios.

Kyma's Telemetry module focuses exactly on the aspects of instrumentation, collection, and shipment that happen in the runtime and explicitly defocuses on backends.

TIP: An enterprise-grade setup demands a central solution outside the cluster, so we recommend in-cluster solutions only for testing purposes. If you want to install lightweight in-cluster backends for demo or development purposes, check the Telemetry tutorials.

Features

To support telemetry for your applications, Kyma's Telemetry module provides the following features:

  • Guidance for the instrumentation: Based on Open Telemetry, you get community samples on how to instrument your code using the Open Telemetry SDKs in nearly every programming language.
  • Tooling for collection, filtering, and shipment: Based on the Open Telemetry Collector, you can configure basic pipelines to filter and ship telemetry data.
  • Integration in a vendor-neutral way to a vendor-specific observability system: Based on the OpenTelemetry protocol (OTLP), you can integrate backend systems.
  • Opt-out from features for advanced scenarios: At any time, you can opt out for each data type, and use custom tooling to collect and ship the telemetry data.
  • SAP BTP as first-class integration: Integration into BTP Observability services is prioritized.
  • Enterprise-grade qualities - the setup is battle-tested and will satisfy typical development standards.

Initially,

  • metrics will not be supported; follow the related epic for tracking the progress of the minimal first version
  • logs will not be based on the vendor-neutral OTLP protocol; follow this epic to understand the current progress on that
  • the focus on filtering capabilities is to reduce the overall shipment bandwidth to an acceptable minimum, because telemetry data is high-volume data that is always related to a price. So filtering features focus on reducing signal data to, for example, relevant Namespaces or workloads. By default, system-related telemetry data will not be shipped. For custom scenarios, there is always the opt-out possibility.

Scope

The Telemetry module focuses only on the signals of application logs, distributed traces, and metrics. Other kinds of signals are not considered. Also, logs like audit logs or operational logs are not in scope.

Supported integration scenarios are neutral to the vendor of the target system.

Components

Components

Telemetry Manager

Kyma's Telemetry module ships Telemetry Manager as its core component. Telemetry Manager is a Kubernetes operator that implements the Kubernetes controller pattern and manages the whole lifecycle of all other components covered in the Telemetry module. Telemetry Manager watches for the user-created Kubernetes resources: LogPipeline, TracePipeline, and, in the future, MetricPipeline. In these resources, you specify what data of a signal type to collect and where to ship it. If Telemetry Manager detects a configuration, it rolls out the relevant components on demand.

For more information, see Telemetry Manager.

Log Collector

The log agent is based on a Fluent Bit installation running as a DaemonSet. It reads all containers' logs in the runtime and ships them according to a LogPipeline configuration.

For more information, see Logs.

Trace Collector

The Trace Collector is based on an OTel Collector Deployment. It provides an OTLP-based endpoint to which applications can push the trace signals. According to a TracePipeline configuration, the gateway processes and ships the trace data to a target system.

For more information, see Traces.