SESAR Lab - Università degli Studi di Milano
20/05/2025
Link to the slides
Agenda
Metrics: time-series measurements of system aspects
Numerical: e.g. CPU and RAM usage, available storage
Categorical: e.g. system on/off, pipeline execution stages
Metrics provide objective and structured measurements of a system’s behavior
Common formats: Prometheus metrics, JSON
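As an illustration of the Prometheus format, the sketch below renders one numerical and one categorical metric in the Prometheus text exposition format. The helper `render_gauge`, the metric names and the `host` label are hypothetical and chosen only for this example. Label-value escaping is deliberately omitted.

```python
def render_gauge(name, help_text, samples):
    """Render one gauge in the Prometheus text exposition format.

    `samples` is a list of (labels_dict, value) pairs.
    Hypothetical helper: label values are not escaped here.
    """
    lines = [f"# HELP {name} {help_text}", f"# TYPE {name} gauge"]
    for labels, value in samples:
        label_str = ",".join(f'{k}="{v}"' for k, v in sorted(labels.items()))
        series = f"{name}{{{label_str}}}" if label_str else name
        lines.append(f"{series} {value}")
    return "\n".join(lines) + "\n"


# Numerical metric: fraction of CPU in use (metric name is an assumption).
cpu = render_gauge(
    "cpu_usage_ratio",
    "Fraction of CPU time in use.",
    [({"host": "node-1"}, 0.42)],
)

# Categorical metric (system on/off) encoded as 1/0.
up = render_gauge(
    "system_up",
    "1 if the system is on, 0 otherwise.",
    [({"host": "node-1"}, 1)],
)

print(cpu + up, end="")
```

Encoding an on/off state as 1/0 is a common way to fit a categorical value into a numeric time series.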
Logs: text records describing services’ operation
Organized by level of detail:
ERROR, WARNING, INFO, DEBUG, TRACE
Structured logs also include contextual information
e.g., variable values, timestamps, source code
Common formats: Logfmt, JSON, GELF
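A minimal sketch of structured logging with Python's standard `logging` module, emitting one JSON object per record plus a Logfmt rendering of the same kind of fields. The class `JsonFormatter`, the helper `to_logfmt`, the `context` attribute and the field names are assumptions made for this example, not a standard schema.

```python
import json
import logging


class JsonFormatter(logging.Formatter):
    """Format each record as one JSON object per line (hypothetical schema)."""

    def format(self, record):
        payload = {
            "ts": self.formatTime(record),
            "level": record.levelname,
            "logger": record.name,
            "msg": record.getMessage(),
            "file": record.filename,
            "line": record.lineno,
        }
        # Contextual information passed via `extra={"context": {...}}`.
        payload.update(getattr(record, "context", {}))
        return json.dumps(payload)


def to_logfmt(fields):
    """Render a dict as Logfmt: space-separated key=value pairs."""
    parts = []
    for key, value in fields.items():
        text = str(value)
        # Quote values that contain spaces, quotes or '='.
        if " " in text or '"' in text or "=" in text:
            text = '"' + text.replace('"', '\\"') + '"'
        parts.append(f"{key}={text}")
    return " ".join(parts)


handler = logging.StreamHandler()
handler.setFormatter(JsonFormatter())
log = logging.getLogger("pipeline")
log.addHandler(handler)
log.setLevel(logging.DEBUG)

log.info("stage finished", extra={"context": {"stage": "ingest", "rows": 1200}})
print(to_logfmt({"level": "INFO", "msg": "stage finished", "stage": "ingest"}))
```

Keeping the context in separate fields, rather than interpolating it into the message, is what lets a log store filter on it later.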
Traces record the stack of execution
Spans record contexts of execution
Spans can be nested to instrument sections of code or functions
Common formats: OpenTelemetry, Jaeger, Zipkin
Example: logging and spans
Example: tracing
All monitoring events can have associated metadata
For scalability we store monitoring events in append-only logs
Event type | Service |
---|---|
Metrics | Mimir |
Logs | Loki |
Traces | Tempo |
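The idea of attaching metadata to events and keeping them in an append-only log can be sketched with a JSON Lines file. This is only an illustration: it does not reflect how Mimir, Loki or Tempo store data internally, and the class `AppendOnlyLog`, the event layout and the metadata keys are assumptions.

```python
import json
import os
import tempfile


class AppendOnlyLog:
    """Events are only ever appended; existing lines are never rewritten."""

    def __init__(self, path):
        self.path = path

    def append(self, kind, body, **metadata):
        event = {"kind": kind, "body": body, "metadata": metadata}
        # Mode "a" always writes at the end of the file.
        with open(self.path, "a", encoding="utf-8") as f:
            f.write(json.dumps(event) + "\n")

    def read(self, **match):
        """Yield events whose metadata contains every key/value in `match`."""
        if not os.path.exists(self.path):
            return
        with open(self.path, encoding="utf-8") as f:
            for line in f:
                event = json.loads(line)
                if all(event["metadata"].get(k) == v for k, v in match.items()):
                    yield event


path = os.path.join(tempfile.mkdtemp(), "events.jsonl")
store = AppendOnlyLog(path)
store.append("metric", {"cpu_usage_ratio": 0.42}, service="api", host="node-1")
store.append("log", {"level": "INFO", "msg": "started"}, service="api", host="node-2")
store.append("log", {"level": "ERROR", "msg": "timeout"}, service="worker", host="node-1")

# Metadata lets us select events across types for one service.
api_events = list(store.read(service="api"))
print(len(api_events))
```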
Events are:
Grafana Mimir’s architecture