Observability in the CLOUD

Imagine a scenario where we need to monitor each customer with a single stack.

The four Golden Signals:

Latency
Traffic
Errors
Saturation
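As a rough illustration of what each signal measures, the sketch below summarises the four signals over one observation window. The `Request` record, the `golden_signals` helper, and the choice of the 95th percentile and CPU usage as the latency and saturation measures are all assumptions made for this example. In the stack described here, these values would normally come from Prometheus queries rather than hand-written code.

```python
import math
from dataclasses import dataclass


@dataclass
class Request:
    """One observed request (hypothetical record shape for this sketch)."""
    duration_ms: float
    ok: bool


def percentile(sorted_values, pct):
    """Nearest-rank percentile of an already sorted, non-empty list."""
    rank = max(1, math.ceil(pct / 100 * len(sorted_values)))
    return sorted_values[rank - 1]


def golden_signals(requests, window_s, cpu_used, cpu_total):
    """Summarise the four golden signals over one observation window."""
    durations = sorted(r.duration_ms for r in requests)
    failures = sum(1 for r in requests if not r.ok)
    return {
        # Latency: how long requests take (95th percentile assumed here).
        "latency_p95_ms": percentile(durations, 95) if durations else None,
        # Traffic: demand on the system, in requests per second.
        "traffic_rps": len(requests) / window_s,
        # Errors: fraction of requests that failed.
        "error_rate": failures / len(requests) if requests else 0.0,
        # Saturation: how full the most constrained resource is (CPU assumed).
        "saturation": cpu_used / cpu_total,
    }


if __name__ == "__main__":
    sample = [Request(duration_ms=10.0 * i, ok=i > 2) for i in range(1, 11)]
    print(golden_signals(sample, window_s=5, cpu_used=3, cpu_total=4))
```

Latency is summarised with a high percentile rather than a mean so that a few slow requests are not hidden by many fast ones.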

Why collect these signals?

Alerting: notify us when something is wrong
Troubleshooting: help us isolate and fix the problem
Tuning/Capacity Planning: help us improve our setup over time

Why monitor?

Which tools?

In our case, we will run the following components on a Docker compute instance:

Thanos for long-term retention (backed by an Azure Storage account)
Prometheus to orchestrate the monitoring
AlertManager for alerting
Grafana to display metrics and logs
Loki to parse our logs
Telegraf for self-monitoring
Nginx to secure and expose our stack
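To check that the alerting path works end to end, one option is to push a manual test alert to AlertManager through its v2 HTTP API (`POST /api/v2/alerts`). The sketch below is one possible way to do that, not necessarily how this repository does it. The helper names, the label values, and the base URL `http://localhost:9093` are assumptions. Behind Nginx the address, and possibly the authentication, will differ.

```python
import json
import urllib.request
from datetime import datetime, timedelta, timezone


def build_test_alert(name, severity, summary, duration_min=5, now=None):
    """Build a one-alert payload in the shape expected by /api/v2/alerts."""
    now = now or datetime.now(timezone.utc)
    return [{
        "labels": {"alertname": name, "severity": severity},
        "annotations": {"summary": summary},
        # Timestamps are RFC 3339 strings; the alert resolves at endsAt.
        "startsAt": now.isoformat(),
        "endsAt": (now + timedelta(minutes=duration_min)).isoformat(),
    }]


def send_alert(payload, base_url="http://localhost:9093"):
    """POST the payload to AlertManager (base_url is an assumed address)."""
    request = urllib.request.Request(
        base_url + "/api/v2/alerts",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=5) as response:
        return response.status


if __name__ == "__main__":
    alert = build_test_alert("TestAlert", "warning", "Manual test alert")
    # Printed rather than sent, so the sketch runs without a live stack.
    print(json.dumps(alert, indent=2))
```

Once the stack is up, calling `send_alert(alert)` should make the test alert appear in the AlertManager UI and follow whatever routing is configured there.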
