Prometheus

Prometheus

Prometheus is a widely popular tool for monitoring and alerting a wide variety of systems. A distributed cluster offers a number of Prometheus metrics if the prometheus_client package is installed. The metrics are exposed in Prometheus’ text-based format at the /metrics endpoint on both schedulers and workers.

Available metrics

Apart from the metrics exposed per default by the prometheus_client, schedulers and workers expose a number of Dask-specific metrics. See the dask.distributed documentation for details.