description |
---|
This tutorial goes through the steps required to run our Docker monitoring setup to collect and visualize metrics for various Boost processes |
The monitoring stack we will use includes:
- Prometheus - collects metrics and powers dashboards in Grafana
- Tempo - collects traces and powers traces search in Grafana with Jaeger
- Grafana - provides visualization tools and dashboards for all metrics and traces
Lotus and Boost are already instrumented to produce traces and stats for Prometheus to collect.
The Boost team also packages a set of Grafana dashboards that are automatically provisioned as part of this setup.
This setup has been tested on macOS and on Linux. We haven’t tested it on Windows, so YMMV.
All the monitoring stack containers run in Docker.
We have tested this setup with Docker 20.10.23 on macOS and Ubuntu.
https://docs.docker.com/engine/install/
Update extra_hosts
in docker-compose.yaml
for prometheus
, so that the Prometheus container can reach all its targets - boostd
, lotus-miner
, booster-bitswap
, booster-http
, etc.
https://github.com/filecoin-project/boost/blob/main/docker/monitoring/docker-compose.yaml#L47-L55
Depending on where your Filecoin processes (boostd
, lotus
, lotus-miner
, booster-bitswap
, etc.) are running, you need to confirm that they are reachable from Prometheus so that it can scrape their metrics.
By default the setup expects to find them within the same Docker network, so if you are running them elsewhere (i.e. on the `host` network), add the following arguments:
Confirm that Prometheus targets are scraped at http://localhost:9090 / Targets
If you are running software firewall like `ufw`, you might need to modify your iptables and allow access from the Prometheus container / network to the Filecoin stack network, for example:
sudo docker network inspect monitoring
# note the Subnet for the network
sudo ufw allow from 172.18.0.0/16
Go to Grafana at http://localhost:3333 and inspect the dashboards: