Address prometheus scraping jobs that are failing

I see that we have 3 failing scraping jobs (at time of writing). [Query](https://prometheus.nixos.org/graph?g0.expr=up%20%3D%3D%200&g0.tab=1&g0.display_mode=lines&g0.show_exemplars=0&g0.range_input=1h):

- `up{instance="r13y.com:443", job="r13y"}`
- `up{instance="127.0.0.1:9190", job="rfc39"}`
- `up{instance="hydra.nixos.org:9199", job="hydra_notify"}`

# r13y

[Last successful scrape](https://prometheus.nixos.org/graph?g0.expr=up%7Bjob%3D%22r13y%22%7D%20%3D%3D%201&g0.tab=0&g0.display_mode=lines&g0.show_exemplars=0&g0.range_input=1y): 2024-09-21

Added in https://github.com/nixos/infra/commit/3c4f476eb63f4bf69ed0149e8852956c34a5f683.

Looks like it's a reproducibility checker created by @grahamc: https://github.com/grahamc/r13y.com.

Perhaps this can just be removed?

# rfc39

This is trickier. Seems to be periodically up ([link](https://prometheus.nixos.org/graph?g0.expr=up%7Bjob%3D%22rfc39%22%7D%20%3D%3D%201&g0.tab=0&g0.display_mode=lines&g0.show_exemplars=0&g0.range_input=8w)):Query

![Image](https://github.com/user-attachments/assets/bffa8d67-dcd6-4b15-a2c2-b966efb819e5)

Apparently this is a known "issue", see comment here: https://github.com/nixos/infra/blob/af0ed6d10dbb3a3ec919321314506b180d1f5faf/build/pluto/prometheus/exporters/rfc39.nix#L12.

AFAICT, we don't have any alerting rules configured that react to this (just [this systemd unit state](https://github.com/nixos/infra/blob/af0ed6d10dbb3a3ec919321314506b180d1f5faf/build/pluto/prometheus/exporters/rfc39.nix#L29)). Perhaps we could just stop scraping this? Is there useful historical data in here?

# hydra_notify

[Last successful scrape](https://prometheus.nixos.org/graph?g0.expr=up%7Bjob%3D%22hydra_notify%22%7D%20%3D%3D%201&g0.tab=0&g0.display_mode=lines&g0.show_exemplars=0&g0.range_input=1y): 2024-08-02

Added in <https://github.com/nixos/infra/commit/bf95096ae0bf081f4e1fe079b76470465051174f>, also see <https://github.com/nixos/infra/commit/88abf452e6cadcc0a6f88745e33bb806f67abb6e>.

Looks like @mweinelt disabled hydra-notify here: <https://github.com/nixos/infra/commit/66da5cfddb8c67b5e2f0b5bba62899f1f82eec42>, which lines up with the last successful scrape.

Seems like we should just disable this scrape job as well.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Address prometheus scraping jobs that are failing #551

r13y

rfc39

hydra_notify

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development