Commit Graph

63 Commits (d72b6a3fda9c2d76abff18041fd5278b42af8ee3)

Author SHA1 Message Date
Sven Ketelsen 42d8398349 DEV-664 bugfix use server specific domain 3 years ago
Görz, Friedrich fe97fbbab5 Bug/dev 659 pgdatadir nospace 3 years ago
friedrich goerz e23813f9d1 NOTICKET: but metrics missing since Nov2021 - needs to be fixed ;) 3 years ago
Michael Hähnel 87a286dd60 DEV-624 New alert for failed db backups 3 years ago
Ketelsen, Sven db57bcb7ca DEV-579 add basic auth to prometheus stack 3 years ago
Görz, Friedrich 24e5cbf3d9 DEV-616: increased vol_count to mitigate disk size problem 3 years ago
Hoan To 98c5f39c85 DEV-579: added prometheus basic auth 3 years ago
Ketelsen, Sven f47c5dc345 DEV-578 investigation for hetzner api rate limits 3 years ago
Ketelsen, Sven ac7285bbcf DEV-572: alertmanager metrics 3 years ago
friedrich goerz 659943ccc5 DEV-563: bugfixed hetzner rate limit alert 3 years ago
Ketelsen, Sven 35dbd3cad1 DEV-569: extended stage overview dashboard 3 years ago
friedrich goerz 9e6f28c62a DEV-563: added hetzner dashboard + svennes dashboard + refactoring alert for hetzner_api_rate_limit 3 years ago
Görz, Friedrich 01c972771b Rollout main=>qa 13.09.2022 3 years ago
friedrich goerz 5367c9929e DEV-539: increased timerange; bugfixed broken silencing for patchday 3 years ago
Görz, Friedrich ffb3aa2122 DEV-543: integrated DO-blackbox VM into DEV-patchday + increased threshold for... 3 years ago
Hoan To a0ff9a5d8e added elasticsearch health check rule 3 years ago
friedrich goerz 1558548682 DEV-517: added alerting for DO API usage 3 years ago
Görz, Friedrich 1c5b1c44dd DEV-391: fix merge problems + fixing linter problems 4 years ago
Görz, Friedrich 6c6dd5c1ae DEV-442: added threshold for pg_repl_lag to avoid false positives on DEV-stage 4 years ago
Michael Hähnel ff9c0d94a1 Extended Monitoring/Alerting for PostgreSQL 4 years ago
friedrich goerz 8c8722851f DEV-386: added alert to get notification in case of ssh root login 4 years ago
Görz, Friedrich f0eab6d3ae DEv-421: refactored installation for postgres-exporter + installed newer... 4 years ago
Görz, Friedrich a2fa12ef40 DEV-396: changed diskspace alert from predictive to alert of current usage 4 years ago
friedrich goerz a834b13ded DEV-378: increased allowed pending time for some alerts 4 years ago
Görz, Friedrich ea2ef949c9 DEV-360: rollout k8s on prodnso 4 years ago
friedrich goerz 46e021d22c DEV-327: added several stuff for new prodnso-stage + bugfixing and improving other stuff 4 years ago
Sven Ketelsen d314e164c7 bugfix: disabled blackbox exporter for connect management
- current config didn't works with 302 to login page
4 years ago
Sven Ketelsen df0e320743 bugfix: fixed connect url for blackbox exporter 4 years ago
Sven Ketelsen 43a4dccc3f chore: removed unnecessary ip lookup 4 years ago
Görz, Friedrich 9f9a192432 DEV-269: added stuff to federate k8s-internal prometheus metrics 4 years ago
Görz, Friedrich 5bdff07d1b DEV-253: digitalocean stuff - add droplet but not idempotentgit branch git branch plz check 4 years ago
Sven Ketelsen d780336dad bugfix: wrong port for postgres exporter
- monitor_port_system > monitor_port_postgres
4 years ago
friedrich goerz 3766911cc5 DEV-241: added monitoring stuff for redis 4 years ago
Sven Ketelsen bd13643e30 feat: prometheus now uses stage_server_infos (auto discover task) 4 years ago
Sven Ketelsen da646bf4bd chore: removed duplications between iam/gitea
- deploying is now done by shared role
- only configuration needed by iam/gitea role
4 years ago
Sven Ketelsen 8e88f4bf3d feat: added monitoring for gitea 4 years ago
friedrich goerz ac3136a441 DEV-228: changed promql-query to reduce noise in alerting MhimBHsteams-channels 4 years ago
Sven Ketelsen bb62199bcd bugfix: set repeat_interval for alerts to 6h 4 years ago
Sven Ketelsen 335e3bb9dd chore: cors for swagger on connect/iam 4 years ago
Gordon, Alexander c0cd50339c DEV-163: feat: keycloak prometheus integration 4 years ago
Sven Ketelsen eec580d2dc chore: increased pg_stat_database_numbackends from 21 to 30 4 years ago
Sven Ketelsen 9e3af9b1a8 chore: increased default max connection for pg from 20 to 21 4 years ago
Peter Heise c86ccc48aa Added postgres exporter + dashboard. 4 years ago
Peter Heise 7c0f9c597b Added mysql/maria-exporter + dashboard. 4 years ago
Sven Ketelsen ad861db16e SMARCH-92: split elastic stack services for qa
- elasticsearch
- logstash
- kibana
4 years ago
Peter Heise 1bfcac5646 Removed container node-exporter, added system node-exporter, optimized aotidiscover pre-tasks. 4 years ago
Sven Ketelsen b6cdd8528b bugfix: prometheus scrape config
- skip traefik scraping when traefik_enabled is false
- skip node_exporter scraping when node_exporter_enabled is false
4 years ago
Sven Ketelsen a8b60e9069 chore: teams alerting hook can now be stage specific
- added var netgo_msteams_hook_alerting (DEV)
4 years ago
Alexander Gordon a966d90020 Added MSTeams Alerts for Prometheus 4 years ago
Sven Ketelsen d7704681ee bugifx: awx polling configuration produces wrong instance
- <url>:80 -> <url>
4 years ago