Sven Ketelsen
|
42d8398349
|
DEV-664 bugfix use server specific domain
|
3 years ago |
Görz, Friedrich
|
fe97fbbab5
|
Bug/dev 659 pgdatadir nospace
|
3 years ago |
friedrich goerz
|
e23813f9d1
|
NOTICKET: but metrics missing since Nov2021 - needs to be fixed ;)
|
3 years ago |
Michael Hähnel
|
87a286dd60
|
DEV-624 New alert for failed db backups
|
3 years ago |
Ketelsen, Sven
|
db57bcb7ca
|
DEV-579 add basic auth to prometheus stack
|
3 years ago |
Görz, Friedrich
|
24e5cbf3d9
|
DEV-616: increased vol_count to mitigate disk size problem
|
3 years ago |
Hoan To
|
98c5f39c85
|
DEV-579: added prometheus basic auth
|
3 years ago |
Ketelsen, Sven
|
f47c5dc345
|
DEV-578 investigation for hetzner api rate limits
|
3 years ago |
Ketelsen, Sven
|
ac7285bbcf
|
DEV-572: alertmanager metrics
|
3 years ago |
friedrich goerz
|
659943ccc5
|
DEV-563: bugfixed hetzner rate limit alert
|
3 years ago |
Ketelsen, Sven
|
35dbd3cad1
|
DEV-569: extended stage overview dashboard
|
3 years ago |
friedrich goerz
|
9e6f28c62a
|
DEV-563: added hetzner dashboard + svennes dashboard + refactoring alert for hetzner_api_rate_limit
|
3 years ago |
Görz, Friedrich
|
01c972771b
|
Rollout main=>qa 13.09.2022
|
3 years ago |
friedrich goerz
|
5367c9929e
|
DEV-539: increased timerange; bugfixed broken silencing for patchday
|
3 years ago |
Görz, Friedrich
|
ffb3aa2122
|
DEV-543: integrated DO-blackbox VM into DEV-patchday + increased threshold for...
|
3 years ago |
Hoan To
|
a0ff9a5d8e
|
added elasticsearch health check rule
|
3 years ago |
friedrich goerz
|
1558548682
|
DEV-517: added alerting for DO API usage
|
3 years ago |
Görz, Friedrich
|
1c5b1c44dd
|
DEV-391: fix merge problems + fixing linter problems
|
4 years ago |
Görz, Friedrich
|
6c6dd5c1ae
|
DEV-442: added threshold for pg_repl_lag to avoid false positives on DEV-stage
|
4 years ago |
Michael Hähnel
|
ff9c0d94a1
|
Extended Monitoring/Alerting for PostgreSQL
|
4 years ago |
friedrich goerz
|
8c8722851f
|
DEV-386: added alert to get notification in case of ssh root login
|
4 years ago |
Görz, Friedrich
|
f0eab6d3ae
|
DEV-421: refactored installation for postgres-exporter + installed newer...
|
4 years ago |
Görz, Friedrich
|
a2fa12ef40
|
DEV-396: changed diskspace alert from predictive to alert of current usage
|
4 years ago |
friedrich goerz
|
a834b13ded
|
DEV-378: increased allowed pending time for some alerts
|
4 years ago |
Görz, Friedrich
|
ea2ef949c9
|
DEV-360: rollout k8s on prodnso
|
4 years ago |
friedrich goerz
|
46e021d22c
|
DEV-327: added several things for new prodnso-stage + bugfixes and other improvements
|
4 years ago |
Sven Ketelsen
|
d314e164c7
|
bugfix: disabled blackbox exporter for connect management
- current config doesn't work with the 302 redirect to the login page
|
4 years ago |
Sven Ketelsen
|
df0e320743
|
bugfix: fixed connect url for blackbox exporter
|
4 years ago |
Sven Ketelsen
|
43a4dccc3f
|
chore: removed unnecessary ip lookup
|
4 years ago |
Görz, Friedrich
|
9f9a192432
|
DEV-269: added stuff to federate k8s-internal prometheus metrics
|
4 years ago |
Görz, Friedrich
|
5bdff07d1b
|
DEV-253: digitalocean stuff - add droplet, but not idempotent; plz check
|
4 years ago |
Sven Ketelsen
|
d780336dad
|
bugfix: wrong port for postgres exporter
- monitor_port_system > monitor_port_postgres
|
4 years ago |
friedrich goerz
|
3766911cc5
|
DEV-241: added monitoring stuff for redis
|
4 years ago |
Sven Ketelsen
|
bd13643e30
|
feat: prometheus now uses stage_server_infos (auto discover task)
|
4 years ago |
Sven Ketelsen
|
da646bf4bd
|
chore: removed duplications between iam/gitea
- deploying is now done by shared role
- only configuration needed by iam/gitea role
|
4 years ago |
Sven Ketelsen
|
8e88f4bf3d
|
feat: added monitoring for gitea
|
4 years ago |
friedrich goerz
|
ac3136a441
|
DEV-228: changed promql-query to reduce noise in alerting MS Teams channels
|
4 years ago |
Sven Ketelsen
|
bb62199bcd
|
bugfix: set repeat_interval for alerts to 6h
|
4 years ago |
Sven Ketelsen
|
335e3bb9dd
|
chore: cors for swagger on connect/iam
|
4 years ago |
Gordon, Alexander
|
c0cd50339c
|
DEV-163: feat: keycloak prometheus integration
|
4 years ago |
Sven Ketelsen
|
eec580d2dc
|
chore: increased pg_stat_database_numbackends from 21 to 30
|
4 years ago |
Sven Ketelsen
|
9e3af9b1a8
|
chore: increased default max connection for pg from 20 to 21
|
4 years ago |
Peter Heise
|
c86ccc48aa
|
Added postgres exporter + dashboard.
|
4 years ago |
Peter Heise
|
7c0f9c597b
|
Added mysql/maria-exporter + dashboard.
|
4 years ago |
Sven Ketelsen
|
ad861db16e
|
SMARCH-92: split elastic stack services for qa
- elasticsearch
- logstash
- kibana
|
4 years ago |
Peter Heise
|
1bfcac5646
|
Removed container node-exporter, added system node-exporter, optimized autodiscover pre-tasks.
|
4 years ago |
Sven Ketelsen
|
b6cdd8528b
|
bugfix: prometheus scrape config
- skip traefik scraping when traefik_enabled is false
- skip node_exporter scraping when node_exporter_enabled is false
|
4 years ago |
Sven Ketelsen
|
a8b60e9069
|
chore: teams alerting hook can now be stage specific
- added var netgo_msteams_hook_alerting (DEV)
|
4 years ago |
Alexander Gordon
|
a966d90020
|
Added MSTeams Alerts for Prometheus
|
4 years ago |
Sven Ketelsen
|
d7704681ee
|
bugfix: awx polling configuration produces wrong instance
- <url>:80 -> <url>
|
4 years ago |