Görz, Friedrich
2bcffed2d7
DEV-650: added config stuff to drop docker.container.label to avoid crashing...
3 years ago
sven.ketelsen
ad6f470920
Revert "DEV-647 added hetzner domain smardigo.dev"
...
This reverts commit 0b7b2a0f01 .
3 years ago
Ketelsen, Sven
0b7b2a0f01
DEV-647 added hetzner domain smardigo.dev
3 years ago
Görz, Friedrich
a9c0e86f36
Revert "DEV-647 added hetzner domain smardigo.dev"
3 years ago
Ketelsen, Sven
7cdc602534
DEV-647 added hetzner domain smardigo.dev
3 years ago
friedrich goerz
bf72c7fbc7
DEV-635: removed creating index per job/pod
3 years ago
Michael Hähnel
87a286dd60
DEV-624 New alert for failed db backups
3 years ago
Ketelsen, Sven
f754404845
DEV-629 added logging buckets for k8s [job|pod][name]
3 years ago
Michael Hähnel
9b63b2e5a8
DEV-601 added extra configuration for bdev mpmexec demo server
3 years ago
Michael Hähnel
b9e48a3260
DEV-601 added playbook for bdev demo setup
3 years ago
Ketelsen, Sven
db57bcb7ca
DEV-579 add basic auth to prometheus stack
3 years ago
Görz, Friedrich
24e5cbf3d9
DEV-616: increased vol_count to mitigate disk size problem
3 years ago
Hoan To
98c5f39c85
DEV-579: added prometheus basic auth
3 years ago
Ketelsen, Sven
f47c5dc345
DEV-578 investigation for hetzner api rate limits
3 years ago
Ketelsen, Sven
9919985e3d
DEV-593 updated versions
3 years ago
Ketelsen, Sven
ac7285bbcf
DEV-572: alertmanager metrics
3 years ago
friedrich goerz
659943ccc5
DEV-563: bugfixed hetzner rate limit alert
3 years ago
Ketelsen, Sven
35dbd3cad1
DEV-569: extended stage overview dashboard
3 years ago
friedrich goerz
9e6f28c62a
DEV-563: added hetzner dashboard + svennes dashboard + refactoring alert for hetzner_api_rate_limit
3 years ago
Görz, Friedrich
01c972771b
Rollout main=>qa 13.09.2022
3 years ago
friedrich goerz
5367c9929e
DEV-539: increased timerange; bugfixed broken silencing for patchday
3 years ago
Görz, Friedrich
ffb3aa2122
DEV-543: integrated DO-blackbox VM into DEV-patchday + increased threshold for...
3 years ago
Hoan To
a0ff9a5d8e
added elasticsearch health check rule
3 years ago
friedrich goerz
1558548682
DEV-517: added alerting for DO API usage
3 years ago
Sven Ketelsen
2cf1d8b9dc
bugfix: service creation with portal is broken
...
- Filebeat autodiscover condition isn't working for all
hosts. Switched condition to docker_enabled flag. If a
container has no default log file (harbor) there isn't
a problem because there will just no log file found.
The autodiscover docker container log files mustn't
deactivated in this cases at all.
4 years ago
Sven Ketelsen
72ff5db355
DEV-416: review collect postgres logs to elk-stack
4 years ago
Sven Ketelsen
1048f5845d
bugfix: removed daily roll over for log indices
4 years ago
Sven Ketelsen
8156a45ec2
feat: updated elastic certs for qa/prod stages
...
- create new certificates (--days 1095)
- rollout with playbook smardigo.yml + -t update_certs
all elasticsearch
all kibana
all logstash
- rollout with playbook setup.yml + -t update_certs
all filebeat
- manually updates connect certs
use smardigo.yml + -t update_certs - with connect role
4 years ago
Sven Ketelsen
1fd63f3676
feat: updated elastic certs on dev stage
...
- create new certificates (--days 1095)
- rollout with playbook smardigo.yml + -t update_certs
all elasticsearch
all kibana
all logstash
- rollout with playbook setup.yml + -t update_certs
all filebeat
- manually updates connect certs
use smardigo.yml + -t update_certs - with connect role
4 years ago
Görz, Friedrich
84a013d169
MOB-148: added k8s cluster for mobene stuff
4 years ago
Görz, Friedrich
0f69260711
DEV-416: added stuff to enable filebeat for postgres + mariabb instances
4 years ago
friedrich goerz
43fbb20fb8
DEV-484: changed index naming pattern from monthly to daily
4 years ago
Görz, Friedrich
1c5b1c44dd
DEV-391: fix merge problems + fixing linter problems
4 years ago
Sven Ketelsen
26dad106ba
review: logstash index pattern
...
- added block for [kubernetes][statefulset][name]
4 years ago
Sven Ketelsen
2f0c919f2e
review: logstash index pattern
...
- added block for [kubernetes][daemonset][name]
4 years ago
Sven Ketelsen
9c052aabc7
review: logstash index pattern
...
- added uncategorized block for kubernetes
no [kubernetes][deployment][name] available
- added uncategorized block for beats
no [container][name] available
4 years ago
Görz, Friedrich
98c9f70e8a
DEV-338: added logstash config to deliver k8s-dockerlogs into specific indices
4 years ago
Görz, Friedrich
6c6dd5c1ae
DEV-442: added threshold for pg_repl_lag to avoid false positives on DEV-stage
4 years ago
Michael Hähnel
ff9c0d94a1
Extended Monitoring/Alerting for PostgreSQL
4 years ago
Sven Ketelsen
7a9bd9411e
bugfix: logstash mutate - remove_field
...
- [host][ip]
- [host][mac]
4 years ago
friedrich goerz
8c8722851f
DEV-386: added alert to get notification in case of ssh root login
4 years ago
Görz, Friedrich
f0eab6d3ae
DEv-421: refactored installation for postgres-exporter + installed newer...
4 years ago
Görz, Friedrich
a2fa12ef40
DEV-396: changed diskspace alert from predictive to alert of current usage
4 years ago
friedrich goerz
a834b13ded
DEV-378: increased allowed pending time for some alerts
4 years ago
Görz, Friedrich
ea2ef949c9
DEV-360: rollout k8s on prodnso
4 years ago
Sven Ketelsen
7c891e472c
feat: activated jaeger traecing on dev
...
- traefik
- connect
- iam
4 years ago
Ketelsen, Sven
65df2886e3
DEV-359: feat: added jaeger-operator/jaeger
4 years ago
friedrich goerz
46e021d22c
DEV-327: added several stuff for new prodnso-stage + bugfixing and improving other stuff
4 years ago
Sven Ketelsen
cdd9c2543a
cleanup: removed vault for group/all > moved to stage groups
...
- every stage has now its own vault file
4 years ago
Sven Ketelsen
190b8394eb
feat: added metricbeat (inactive)
4 years ago