232 Commits

Author SHA1 Message Date
Milan Pavlik
0d249cb410 [grafana] Fix Connect / server error requests labels 2022-11-02 23:28:07 +01:00
Anton Kosyakov
1b60280a18 [openvsx-mirror] stable dashboard variables 2022-11-02 15:06:07 +01:00
ArthurSens
6ee2fc5ef9 Add links to explore cardinality metrics
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-11-02 11:44:07 +01:00
Milan Pavlik
4a076cd618 [public-api] Add alerts for no metrics and failing RPCs 2022-11-02 08:43:06 +01:00
Thomas Schubart
f31bbd2ca9 [obs] Rename psi dashboard to node psi 2022-11-01 03:40:06 +01:00
Thomas Schubart
fb62393f1f [obs] Create workspace psi dashboard 2022-11-01 03:40:06 +01:00
ArthurSens
2b82c4be43 Extend cardinality dashboard with labels information
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-31 15:20:05 +01:00
Milan Pavlik
9bc39df509 [server] Remove gitpod_server_api_calls_user_total metric 2022-10-31 14:54:06 +01:00
ArthurSens
ebc01f087d Fix missing datasource variable
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-28 13:35:02 +02:00
ArthurSens
fad5a55f69 Create initial dashboard for cardinality management
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-28 13:32:02 +02:00
mustard
fef272da21 [observability] update openvsx mirror dashboard 2022-10-28 12:57:02 +02:00
Aleksandar Aleksandrov
bfe9171bcb Remove Kube*Overcommit alerts 2022-10-28 12:39:02 +02:00
ArthurSens
8b0f245bc7 Fix timeseries_total recording rule
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-27 21:32:42 +02:00
ArthurSens
6e6c78e9e4 Add alert for promethus crashlooping
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-27 20:37:42 +02:00
ArthurSens
a86543f3a2 Move observability-stack alerts to delivery-and-operations folder
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-27 20:37:42 +02:00
ArthurSens
2aee2d9ae5 Add dashboard URL button to argocd alerts
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-27 20:15:42 +02:00
Huiwen
1956748de5 [observability] add dashboard for openvsx mirror
Co-authored-by: akosyakov <anton@gitpod.io>
2022-10-27 17:20:41 +02:00
ArthurSens
2e6067f5a5 Fix ServerEventLoopLagTooHigh alert
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-27 15:43:41 +02:00
ArthurSens
652da9d2f8 Add intial prometheus rules for cardinality analysis
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-27 14:43:41 +02:00
ArthurSens
06a8e0e8f7 Add 'team' as a filter to ArgoCD dashboard
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-26 17:01:41 +02:00
Milan Pavlik
310caaf9be [dashboard] Add dashboard for Connect API calls 2022-10-26 16:45:41 +02:00
Milan Pavlik
72771c036f [webapp] Remove namespace from monitoring rules 2022-10-26 14:32:41 +02:00
Milan Pavlik
81c4685ccc [alerts] Group by cluster, where relevant, ahead of centralizing rule evaluation 2022-10-26 14:32:41 +02:00
Liam Bennett
d373c49124 Add QOL to the observability dashboard 2022-10-26 13:17:40 +02:00
ArthurSens
dd97e9e019 Add minimal dashboard for the observability stack
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-26 10:32:40 +02:00
Manuel Alejandro de Brito Fontes
52848f6e18 [registry-facade] Add new blobSopurce dashboard 2022-10-25 23:35:40 +02:00
ArthurSens
cebf1e03aa Add alerts for ArgoCD Apps' state
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-24 18:59:39 +02:00
Arthur Silva Sens
0f33e51450 Update dashboards.libsonnet 2022-10-19 15:28:34 +02:00
ArthurSens
68b2fe68ce Add dashboard for ArgoCD applications' state
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-18 13:54:33 +02:00
ArthurSens
721dd00364 Add Namespace Selector for ArgoCD ServiceMonitors
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-17 12:42:32 +02:00
Filip Troníček
499bd02585 Add a Prometheus alert rule for SSH Gateway 2022-10-14 18:44:29 +02:00
JenTing Hsiao
b9c841f2f5 obs: fix new workspace does not shown workspace success rate
Signed-off-by: JenTing Hsiao <hsiaoairplane@gmail.com>
2022-10-13 22:34:29 +02:00
ArthurSens
f98e1b16ae Adjust IDE alerts to work with centralized alerting
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-10-13 14:27:28 +02:00
Liam Bennett
6fb1ac06ea add servicemonitors for argocd 2022-10-11 14:25:26 +02:00
Thomas Schubart
f668ab404f [obs] Remove NetworkConnectionsTooHigh alert 2022-10-11 04:08:26 +02:00
Laurie T. Malau
52a5bcd678 [grafana] Add panel for instances_marked_stopped 2022-10-10 11:18:26 +02:00
Milan Pavlik
7d7cec2bd2 [usage] Simplify dashboard by showing gauge and history in the same graph 2022-10-04 11:22:20 +02:00
Milan Pavlik
18f25b9926 [usage] Fix alerting for GitpodUsageTooLongSinceLastSuccessfulLedgerReconciliation 2022-10-04 09:28:20 +02:00
Thomas Schubart
f4a71fa6cc [obs] Dashboard for psi metrics 2022-10-04 00:34:19 +02:00
ArthurSens
7805fd67cb Migrate dashboards from jsonnet to json
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-09-29 20:33:30 +02:00
Milan Pavlik
d43a933f9b [usage] Add Stripe usage records udpated to Usage component dashboard 2022-09-29 15:12:30 +02:00
Thomas Schubart
e99d002c5a Revert "fix network connections alert to fire only for workspace pods"
This reverts commit 83d4edba28efbe99a0c00d1d26e747b3824ee3c7.
2022-09-29 11:34:29 +02:00
ArthurSens
02faf852fd Improve notification for WebAppServicesHighCPUUsage
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-09-29 07:59:29 +02:00
Pavel Tumik @ GitPod
83d4edba28 fix network connections alert to fire only for workspace pods 2022-09-29 03:49:29 +02:00
ArthurSens
8b618c8e84 Add dashboard URL to GitpodOpenVSXRegistryDown alert
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-09-28 20:00:29 +02:00
Anton Kosyakov
88783077f0 Use user centric metrics to trigger an alert. 2022-09-28 15:02:28 +02:00
Milan Pavlik
d9dc8f0623 [grafana] Fix grpc / client dashboard 2022-09-26 14:15:26 +02:00
Milan Pavlik
95b5fb128c [usage] Fix alerting rules 2022-09-26 09:11:26 +02:00
Milan Pavlik
a9c85bd4b3 [grafana] Add gRPC Client dashboard with metrics 2022-09-25 01:12:25 +02:00
Kyle Brennan
f2cf455e4c Link existing KubeNodeNotReady runbook to alert 2022-09-24 02:25:24 +02:00