374 Commits

Author SHA1 Message Date
Prince Rachit Sinha
95592d00d8 Update run book ref for GitpodWorkspaceTooManyRegularNotActive 2022-02-09 20:04:31 +01:00
Prince Rachit Sinha
2a3e4d60f3 Update GitpodWorkspaceTooManyRegularNotActive severity level 2022-02-09 20:04:31 +01:00
Mads Hartmann
dd8b5b728a Remove OWNERS related files
Fixes https://github.com/gitpod-io/ops/issues/844
2022-02-08 09:15:30 +01:00
Pavel Tumik
8c7cb822ed add alert for conntrack table getting full 2022-02-08 04:42:30 +01:00
Pavel Tumik
a33a4a08a8 [observability] add coredns dashboard 2022-02-04 20:36:26 +01:00
ArthurSens
f6575f7f91 observability/mixins: Add Makefile step that generates dashboards
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-02-02 11:32:24 +01:00
ArthurSens
ac87de0a4c observability/mixins: Remove dashboard linter
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-02-02 11:23:24 +01:00
Laurie T. Malau
9ad62ce3fa Fix server dashboard default time range 2022-01-31 13:38:23 +01:00
Thomas Schubart
b396c23d3a Show difference between agent-smith and ws-manager 2022-01-31 10:28:22 +01:00
Thomas Schubart
c694273d37 Update dashboard 2022-01-31 10:28:22 +01:00
Pavel Tumik
2fb5775ef7 add metric to track failed manifest requests from registry-facade 2022-01-31 10:11:22 +01:00
ArthurSens
d5f92dc8e9 Update monitoring-satellite documentation
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2022-01-28 10:54:20 +01:00
George Tsiolis
31dfc5bd6b Update WebApp team label in component owners 2022-01-26 10:17:18 +01:00
Jan Koehnlein
d30815e685 [owners] rename team meta to webapp 2022-01-26 08:27:17 +01:00
Manuel Alejandro de Brito Fontes
82d786e2bb Remove ws-scheduler 2022-01-24 20:08:17 +01:00
Kyle Brennan
71f543110f Trigger node high load warnings sooner 2022-01-21 22:24:14 +01:00
Thomas Schubart
c62ec6633b Trigger node high load warnings sooner 2022-01-21 17:41:13 +01:00
Thomas Schubart
ae1d476e34 Update workspace success criteria
The threshold line for workspace startup 95% case should be 40 seconds,
not 30 seconds.
2022-01-20 10:27:12 +01:00
Laurie T. Malau
ea76aec273 Add metric and plug in 2022-01-13 15:52:06 +01:00
Manuel Alejandro de Brito Fontes
4935b242b7 Remove workspace deployment 2022-01-01 13:34:55 +01:00
Kyle Brennan
821d463fb9 Helm is needed to support Observability 2021-12-23 22:43:47 +01:00
Kyle Brennan
efe25d96f2 Fixes #7335
Handle "no data" by adding 'on() vector(0)' to each numerator
Relies on new variable $datasource
Also fixes legend for workspace startup panel
When exporting from Grafana, disable "export for sharing externally"
2021-12-23 22:43:47 +01:00
Gero Posmyk-Leinemann
893036754e [ops] Meta: Add alert ServerEventLoopLagTooHigh 2021-12-18 12:06:42 +01:00
Gero Posmyk-Leinemann
05ec1e39a8 [ops] Dashboard: Fix all server dashboard queries 2021-12-18 12:06:42 +01:00
Gero Posmyk-Leinemann
f21bd2fa59 [ops] Dashboard: fix Meta Overview 2021-12-18 12:06:42 +01:00
Prince Rachit Sinha
6285314e0e Add K3s cluster autoscaler dashboard 2021-12-16 07:36:40 +01:00
Gero Posmyk-Leinemann
fc5623e6cf [ops] Small dashboard/graph fixes 2021-12-01 17:30:27 +01:00
Manuel Alejandro de Brito Fontes
93ed40d14a Adjust sucess criteria dashboard 2021-11-30 15:07:26 +01:00
Gero Posmyk-Leinemann
ee703a4c7b [ops] Grafana: Fix datasource 2021-11-30 14:38:26 +01:00
Gero Posmyk-Leinemann
efc95d794a [ops] Grafana: Actually add Services dashboard 2021-11-30 14:38:26 +01:00
Gero Posmyk-Leinemann
43a54051f3 [ops] Add "gitpod-mixin" tag 2021-11-30 14:38:26 +01:00
Laurie T. Malau
0c4eb6317e [monitoring] Update overview dashboard 2021-11-26 11:40:22 +01:00
Gero Posmyk-Leinemann
236c1bb087 [ops] Add dashboard "Meta Services" 2021-11-23 15:41:20 +01:00
Christian Weichel
45e250bb5f [observability] Add ws team success criteria dashboard 2021-11-19 00:35:15 +01:00
Gero Posmyk-Leinemann
57f479ae44 [ops] server dashboard: reload on time range change 2021-11-18 10:50:15 +01:00
Gero Posmyk-Leinemann
e4d2e268bc [server] Add WebsocketClientContext 2021-11-18 10:50:15 +01:00
ArthurSens
30d2ffcb2e Replace table panel with graph
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2021-11-15 08:36:12 +01:00
ArthurSens
1f6195853b Add alert for normalized Load average higher than 10.
The same recording rule is also added to Gitpod / Overview dashboard, replacing the noisy neighborhood panel

Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2021-11-15 08:36:12 +01:00
Prince Rachit Sinha
b0552889e4 [ws-deployment] Add retry support within steps 2021-11-11 15:16:09 +01:00
Cornelius A. Ludmann
a7166daa72 Set version in Go components during build time 2021-11-11 10:23:08 +01:00
Prince Rachit Sinha
64f4da88a6 [ws-deployment] Add gitpod installation step 2021-11-10 09:25:08 +01:00
Manuel Alejandro de Brito Fontes
77321f8269 Update go modules 2021-11-05 10:33:03 +01:00
Prince Rachit Sinha
b4dddf84fe [ws-deployment] Introduce context wrapper and overrides 2021-11-02 16:30:10 +01:00
Manuel Alejandro de Brito Fontes
18ca7bc294 Update go modules 2021-11-02 15:50:10 +01:00
Prince Rachit Sinha
76a4428e8b [ws-deployment] Refactor deployment code for parallel processing 2021-11-02 08:03:10 +01:00
Prince Rachit Sinha
32fd7081bb [ws-deployment] Add install gitpod partial step 2021-11-01 18:26:09 +01:00
Prince Rachit Sinha
dd09006db9 [ws-deployment] Introduce --version-manifest flag and minor fixes 2021-10-28 11:56:05 +02:00
Prince Rachit Sinha
e723ca86d0 [ws-deployment] Set gcp credentials file path if present 2021-10-27 11:33:04 +02:00
Laurie T. Malau
a4a1719339 New overview dashboard 2021-10-22 16:13:00 +02:00
Prince Rachit Sinha
560f35b888 [ws-deployment] Prelim checking for ws-deployment 2021-10-22 14:08:59 +02:00