311 Commits

Author SHA1 Message Date
Jan Koehnlein
d30815e685 [owners] rename team meta to webapp 2022-01-26 08:27:17 +01:00
Manuel Alejandro de Brito Fontes
82d786e2bb Remove ws-scheduler 2022-01-24 20:08:17 +01:00
Kyle Brennan
71f543110f Trigger node high load warnings sooner 2022-01-21 22:24:14 +01:00
Thomas Schubart
c62ec6633b Trigger node high load warnings sooner 2022-01-21 17:41:13 +01:00
Thomas Schubart
ae1d476e34 Update workspace success criteria
The threshold line for workspace startup 95% case should be 40 seconds,
not 30 seconds.
2022-01-20 10:27:12 +01:00
Laurie T. Malau
ea76aec273 Add metric and plug in 2022-01-13 15:52:06 +01:00
Manuel Alejandro de Brito Fontes
4935b242b7 Remove workspace deployment 2022-01-01 13:34:55 +01:00
Kyle Brennan
821d463fb9 Helm is needed to support Observability 2021-12-23 22:43:47 +01:00
Kyle Brennan
efe25d96f2 Fixes #7335
Handle "no data" by adding 'on() vector(0)' to each numerator
Relies on new variable $datasource
Also fixes legend for workspace startup panel
When exporting from Grafana, disable "export for sharing externally"
2021-12-23 22:43:47 +01:00
Gero Posmyk-Leinemann
893036754e [ops] Meta: Add alert ServerEventLoopLagTooHigh 2021-12-18 12:06:42 +01:00
Gero Posmyk-Leinemann
05ec1e39a8 [ops] Dashboard: Fix all server dashboard queries 2021-12-18 12:06:42 +01:00
Gero Posmyk-Leinemann
f21bd2fa59 [ops] Dashboard: fix Meta Overview 2021-12-18 12:06:42 +01:00
Prince Rachit Sinha
6285314e0e Add K3s cluster autoscaler dashboard 2021-12-16 07:36:40 +01:00
Gero Posmyk-Leinemann
fc5623e6cf [ops] Small dashboard/graph fixes 2021-12-01 17:30:27 +01:00
Manuel Alejandro de Brito Fontes
93ed40d14a Adjust sucess criteria dashboard 2021-11-30 15:07:26 +01:00
Gero Posmyk-Leinemann
ee703a4c7b [ops] Grafana: Fix datasource 2021-11-30 14:38:26 +01:00
Gero Posmyk-Leinemann
efc95d794a [ops] Grafana: Actually add Services dashboard 2021-11-30 14:38:26 +01:00
Gero Posmyk-Leinemann
43a54051f3 [ops] Add "gitpod-mixin" tag 2021-11-30 14:38:26 +01:00
Laurie T. Malau
0c4eb6317e [monitoring] Update overview dashboard 2021-11-26 11:40:22 +01:00
Gero Posmyk-Leinemann
236c1bb087 [ops] Add dashboard "Meta Services" 2021-11-23 15:41:20 +01:00
Christian Weichel
45e250bb5f [observability] Add ws team success criteria dashboard 2021-11-19 00:35:15 +01:00
Gero Posmyk-Leinemann
57f479ae44 [ops] server dashboard: reload on time range change 2021-11-18 10:50:15 +01:00
Gero Posmyk-Leinemann
e4d2e268bc [server] Add WebsocketClientContext 2021-11-18 10:50:15 +01:00
ArthurSens
30d2ffcb2e Replace table panel with graph
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2021-11-15 08:36:12 +01:00
ArthurSens
1f6195853b Add alert for normalized Load average higher than 10.
The same recording rule is also added to Gitpod / Overview dashboard, replacing the noisy neighborhood panel

Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2021-11-15 08:36:12 +01:00
Prince Rachit Sinha
b0552889e4 [ws-deployment] Add retry support within steps 2021-11-11 15:16:09 +01:00
Cornelius A. Ludmann
a7166daa72 Set version in Go components during build time 2021-11-11 10:23:08 +01:00
Prince Rachit Sinha
64f4da88a6 [ws-deployment] Add gitpod installation step 2021-11-10 09:25:08 +01:00
Manuel Alejandro de Brito Fontes
77321f8269 Update go modules 2021-11-05 10:33:03 +01:00
Prince Rachit Sinha
b4dddf84fe [ws-deployment] Introduce context wrapper and overrides 2021-11-02 16:30:10 +01:00
Manuel Alejandro de Brito Fontes
18ca7bc294 Update go modules 2021-11-02 15:50:10 +01:00
Prince Rachit Sinha
76a4428e8b [ws-deployment] Refactor deployment code for parallel processing 2021-11-02 08:03:10 +01:00
Prince Rachit Sinha
32fd7081bb [ws-deployment] Add install gitpod partial step 2021-11-01 18:26:09 +01:00
Prince Rachit Sinha
dd09006db9 [ws-deployment] Introduce --version-manifest flag and minor fixes 2021-10-28 11:56:05 +02:00
Prince Rachit Sinha
e723ca86d0 [ws-deployment] Set gcp credentials file path if present 2021-10-27 11:33:04 +02:00
Laurie T. Malau
a4a1719339 New overview dashboard 2021-10-22 16:13:00 +02:00
Prince Rachit Sinha
560f35b888 [ws-deployment] Prelim checking for ws-deployment 2021-10-22 14:08:59 +02:00
ArthurSens
f173d2adcd [observability] Fix jsonnet format check
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2021-10-21 16:56:59 +02:00
ArthurSens
19c59e6754 Add missing description to WebsocketConnectionsNotClosing alert
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2021-10-21 15:12:59 +02:00
ArthurSens
8090214982 Add tests for alert description and severity
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2021-10-21 15:12:59 +02:00
Christian Weichel
1b603c1e7a [dashboards] Update agent-smith dashboard to latest metrics 2021-10-19 15:11:06 -03:00
ArthurSens
560a34a2cf Update runbooks' URL
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2021-10-19 12:37:06 -03:00
Christian Weichel
672a78aecd [owners] Fix platform ownership 2021-10-18 14:39:05 -03:00
Christian Weichel
cc38637231 [build] Clean up package structure 2021-10-18 14:39:05 -03:00
Christian Weichel
239606af6c [agent-smith] Improve dashboard 2021-10-18 11:01:05 -03:00
Gero Posmyk-Leinemann
e1741df347 [ops] Add "version" graph 2021-10-18 05:39:05 -03:00
Gero Posmyk-Leinemann
beed6cbb47 [ops] Add alert GitpodMetaMessagebusTotalQueues 2021-10-18 04:32:04 -03:00
ArthurSens
60b57475cf Improve documentantion regarding deployment to production
Signed-off-by: ArthurSens <arthursens2005@gmail.com>
2021-10-15 15:05:02 -03:00
Christian Weichel
d6055c7781 [agent-smith] Fix metrics 2021-10-15 12:09:02 -03:00
Christian Weichel
ddc37ce439 [observability] Add SLO for "regular not active" 2021-10-15 10:38:02 -03:00