Skip to content

Commit

Permalink
SRVKP-4532: factor k8s throttling into task panel
Browse files Browse the repository at this point in the history
Also:
- fix tekton-config.yaml yamllint error
- widen controller restart query
- make osp pruner cfg consistent with konflux prod

rh-pre-commit.version: 2.3.0
rh-pre-commit.check-secrets: ENABLED
  • Loading branch information
gabemontero committed Jul 25, 2024
1 parent c12b163 commit c9e9d5d
Show file tree
Hide file tree
Showing 2 changed files with 12 additions and 13 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -227,7 +227,7 @@
"type": "stat"
},
{
"description": "The number of times the pipelines controller has restarted",
"description": "The number of times any of the pipelines controllers hav restarted",
"fieldConfig": {
"defaults": {
"color": {
Expand Down Expand Up @@ -281,7 +281,7 @@
"targets": [
{
"editorMode": "code",
"expr": "sum(increase(kube_pod_container_status_restarts_total{namespace=\"openshift-pipelines\", pod=~\"tekton-pipelines-controller-.*\"}[2m]))",
"expr": "sum(increase(kube_pod_container_status_restarts_total{namespace=\"openshift-pipelines\", pod=~\"tekton-.*\"}[2m]))",
"legendFormat": "__auto",
"range": true,
"refId": "A"
Expand Down Expand Up @@ -409,7 +409,7 @@
"targets": [
{
"editorMode": "code",
"expr": "sum(increase(taskrun_pod_create_not_attempted_or_pending_count[2m]))",
"expr": "sum(increase(taskrun_pod_create_not_attempted_or_pending_count[2m])) - sum(increase(tekton_pipelines_controller_running_taskruns_throttled_by_quota[2m])) - sum(increase(tekton_pipelines_controller_running_taskruns_throttled_by_node[2m]))",
"legendFormat": "__auto",
"range": true,
"refId": "A"
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -190,13 +190,12 @@ spec:
kube-api-qps: 50
kube-api-burst: 50
pruner:
# The load on prod-rh01 is to the point now where tekton-results
# can fall too far behind. Until the watcher's log storage is rewritten
# etc with SRVKP-4347 or if we risk adding more processing power (threads,qps,burst)
# to the mem leak version of the watcher, we need the OSP pruner as a backup.
# a bit of an adjustment, we will prune once an hour now per https://crontab.guru/every-1-hour
# to line up with typical timeout settings.
keep: 10
resources:
- pipelinerun
schedule: 0 * * * *
# The load on prod-rh01 is to the point now where tekton-results
# can fall too far behind. Until the watcher's log storage is rewritten
# etc with SRVKP-4347 or if we risk adding more processing power (threads,qps,burst)
# to the mem leak version of the watcher, we need the OSP pruner as a backup.
disable: false
keep-since: 60
resources:
- pipelinerun
schedule: "*/10 * * * *"

0 comments on commit c9e9d5d

Please sign in to comment.