Simplifies the deployment reports structure

Mayra Cabrera requested to merge simplify-deployments-report into master

What does this MR do and why?

This commit simplifies the deployment reports by:

  • Combining the Resource and Summary columns into one. This takes advantage of GitLab automatically expanding the issue title when a '+' is appended to the issue reference.
  • Adding a dedicated row to the Weekly overview section that contains the total hours blocked in production and staging. This makes it easy to copy and paste the information into the spreadsheet.
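The combined column can be sketched as follows. This is a minimal Python illustration, not the code changed in this MR: the helper names `expanded_reference` and `combined_issue_cell` are hypothetical, and the 50-character limit (including the trailing "...") is inferred from the rows in the "After" example. Whether that truncation happens in the report generator or in GitLab's rendering of the '+' reference is not stated here.

```python
def expanded_reference(reference: str) -> str:
    """Append '+' so GitLab renders the issue title next to the reference."""
    return f"{reference}+"


def combined_issue_cell(title: str, reference: str, state: str,
                        max_len: int = 50) -> str:
    """Mimic the rendered cell: '<title> (<reference> - <state>)'.

    max_len is an assumption inferred from the example rows; titles longer
    than max_len are cut so that title plus '...' is exactly max_len long.
    """
    omission = "..."
    if len(title) > max_len:
        title = title[: max_len - len(omission)] + omission
    return f"{title} ({reference} - {state})"


if __name__ == "__main__":
    # What the generator would write into the Issue column (assumed).
    print(expanded_reference("gitlab-org/release/tasks#6441"))
    # What the rendered cell looks like for the same issue.
    print(combined_issue_cell(
        "2023-09-07 - Deployment blocker - Runners auth error",
        "gitlab-org/release/tasks#6441",
        "closed",
    ))
```

With the title and reference of gitlab-org/release/tasks#6441 from the "Before" example, the second call reproduces the corresponding cell shown in the "After" example.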

Examples

Before

Deployment Blockers - Week: 2023-09-04-2023-09-10

Overview

| Start date | End date | Production deployments blocked for | # of blockers |
| --- | --- | --- | --- |
| 2023-09-04 | 2023-09-10 | 33.5 | 11 |

Weekly overview

| Resource | Summary | Blocker type | gstg | gprd |
| --- | --- | --- | --- | --- |
| gitlab-org/release/tasks#6460 (closed) | QA failure qa/specs/features/ee/browser_ui/2_plan/issues_analytics/issues_analytics_spec.b on gstg-cny | ~"RootCause::Feature-Flag" | 2.0 | 0.0 |
| gitlab-org/release/tasks#6441 (closed) | 2023-09-07 - Deployment blocker - Runners auth error | | 3.0 | 2.0 |
| gitlab-org/release/tasks#6425 (closed) | QA failure: qa/specs/features/browser_ui/3_create/repository/protected_tags_spec.rb | ~"RootCause::Flaky-Test" | 0.5 | 0.5 |
| gitlab-org/release/tasks#6421 (closed) | Tuesday 2023-09-05 11:38 UTC - gitlab-org/gitlab broken 16-3-stable-ee with rspec-ee system pg14 10/10, rspec-ee system pg14 9/10, rspec-ee system pg14 7/10, rspec-ee system pg14 6/10, rspec-ee system pg14 5/10, rspec-ee system pg14 4/10, rspec-ee ... | | 0.0 | 0.0 |
| gitlab-org/release/tasks#6410 (closed) | QA failure: qa/specs/features/api/9_data_stores/user_inherited_access_spec.rb | ~"RootCause::Flaky-Test" | 1.0 | 1.0 |
| gitlab-org/release/tasks#6409 (closed) | QA failure: qa/specs/features/browser_ui/9_data_stores/group/transfer_project_spec.rb | ~"RootCause::Software-Change" | 10.0 | 10.0 |
| gitlab-org/release/tasks#6408 (closed) | QA failure due to artifacts not being available | | 1.0 | 1.0 |
| gitlab-com/gl-infra/production#16323 (closed) | 2023-09-08: QA tests are failing on gstg-cny | ~"RootCause::Software-Change" | 7.0 | 7.0 |
| gitlab-com/gl-infra/production#16322 (closed) | 2023-09-08: gstg-cny-migrations job failing due the wrong Ansible password | ~"RootCause::Needed" | 4.0 | 0.0 |
| gitlab-com/gl-infra/production#16315 (closed) | 2023-09-07: SourceForge.net is DOWN. Deployment blocker | ~"RootCause::External-Dependency" | 10.0 | 10.0 |
| gitlab-com/gl-infra/production#16297 (closed) | 2023-09-04: ops.gitlab.net No active runners available | ~"RootCause::Needed" | 4.0 | 2.0 |

Additional incidents

Below is a list of production incidents created last week.

| Resource | Summary |
| --- | --- |
| gitlab-com/gl-infra/production#16341 (closed) | 2023-09-11: PrometheusNotificationsBacklog in Ops |
| gitlab-com/gl-infra/production#16340 (closed) | 2023-09-10: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16339 (closed) | 2023-09-10: GCPLoadBalancerImbalancedBackends for gprd-haproxy-main-regional |
| https://gitlab.com/gitlab-com/gl-infra/production/-/issues/16338 | 2023-09-10: Attempted exploit of CVE-2019-11580 |
| gitlab-com/gl-infra/production#16337 (closed) | 2023-09-10: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16336 (closed) | 2023-09-10: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16334 (closed) | 2023-09-10: customers.gitlab.com blackbox probex failing |
| gitlab-com/gl-infra/production#16333 (closed) | 2023-09-09: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16332 (closed) | 2023-09-09: PvsServiceHttpApdexSLOViolation |
| gitlab-com/gl-infra/production#16331 (closed) | 2023-09-09: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16330 (closed) | 2023-09-09: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16329 (closed) | 2023-09-09: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16328 (closed) | 2023-09-09: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16327 (closed) | 2023-09-09: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16326 (closed) | 2023-09-09: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16325 (closed) | 2023-09-09: websockets rails_requests violating slo in us-east1-d |
| gitlab-com/gl-infra/production#16324 (closed) | 2023-09-08: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16319 (closed) | 2023-09-07: Prometheus server unresponsive in ops environment |
| gitlab-com/gl-infra/production#16314 (closed) | 2023-09-07: SSL certificate for kas.staging.gitlab.com:443 expires soon |
| gitlab-com/gl-infra/production#16313 (closed) | 2023-09-06: Prometheus server unresponsive in ops environment |
| gitlab-com/gl-infra/production#16311 (closed) | 2023-09-06: Intermittent failures in health checks for main stage git service when updating feature flags |
| gitlab-com/gl-infra/production#16307 (closed) | 2023-09-05: gstg mailroom pod restarting |
| gitlab-com/gl-infra/production#16305 (closed) | 2023-09-05: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16304 (closed) | 2023-09-04: PostgreSQL queries dominating total query time |
| gitlab-com/gl-infra/production#16303 (closed) | 2023-09-04: Blackbox probes for https://packages.gitlab.com are failing. |
| gitlab-com/gl-infra/production#16301 (closed) | 2023-09-04: Elevated error rate on cny web |

Instructions

  • Review the "Additional incidents" list and add the Deploys-blocked label if required.
  • Update the "weekly overview" table of this issue to also include:
    • any blocking CRs
    • the Root Cause type for each blocker (if this one is missing)
  • Update the Deployments metric review epic.
    • Add a new row to the Overview section: Copy and paste the information from the Overview section in this issue and link to this issue in the Breakdown of blockers column.
    • Update the Graph: Update the data on the spreadsheet and then update the graph on the Deployments metric review epic.

After

Deployment Blockers - Week: 2023-09-04-2023-09-10

Overview

| Start date | End date | Production deployments blocked for | # of blockers |
| --- | --- | --- | --- |
| 2023-09-04 | 2023-09-10 | 33.5 | 11 |

Weekly overview

| Issue | Blocker type | gstg | gprd |
| --- | --- | --- | --- |
| QA failure qa/specs/features/ee/browser_ui/2_pl... (gitlab-org/release/tasks#6460 - closed) | ~"RootCause::Feature-Flag" | 2.0 | 0.0 |
| 2023-09-07 - Deployment blocker - Runners auth ... (gitlab-org/release/tasks#6441 - closed) | | 3.0 | 2.0 |
| QA failure: qa/specs/features/browser_ui/3_crea... (gitlab-org/release/tasks#6425 - closed) | ~"RootCause::Flaky-Test" | 0.5 | 0.5 |
| Tuesday 2023-09-05 11:38 UTC - `gitlab-org/gitl... (gitlab-org/release/tasks#6421 - closed) | | 0.0 | 0.0 |
| QA failure: qa/specs/features/api/9_data_stores... (gitlab-org/release/tasks#6410 - closed) | ~"RootCause::Flaky-Test" | 1.0 | 1.0 |
| QA failure: qa/specs/features/browser_ui/9_data... (gitlab-org/release/tasks#6409 - closed) | ~"RootCause::Software-Change" | 10.0 | 10.0 |
| QA failure due to artifacts not being available (gitlab-org/release/tasks#6408 - closed) | | 1.0 | 1.0 |
| 2023-09-08: QA tests are failing on gstg-cny (gitlab-com/gl-infra/production#16323 - closed) | ~"RootCause::Software-Change" | 7.0 | 7.0 |
| 2023-09-08: gstg-cny-migrations job failing due... (gitlab-com/gl-infra/production#16322 - closed) | ~"RootCause::Needed" | 4.0 | 0.0 |
| 2023-09-07: SourceForge.net is DOWN. Deployment... (gitlab-com/gl-infra/production#16315 - closed) | ~"RootCause::External-Dependency" | 10.0 | 10.0 |
| 2023-09-04: ops.gitlab.net No active runners av... (gitlab-com/gl-infra/production#16297 - closed) | ~"RootCause::Needed" | 4.0 | 2.0 |
| Total | | 42.5 | 33.5 |

Additional incidents

Below is a list of production incidents created last week.

Issue
2023-09-11: PrometheusNotificationsBacklog in Ops (gitlab-com/gl-infra/production#16341 - closed)
2023-09-10: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16340 - closed)
2023-09-10: GCPLoadBalancerImbalancedBackends f... (gitlab-com/gl-infra/production#16339 - closed)
https://gitlab.com/gitlab-com/gl-infra/production/-/issues/16338+
2023-09-10: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16337 - closed)
2023-09-10: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16336 - closed)
2023-09-10: customers.gitlab.com intermittent a... (gitlab-com/gl-infra/production#16334 - closed)
2023-09-09: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16333 - closed)
2023-09-09: PvsServiceHttpApdexSLOViolation (gitlab-com/gl-infra/production#16332 - closed)
2023-09-09: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16331 - closed)
2023-09-09: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16330 - closed)
2023-09-09: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16329 - closed)
2023-09-09: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16328 - closed)
2023-09-09: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16327 - closed)
2023-09-09: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16326 - closed)
2023-09-09: websockets rails_requests violating... (gitlab-com/gl-infra/production#16325 - closed)
2023-09-08: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16324 - closed)
2023-09-07: Prometheus server unresponsive in o... (gitlab-com/gl-infra/production#16319 - closed)
2023-09-07: SSL certificate for kas.staging.git... (gitlab-com/gl-infra/production#16314 - closed)
2023-09-06: Prometheus server unresponsive in o... (gitlab-com/gl-infra/production#16313 - closed)
2023-09-06: Intermittent failures in health che... (gitlab-com/gl-infra/production#16311 - closed)
2023-09-05: gstg mailroom pod restarting (gitlab-com/gl-infra/production#16307 - closed)
2023-09-05: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16305 - closed)
2023-09-04: PostgreSQL queries dominating total... (gitlab-com/gl-infra/production#16304 - closed)
2023-09-04: Blackbox probes for https://package... (gitlab-com/gl-infra/production#16303 - closed)
2023-09-04: Elevated error rate on cny web (gitlab-com/gl-infra/production#16301 - closed)

Instructions

  • Review the "Additional incidents" list and add the Deploys-blocked label if required.
  • Update the "weekly overview" table of this issue to also include:
    • any blocking CRs
    • the Root Cause type for each blocker (if this one is missing)
  • Update the Deployments metric review epic.
    • Add a new row to the Overview section: Copy and paste the information from the Overview section in this issue and link to this issue in the Breakdown of blockers column.
    • Update the Graph: Update the data on the spreadsheet and then update the graph on the Deployments metric review epic.
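The Total row in the "After" example can be cross-checked by summing the gstg and gprd columns of the Weekly overview. The sketch below is only an illustration in Python: the `blocked_hours` list and the `total_row` helper are hypothetical names, and the values are copied from the eleven blocker rows of the example report.

```python
# (gstg hours, gprd hours) per blocker, in the order the rows appear
# in the Weekly overview of the example report.
blocked_hours = [
    (2.0, 0.0),
    (3.0, 2.0),
    (0.5, 0.5),
    (0.0, 0.0),
    (1.0, 1.0),
    (10.0, 10.0),
    (1.0, 1.0),
    (7.0, 7.0),
    (4.0, 0.0),
    (10.0, 10.0),
    (4.0, 2.0),
]


def total_row(rows: list[tuple[float, float]]) -> tuple[float, float]:
    """Return the summed (gstg, gprd) hours for the Total row."""
    gstg_total = sum(gstg for gstg, _ in rows)
    gprd_total = sum(gprd for _, gprd in rows)
    return gstg_total, gprd_total


if __name__ == "__main__":
    gstg_total, gprd_total = total_row(blocked_hours)
    print(f"Total {gstg_total} {gprd_total}")  # prints "Total 42.5 33.5"
```

The gprd total (33.5) also matches the "Production deployments blocked for" value in the Overview section, so the Total row gives both figures needed for the spreadsheet in one place.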

Author Check-list

  • Has documentation been updated?
Edited by Mayra Cabrera
