Retry delete_helm_release a few times
Context
We have noticed that the delete_helm_release job can fail due to transient problems with the cluster (e.g. https://gitlab.com/gitlab-org/quality/engineering-productivity-infrastructure/-/issues/68#note_1253611556).
With !110188 (merged), we ensure that we won't start deploying a review-app if we're not starting from an empty namespace/helm release.
What does this MR do?
This MR goes a step further and tries to make the delete_helm_release job succeed by retrying it a few times, e.g. in case the cluster is unresponsive for a few seconds.
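The retry behaviour described above can be sketched as follows. This is only an illustration, not the change made in this MR: the helper name `retry`, the attempt count, and the delay are all assumptions, and the MR description does not state the actual values or how the job implements the retry.

```python
import time
from typing import Callable


def retry(
    action: Callable[[], bool],
    attempts: int = 3,            # assumed value, not taken from the MR
    delay_seconds: float = 5.0,   # assumed value, not taken from the MR
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Call `action` until it returns True or `attempts` runs are used up.

    Returns True as soon as one call succeeds, False if every call fails.
    """
    for attempt in range(1, attempts + 1):
        if action():
            return True
        # Wait between attempts, but not after the final failure.
        if attempt < attempts:
            sleep(delay_seconds)
    return False
```

Here `action` would wrap whatever step the job uses to delete the release and report success as a boolean. A short pause between attempts gives a briefly unresponsive cluster time to recover, while the fixed upper bound keeps a persistent failure from stalling the pipeline.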
MR acceptance checklist
This checklist encourages us to confirm any changes have been analyzed to reduce risks in quality, performance, reliability, security, and maintainability.
- I have evaluated the MR acceptance checklist for this MR.