Improve alert details experience
As part of usability testing in #1555 (closed), we discovered that the default alert details page doesn't contain enough information by default to triage the alert.
Some things users mentioned missing/found confusing:
- Needed more details about how the alert was set up
- Needed more details about the current reported values, and information about thresholds that have been exceeded, baseline values
- Wanted to see health status of the associated cluster
- Iid field - people weren't sure what that was (just ID but, that wasn't clear to people)
- Expected a short summary of the alert and what the expected behavior should be
- Needed more descriptive alert titles
- Wanted to see relevant metrics, logs
Some of these things could be improved by structuring the alert payload differently/better to include more relevant information. But, are there things we can do to encourage people to add these details to their alerts? Should we consider adding additional fields (for example, with links to logs, links to runbooks, current reported values, thresholds) so that alert detail pages are more robust, by default?
Edited by Amelia Bauerly