Alerting Center (Beta)
The Alerting Center is a beta-phase feature developed by Nobl9 to help you manage site reliability and respond to production issues efficiently. It provides a centralized dashboard with a heat-mapped timeline of all past and active alerts related to your Service Level Objectives (SLOs).
Using Nobl9 Alerting Center you can get:
Overview
You can access Alerting Center from the Alerts tab in the main navigation panel.
Alerting Center and RBAC
The Alerting Center displays all alerts for SLOs in projects where you have at least view permission. See RBAC for more details.
Heatmap
By default, the Alerting Center displays alerts fired in your organization in the last 24 hours. You can choose a larger time window by clicking the time window picker and selecting the desired range.
Heatmap buckets
The display unit of the Alerting Center consists of 60 buckets that are used to aggregate data. The level of aggregation increases as you extend the time window.
For example, if you select a time window of 1 hour, each visible bucket aggregates data for 60 seconds. By the same token, if you choose a 24-hour time window, each bucket aggregates data for 24 minutes. Finally, for a 1-week window, each bucket represents 168 minutes.
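The bucket arithmetic described here can be sketched in a few lines of Python. This is an illustration only, assuming the selected time window is split evenly across the 60 buckets. The names `bucket_width` and `bucket_index` are hypothetical helpers, not part of any Nobl9 API.

```python
from datetime import datetime, timedelta

BUCKET_COUNT = 60  # number of heatmap buckets, per the description above


def bucket_width(window: timedelta) -> timedelta:
    """Return the span of time one bucket covers, assuming an even split."""
    return window / BUCKET_COUNT


def bucket_index(alert_time: datetime, window_start: datetime, window: timedelta) -> int:
    """Return the 0-based bucket an alert falls into (hypothetical helper)."""
    offset = alert_time - window_start
    if not timedelta(0) <= offset < window:
        raise ValueError("alert is outside the selected time window")
    return offset // bucket_width(window)


print(bucket_width(timedelta(hours=1)))   # 0:01:00
print(bucket_width(timedelta(hours=24)))  # 0:24:00
print(bucket_width(timedelta(weeks=1)))   # 2:48:00 (168 minutes)
```

Under the same assumption, an alert fired 25 minutes into a 24-hour window would land in the second bucket (index 1), because each bucket spans 24 minutes.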
Groupings and filters
The Alerting Center allows you to group alerts on the heatmap by projects, services, alert policies and SLOs. You can also filter the Alerting Center heatmap further by statuses (triggered, resolved) and severity (low, medium, high).
Groupings allow you to organize and visualize alerts in one click based on different aspects of your system, giving you a comprehensive view of its alerting patterns.
- Group by services to find the most alerting services in your organization.
- Filtering by Triggered status only is an easy way to monitor all alerts that are currently active in your organization.
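To make the idea of grouping and filtering concrete, here is a minimal Python sketch. It is not how Nobl9 implements the heatmap; the alert records, their field names, and the helpers `filter_alerts` and `count_by` are all hypothetical and serve only to illustrate the concept.

```python
from collections import Counter

# Hypothetical alert records; field names are illustrative, not Nobl9's schema.
alerts = [
    {"service": "checkout", "slo": "latency", "status": "triggered", "severity": "high"},
    {"service": "checkout", "slo": "availability", "status": "resolved", "severity": "low"},
    {"service": "search", "slo": "latency", "status": "triggered", "severity": "medium"},
    {"service": "checkout", "slo": "latency", "status": "resolved", "severity": "high"},
]


def filter_alerts(alerts, status=None, severity=None):
    """Keep alerts matching the given status and/or severity (None means no filter)."""
    return [
        a for a in alerts
        if (status is None or a["status"] == status)
        and (severity is None or a["severity"] == severity)
    ]


def count_by(alerts, key):
    """Count alerts per group (for example, per service), most alerting first."""
    return Counter(a[key] for a in alerts).most_common()


print(count_by(alerts, "service"))  # [('checkout', 3), ('search', 1)]
print(len(filter_alerts(alerts, status="triggered")))  # 2
```

Grouping by `"service"` surfaces the most alerting service first, and filtering by the `"triggered"` status leaves only the alerts that are still active.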
Sorting by resource's name
The resources shown on the alert heatmap are currently sorted in alphabetical order based on their names. However, the legend on the heatmap's y-axis only displays their display names.
Consequently, you may notice that display names sometimes appear out of order. For instance, the SLO for intake response (response-duration) has a name starting with 'r', which causes it to be displayed after newrelic-rolling-timeslices-threshold.
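The mismatch between sort key and legend label can be reproduced with a small Python sketch. The two `name` values come from the example in the text; the `displayName` values are invented for illustration, since the actual display names are not given here.

```python
# Hypothetical resources: the `name` values come from the text above,
# the `displayName` values are invented for illustration.
slos = [
    {"name": "response-duration", "displayName": "Intake response"},
    {"name": "newrelic-rolling-timeslices-threshold", "displayName": "Rolling timeslices"},
]

# The heatmap order follows `name`...
by_name = sorted(slos, key=lambda s: s["name"])

# ...but the y-axis legend shows `displayName`, so the labels can look unsorted.
legend = [s["displayName"] for s in by_name]
print(legend)                    # ['Rolling timeslices', 'Intake response']
print(legend == sorted(legend))  # False
```

The legend is correctly ordered by `name`, yet a reader who sees only the display names would expect "Intake response" to come first.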
Groupings and navigation to other tabs in Nobl9 web
When you click any colored bucket on the heatmap, Nobl9 displays a list of all affected SLOs for which an alert fired in that time frame.
Currently, clicking the SLO links on the heatmap doesn't preserve filtering by severity on the SLO Details > Alerts view. To continue browsing through the heatmap with preserved filters and grouping, return to the browser tab you were redirected from.
Limitations overview
During its beta phase, the Alerting Center has the following limitations:
- It displays the most recent 20000 alerts within your organization.
- Resources on the alert heatmap are currently sorted alphabetically by their names, not displayNames.
- Clicking on SLO links in the heatmap doesn't retain filtering by severity in the SLO details > Alerts view.
- Labels for active alerts and severity remain unchanged when filtering through the heatmap view.