Skip to content
Merged
Show file tree
Hide file tree
Changes from 2 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
46 changes: 30 additions & 16 deletions content/en/service_management/service_level_objectives/_index.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ further_reading:
text: "Create and manage SLOs with Terraform"
- link: "https://www.datadoghq.com/blog/burn-rate-is-better-error-rate/"
tag: "Blog"
text: "Burn Rate is a Better Error Rate"
text: "Burn Rate is a Better Error Rate"
---

{{< jqmath-vanilla >}}
Expand Down Expand Up @@ -72,13 +72,13 @@ For a full comparison, see the [SLO Type Comparison][1] chart.

## Setup

Use Datadog's [Service Level Objectives status page][2] to create new SLOs or to view and manage all your existing SLOs.
Use Datadog's [Service Level Objectives manage page][2] to create new SLOs or to view and manage all your existing SLOs.

### Configuration

1. On the [SLO status page][2], select **New SLO +**.
1. On the [SLO manage page][2], select **New SLO +**.
2. Select the SLO type. You can create an SLO with any of the following types: [Metric-based][3], [Monitor-based][4], or [Time Slices][5].
3. Set a target and a rolling time window (past 7, 30, or 90 days) for the SLO. Datadog recommends you make the target stricter than your stipulated SLAs. If you configure more than one time window, select one to be the primary time window. This time window is displayed on SLO lists. By default, the shortest time window is selected.
3. Set a target and a rolling time window (past 7, 30, or 90 days) for the SLO. Datadog recommends you make the target stricter than your stipulated SLAs. If you configure more than one time window, select one to be the primary time window. This time window is displayed on SLO lists. By default, the shortest time window is selected.
4. Finally, give the SLO a title, describe it in more detail or add links in the description, add tags, and save it.

After you set up the SLO, select it from the [Service Level Objectives list view][2] to open the details side panel. The side panel displays the overall status percentage and remaining error budget for each of the SLO's targets, as well as status bars (monitor-based SLOs) or bar graphs (metric-based SLOs) of the SLI's history. If you created a grouped monitor-based SLO using one [multi alert monitor][6] or a grouped metric-based SLO using the [`sum by` clause][7], the status percentage and remaining error budget for each individual group is displayed in addition to the overall status percentage and remaining error budget.
Expand Down Expand Up @@ -115,12 +115,12 @@ To create, edit, and delete status corrections, users require the `slos_correcti

### Granular access controls

Restrict access to individual SLOs by specifying a list of [roles][10] that are allowed to edit it.
Restrict access to individual SLOs by specifying a list of [roles][10] that are allowed to edit it.

{{< img src="service_management/service_level_objectives/slo_set_permissions.png" style="width:100%; background:none; border:none; box-shadow:none;" alt="SLO permissions option in the cog menu">}}

1. Click on the SLO to open the details side panel.
1. Click the cog icon in the upper right of the panel.
1. Click on the SLO to open the details side panel.
1. Click the cog icon in the upper right of the panel.
1. Select **Permissions**.
1. Click **Restrict Access**.
1. The dialog box updates to show that members of your organization have **Viewer** access by default.
Expand All @@ -135,7 +135,7 @@ To maintain your edit access to the SLO, the system requires you to include at l

## Searching SLOs

The [Service Level Objectives status page][2] lets you run an advanced search of all SLOs so you can find, view, edit, clone or delete SLOs from the search results.
The [Service Level Objectives manage page][2] lets you run an advanced search of all SLOs so you can find, view, edit, clone or delete SLOs from the search results.

Advanced search lets you query SLOs by any combination of SLO attributes:

Expand All @@ -161,13 +161,27 @@ Sort SLOs by the *status* and *error budget* columns to prioritize which SLOs ne

### SLO tags

SLO tags can be used for filtering on the [SLO status page][2], creating [SLO saved views][17], or grouping SLOs to view. Tags can be added to SLOs in the following ways:
SLO tags can be used for filtering on the [SLO manage page][2], creating [SLO saved views][17], or grouping SLOs to view. Tags can be added to SLOs in the following ways:

- When you create or edit an SLO, you can add tags
- From the SLO list view, you can add and update tags in bulk using the *Edit Tags* and the *[Edit Teams][18]* dropdown options at the top of the SLO list.

{{< img src="service_management/service_level_objectives/slo_bulk_tag.png" alt="SLO list page displays the Edit Tag dropdown for bulk tag editing" >}}

### SLO burn rate indicator

Burn rate indicators identify SLOs that are consuming their error budget faster than their configured target permits within the rolling 2-hour evaluation window. They appear next to the applicable SLO names on the [SLO manage page][2].

{{< img src="/service_management/service_level_objectives/slo_burn_rate_indicator.png" alt="SLO burn rate indicator example" style="width:80%;" >}}

There are two possible indicator types:
- A red icon indicating a critical burn rate above 6 in the past 2 hours.
- A yellow icon indicating an elevated burn rate between 1 and 6 in the past 2 hours.

A visual chart accompanies each indicator to show where the burn rate falls relative to the elevated and critical thresholds, allowing quick assessment of the severity.

SLOs can be filtered by burn rate status: Critical, Elevated, and Healthy. For SLOs with a service tag, each burn rate indicator includes a direct link to the related service page for further investigation.

### SLO default view

The default SLO view is loaded when you land on the SLO list view.
Expand Down Expand Up @@ -258,11 +272,11 @@ With [Event Monitors][28], you can set up notifications to track SLO audit event
Learn without cost on real cloud compute capacity and a Datadog trial account. Enroll today to learn more about building Dashboards to track SLOs.
{{< /learning-center-callout >}}

After creating your SLO, you can visualize the data through Dashboards and widgets.
After creating your SLO, you can visualize the data through Dashboards and widgets.
- Use the SLO widget to visualize the status of a single SLO
- Use the SLO List widget to visualize a set of SLOs
- Graph 15 months' worth of metric-based SLO data with the [SLO data source][20] in both timeseries and scalar (query value, top list, table, change) widgets.
- Graph 15 months' worth of metric-based SLO data with the [SLO data source][20] in both timeseries and scalar (query value, top list, table, change) widgets.

For more information about SLO Widgets, see the [SLO widget][21] and [SLO List widget][22] pages. For more information on the SLO data source, see the guide on how to [Graph historical SLO data on Dashboards][20].

## SLO status corrections
Expand All @@ -277,7 +291,7 @@ When you apply a correction, the time period you specify is dropped from the SLO
- For metric-based SLOs, all good and bad events in the correction window are not counted.
- For Time Slice SLOs, the correction time window is treated as uptime.

You have the option to create one-time corrections for ad hoc adjustments, or recurring corrections for predictable adjustments that occur on a regular cadence. One-time corrections require a start and end time, while recurring corrections require a start time, duration, and interval. Recurring corrections are based on [iCalendar RFC 5545's RRULE specification][24]. The supported rules are `FREQ`, `INTERVAL`, `COUNT`, and `UNTIL`. Specifying an end date for recurring corrections is optional in case you need the correction to repeat indefinitely.
You have the option to create one-time corrections for ad hoc adjustments, or recurring corrections for predictable adjustments that occur on a regular cadence. One-time corrections require a start and end time, while recurring corrections require a start time, duration, and interval. Recurring corrections are based on [iCalendar RFC 5545's RRULE specification][24]. The supported rules are `FREQ`, `INTERVAL`, `COUNT`, and `UNTIL`. Specifying an end date for recurring corrections is optional in case you need the correction to repeat indefinitely.

For either type of correction, you must select a correction category that states why the correction is being made. The available categories are `Scheduled Maintenance`, `Outside Business Hours`, `Deployment`, and `Other`. You can optionally include a description to provide additional context if necessary.

Expand Down Expand Up @@ -314,13 +328,13 @@ To view, edit, and delete existing status corrections, click on the **Correction

#### Visualizing status corrections

For Metric-based and Time Slice SLOs with status corrections, there is a toggle in the SLO detail view that lets you enable or disable corrections in the UI. The toggle controls the charts and data in the "History" section of the SLO detail view. **Note:** Your overall SLO status and error budget will always take status corrections into consideration.
For Metric-based and Time Slice SLOs with status corrections, there is a toggle in the SLO detail view that lets you enable or disable corrections in the UI. The toggle controls the charts and data in the "History" section of the SLO detail view. **Note:** Your overall SLO status and error budget will always take status corrections into consideration.

{{< img src="service_management/service_level_objectives/correction-toggle.png" alt="SLO correction UI" style="width:100%;">}}

## SLO calendar view

The SLO Calendar View is available on the [SLO status page][2]. On the top right corner, switch from the "Primary" view to the "Daily", "Weekly", or "Monthly" view to see 12 months of historical SLO status data. The Calendar View is supported for Metric-based SLOs and Time Slice SLOs.
The SLO Calendar View is available on the [SLO manage page][2]. On the top right corner, switch from the "Primary" view to the "Daily", "Weekly", or "Monthly" view to see 12 months of historical SLO status data. The Calendar View is supported for Metric-based SLOs and Time Slice SLOs.

{{< img src="service_management/service_level_objectives/slo-calendar-view-2.png" alt="SLO calendar view" >}}

Expand All @@ -330,7 +344,7 @@ The SLO Calendar View is available on the [SLO status page][2]. On the top right
The CSV Export feature is in Preview. Complete the form to request access.
{{< /callout >}}

The SLO CSV Export feature is available on the [SLO status page][2] once you switch to the "Weekly" or "Monthly" Calendar View. In these views, you can access the new "Export to CSV" option to download a CSV of your historical SLO data with the following information:
The SLO CSV Export feature is available on the [SLO manage page][2] once you switch to the "Weekly" or "Monthly" Calendar View. In these views, you can access the new "Export to CSV" option to download a CSV of your historical SLO data with the following information:

- SLO id, name, and type
- SLO tags
Expand Down
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading