LogicMonitor


LogicMonitor is a SaaS-based platform that automates IT system monitoring. It provides a centralized view of IT components, including networks, cloud environments, servers, and applications, along with insights into the collected data.

Nobl9 integrates with LogicMonitor, enabling you to create threshold and ratio SLOs for tracking the reliability of your LogicMonitor devices and websites. To do this, select the required query type—device_metrics or website_metrics—in Nobl9 and provide the necessary parameters for your LogicMonitor device or website.

LogicMonitor parameters and supported features in Nobl9

General support:
- Release channel: Beta
- Connection method: Agent, Direct
- Replay and SLI Analyzer: Historical data limit 30 days
- Event logs: Supported
- Query checker: Not supported
- Query parameters retrieval: Supported
- Timestamp cache persistence: Supported

Query parameters:
- Query interval: 1 min
- Query delay: 2 min
- Jitter: 15 sec
- Timeout: 30 sec

Agent details and minimum required versions for supported features:
- Plugin name: n9logic_monitor
- Query delay environment variable: LOGIC_MONITOR_QUERY_DELAY
- Replay and SLI Analyzer: 0.81.0-beta
- Query parameters retrieval: 0.76.0-beta
- Timestamp cache persistence: 0.76.0-beta

Additional notes:
- Create SLOs to track your LogicMonitor devices and websites
- Bad over total ratio metrics

Creating SLOs with LogicMonitor

You can create SLOs using device or website LogicMonitor metrics.

Nobl9 Web

Navigate to Service Level Objectives.
Click .
Select a Service.
It will be the location for your SLO in Nobl9.
Select your LogicMonitor data source.
Modify Period for Historical Data Retrieval, when necessary.
- This value defines how far back in the past your data will be retrieved when replaying your SLO based on LogicMonitor.
- A longer period can extend the data loading time for your SLO.
- The value must be a positive whole number no greater than the maximum period you set when adding the LogicMonitor data source.
Select the Metric type:
- Threshold metric: a single time series is evaluated against a threshold.
- Ratio metric: two time series are compared with each other.

A LogicMonitor query must follow one of the patterns below:

Device metrics
Website metrics

Query Type: device_metrics
Device Data Source Instance ID: the identifier of a monitoring rule configuration or occurrence applied to your LogicMonitor device.
Graph ID: a 5-symbol graph configuration.
Line: the Datapoint value.
Enter the Line value in uppercase.

Find out how to retrieve these values in LogicMonitor.

SLI data retrieval

Tips on retrieving SLI data in LogicMonitor provide instructions on retrieving the required values.

Define the Time window for your SLO:
- Rolling time windows constantly move forward as time passes. This type can help track the most recent events.
- Calendar-aligned time windows are usable for SLOs intended to map to business metrics measured on a calendar-aligned basis.
Configure the Error budget calculation method and Objectives:
- Occurrences method counts good attempts against the count of total attempts.
- Time Slices method measures how many good minutes were achieved (when a system operates within defined boundaries) during a time window.
- You can define up to 12 objectives for an SLO.
Add the Display name, Name, and other settings for your SLO:
- Name identifies your SLO in Nobl9. After you save the SLO, its name becomes read-only.
  Use only lowercase letters, numbers, and dashes.
- Select No data anomaly alert to receive notifications when your SLO stops reporting data for a specified period:
  - Choose up to five supported Alert methods.
  - Specify the delay period before Nobl9 sends an alert about the missing data.
    From 5 minutes to 31 days. Default: 15 minutes
- Add alert policies, labels, and links, if required.
  Limits per SLO: 20 alert policies or links, 30 labels.
Click CREATE SLO.
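
The difference between the two error budget calculation methods listed in the steps above can be illustrated with a short sketch. It is not Nobl9's implementation: the function names are hypothetical, and the per-minute good or bad verdict for the Time Slices method is assumed to be already known.

```python
def occurrences_reliability(good_counts, total_counts):
    """Occurrences: good attempts divided by total attempts over the window."""
    total = sum(total_counts)
    if total == 0:
        raise ValueError("no attempts in the time window")
    return sum(good_counts) / total


def time_slices_reliability(minute_is_good):
    """Time Slices: share of minutes that were good during the window.

    `minute_is_good` holds one boolean per minute (an assumption of this
    sketch: the verdict for each minute is computed elsewhere).
    """
    if not minute_is_good:
        raise ValueError("no minutes in the time window")
    return sum(1 for ok in minute_is_good if ok) / len(minute_is_good)
```

For the same three minutes, `occurrences_reliability([98, 100, 50], [100, 100, 100])` weighs every attempt equally, while `time_slices_reliability([True, True, False])` weighs every minute equally, so one bad minute costs a third of the window regardless of its traffic.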

SLO configuration use case

Check the SLO configuration use case for a real-life SLO example.

sloctl

To create an SLO based on LogicMonitor, pass the required configuration with the sloctl apply -f command. Your configuration must follow the pattern for either threshold (rawMetric) or ratio (countMetrics) metrics.

Refer to the YAML guide > SLO for more information about the fields.

Breaking changes in Nobl9 agent 0.87.0-beta

Nobl9 agent 0.87.0-beta and later require quotation marks around these fields:

LogicMonitor device SLOs:
- deviceDataSourceInstanceId
- graphId
LogicMonitor website SLOs:
- websiteId
- checkpointId
- graphName

Omitting quotation marks in these fields with Nobl9 agent 0.87.0-beta or later results in an error. Earlier agent versions do not require quotation marks.

Review and correct your LogicMonitor device and website SLOs as needed.
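
To help with that review, a small script can flag the affected fields when their values are not quoted. The following is a minimal sketch, assuming a purely textual, line-by-line check rather than a real YAML parse. The function name `find_unquoted_fields` is hypothetical and not part of sloctl or the Nobl9 SDK.

```python
import re

# Fields that require quotation marks with Nobl9 agent 0.87.0-beta and later.
QUOTED_FIELDS = (
    "deviceDataSourceInstanceId",
    "graphId",
    "websiteId",
    "checkpointId",
    "graphName",
)

# "<field>: <value>" where the value does not start with a quote character.
# A trailing "# comment" is tolerated and excluded from the value.
_UNQUOTED = re.compile(
    r"^\s*(?P<field>" + "|".join(QUOTED_FIELDS) + r")\s*:\s*"
    r"(?P<value>[^'\"\s#][^#]*?)\s*(?:#.*)?$"
)


def find_unquoted_fields(yaml_text):
    """Return (line_number, field, value) for every unquoted occurrence."""
    findings = []
    for number, line in enumerate(yaml_text.splitlines(), start=1):
        match = _UNQUOTED.match(line)
        if match:
            findings.append((number, match.group("field"), match.group("value")))
    return findings
```

Running it over the text of an SLO definition returns the line numbers to correct. For example, a line such as `graphId: 11438` is reported, while `graphId: "11438"` is not.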

Device metrics

Threshold (rawMetric) device metrics
Ratio good over total (countMetric) device metrics
Ratio bad over total (countMetric) device metrics

Sample LogicMonitor threshold SLO
apiVersion: n9/v1alpha
kind: SLO
metadata:
  name: api-server-slo
  displayName: API Server SLO
  project: default
  labels:
    area:
      - latency
      - slow-check
    env:
      - prod
      - dev
    region:
      - us
      - eu
    team:
      - green
      - sales
  annotations:
    area: latency
    env: prod
    region: us
    team: sales
spec:
  description: Example LogicMonitor SLO
  indicator:
    metricSource:
      name: logic-monitor
      project: default
      kind: Agent
  budgetingMethod: Occurrences
  objectives:
    - displayName: Good response (200)
      value: 200
      name: ok
      target: 0.95
      rawMetric:
        query:
          logicMonitor:
            queryType: device_metrics
            line: CONNECTIONSESTABLISHED
            deviceDataSourceInstanceId: '933147615'
            graphId: '11436'
      op: lte
      primary: true
  service: api-server
  timeWindows:
    - unit: Month
      count: 1
      isRolling: false
      calendar:
        startTime: '2022-12-01 00:00:00'
        timeZone: UTC
  alertPolicies:
    - fast-burn-5x-for-last-10m
  attachments:
    - url: https://docs.nobl9.com
      displayName: Nobl9 Documentation
  anomalyConfig:
    noData:
      alertMethods:
        - name: slack-notification
          project: default
      alertAfter: 1h

Sample LogicMonitor ratio SLO (good over total)
apiVersion: n9/v1alpha
kind: SLO
metadata:
  name: api-server-slo
  displayName: API Server SLO
  project: default
  labels:
    area:
      - latency
      - slow-check
    env:
      - prod
      - dev
    region:
      - us
      - eu
    team:
      - green
      - sales
  annotations:
    area: latency
    env: prod
    region: us
    team: sales
spec:
  description: Example LogicMonitor SLO
  indicator:
    metricSource:
      name: logic-monitor
      project: default
      kind: Agent
  budgetingMethod: Occurrences
  objectives:
    - displayName: Good response (200)
      value: 1
      name: ok
      target: 0.95
      countMetrics:
        incremental: true
        good:
          logicMonitor:
            queryType: device_metrics
            line: CONNECTIONSUCCESSES
            deviceDataSourceInstanceId: '933147615'
            graphId: '11438'
        total:
          logicMonitor:
            queryType: device_metrics
            line: CONNECTIONSESTABLISHED
            deviceDataSourceInstanceId: '933147615'
            graphId: '11436'
      primary: true
  service: api-server
  timeWindows:
    - unit: Month
      count: 1
      isRolling: false
      calendar:
        startTime: '2022-12-01 00:00:00'
        timeZone: UTC
  alertPolicies:
    - fast-burn-5x-for-last-10m
  attachments:
    - url: https://docs.nobl9.com
      displayName: Nobl9 Documentation
  anomalyConfig:
    noData:
      alertMethods:
        - name: slack-notification
          project: default
      alertAfter: 1h

Sample LogicMonitor ratio SLO (bad over total)
apiVersion: n9/v1alpha
kind: SLO
metadata:
  name: api-server-slo
  displayName: API Server SLO
  project: default
  labels:
    area:
      - latency
      - slow-check
    env:
      - prod
      - dev
    region:
      - us
      - eu
    team:
      - green
      - sales
  annotations:
    area: latency
    env: prod
    region: us
    team: sales
spec:
  description: Example LogicMonitor SLO
  indicator:
    metricSource:
      name: logic-monitor
      project: default
      kind: Agent
  budgetingMethod: Occurrences
  objectives:
    - displayName: Good response (200)
      value: 1
      name: ok
      target: 0.95
      countMetrics:
        incremental: true
        bad:
          logicMonitor:
            queryType: device_metrics
            line: CONNECTIONFAILURES
            deviceDataSourceInstanceId: '933147615'
            graphId: '11437'
        total:
          logicMonitor:
            queryType: device_metrics
            line: CONNECTIONSESTABLISHED
            deviceDataSourceInstanceId: '933147615'
            graphId: '11436'
      primary: true
  service: api-server
  timeWindows:
    - unit: Month
      count: 1
      isRolling: false
      calendar:
        startTime: '2022-12-01 00:00:00'
        timeZone: UTC
  alertPolicies:
    - fast-burn-5x-for-last-10m
  attachments:
    - url: https://docs.nobl9.com
      displayName: Nobl9 Documentation
  anomalyConfig:
    noData:
      alertMethods:
        - name: slack-notification
          project: default
      alertAfter: 1h

Website metrics

Threshold (rawMetric) website metrics
Ratio good over total (countMetric) website metrics
Ratio bad over total (countMetric) website metrics

Sample LogicMonitor threshold SLO
apiVersion: n9/v1alpha
kind: SLO
metadata:
  name: api-server-slo
  displayName: API Server SLO
  project: default
  labels:
    area:
      - latency
      - slow-check
    env:
      - prod
      - dev
    region:
      - us
      - eu
    team:
      - green
      - sales
  annotations:
    area: latency
    env: prod
    region: us
    team: sales
spec:
  description: Example LogicMonitor SLO
  indicator:
    metricSource:
      name: logic-monitor
      project: default
      kind: Agent
  budgetingMethod: Occurrences
  objectives:
    - displayName: Good response (200)
      value: 200.0
      name: ok
      target: 0.95
      rawMetric:
        query:
          logicMonitor:
            queryType: website_metrics
            line: MIN RTT
            websiteId: "1"
            checkpointId: "1044712023"
            graphName: "responseTime"
      op: lte
      primary: true
  service: api-server
  timeWindows:
    - unit: Month
      count: 1
      isRolling: false
      calendar:
        startTime: "2022-12-01 00:00:00"
        timeZone: UTC
  alertPolicies:
    - fast-burn-5x-for-last-10m
  attachments:
    - url: https://docs.nobl9.com
      displayName: Nobl9 Documentation
  anomalyConfig:
    noData:
      alertMethods:
        - name: slack-notification
          project: default
      alertAfter: 1h

Sample LogicMonitor ratio SLO (good over total)
apiVersion: n9/v1alpha
kind: SLO
metadata:
  name: api-server-slo
  displayName: API Server SLO
  project: default
  labels:
    area:
      - latency
      - slow-check
    env:
      - prod
      - dev
    region:
      - us
      - eu
    team:
      - green
      - sales
  annotations:
    area: latency
    env: prod
    region: us
    team: sales
spec:
  description: Example LogicMonitor SLO
  indicator:
    metricSource:
      name: logic-monitor
      project: default
      kind: Agent
  budgetingMethod: Occurrences
  objectives:
    - displayName: Good response (200)
      value: 1
      name: ok
      target: 0.95
      countMetrics:
        incremental: true
        good:
          logicMonitor:
            queryType: website_metrics
            checkpointId: "7615"
            graphName: "responseTime"
            websiteId: "123213"
            line: CONNECTIONFAILURES
        total:
          logicMonitor:
            queryType: website_metrics
            checkpointId: "7615"
            graphName: "responseTime"
            websiteId: "123213"
            line: CONNECTIONFAILURES
      primary: true
  service: api-server
  timeWindows:
    - unit: Month
      count: 1
      isRolling: false
      calendar:
        startTime: "2022-12-01 00:00:00"
        timeZone: UTC
  alertPolicies:
    - fast-burn-5x-for-last-10m
  attachments:
    - url: https://docs.nobl9.com
      displayName: Nobl9 Documentation
  anomalyConfig:
    noData:
      alertMethods:
        - name: slack-notification
          project: default

Sample LogicMonitor ratio SLO (bad over total)
apiVersion: n9/v1alpha
kind: SLO
metadata:
  name: api-server-slo
  displayName: API Server SLO
  project: default
  labels:
    area:
      - latency
      - slow-check
    env:
      - prod
      - dev
    region:
      - us
      - eu
    team:
      - green
      - sales
  annotations:
    area: latency
    env: prod
    region: us
    team: sales
spec:
  description: Example LogicMonitor SLO
  indicator:
    metricSource:
      name: logic-monitor
      project: default
      kind: Agent
  budgetingMethod: Occurrences
  objectives:
    - displayName: Good response (200)
      value: 1
      name: ok
      target: 0.95
      countMetrics:
        incremental: true
        bad:
          logicMonitor:
            queryType: website_metrics
            checkpointId: "7615"
            graphName: "responseTime"
            websiteId: "123213"
            line: CONNECTIONFAILURES
        total:
          logicMonitor:
            queryType: website_metrics
            checkpointId: "7615"
            graphName: "responseTime"
            websiteId: "123213"
            line: CONNECTIONFAILURES
      primary: true
  service: api-server
  timeWindows:
    - unit: Month
      count: 1
      isRolling: false
      calendar:
        startTime: "2022-12-01 00:00:00"
        timeZone: UTC
  alertPolicies:
    - fast-burn-5x-for-last-10m
  attachments:
    - url: https://docs.nobl9.com
      displayName: Nobl9 Documentation
  anomalyConfig:
    noData:
      alertMethods:
        - name: slack-notification
          project: default

SLI values for good and total

When choosing the query for the ratio SLI (countMetrics), keep in mind that the values resulting from that query for both good and total:

- Must be positive.
- Can be integers (recommended) or fractions. If you use fractions, we recommend values larger than 1e-4 = 0.0001.
- Shouldn't be larger than 1e+20.
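
These constraints can be expressed as a quick sanity check for sampled good and total data points. This is a minimal sketch, not a Nobl9 validation routine. The function name `check_ratio_value` is hypothetical, and the sketch reads "positive" strictly, so it reports zero as well.

```python
# Bounds taken from the guidance above.
MIN_RECOMMENDED_FRACTION = 1e-4
MAX_VALUE = 1e20


def check_ratio_value(value):
    """Return a list of issues for one good or total data point (empty if fine)."""
    issues = []
    if value <= 0:
        issues.append("must be positive")
    elif value <= MIN_RECOMMENDED_FRACTION:
        # A recommendation rather than a hard limit.
        issues.append("fraction is not larger than 1e-4")
    if value > MAX_VALUE:
        issues.append("larger than 1e+20")
    return issues
```

An empty list means the value meets all three points. For instance, `check_ratio_value(0.00005)` reports the small-fraction recommendation only.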

Find more SLO samples in the Nobl9 SDK.

To check your created SLO with sloctl, run sloctl get slos. You can also find it in Nobl9 Web on the SLO grid page.

You can replay your LogicMonitor SLOs in the following ways:

Go to the required SLO details. Click More Actions > Run Replay.
Run sloctl replay.

Tips on retrieving SLI data in LogicMonitor

For correct monitoring, Nobl9 SLOs based on LogicMonitor require specific data, depending on the resource you need to monitor: a device or a website. To retrieve the required values, do the following:

I want to monitor a device
I want to monitor a website

Log in to your LogicMonitor dashboard.
In the navigation sidebar, click Resource Tree.
All your monitored devices are listed here.
Select the required Resource > DataSource > instance.

Device Data Source Instance ID

It's the identifier of a monitoring rule configuration or occurrence applied to a LogicMonitor device you need to monitor.

LogicMonitor UI location:

Go to the Info tab

Copy the system.instanceID property value

API endpoint for retrieval:

GET /device/devices/{deviceId}/devicedatasources/{hdsId}/instances

Nobl9 YAML definition:

deviceDataSourceInstanceId

Type: string. Put its value in quotation marks

Graph ID

Refers to the graph configuration associated with your required device and defines how monitoring data collected by this device is visualized and presented in LogicMonitor

LogicMonitor UI location:

The Graphs tab > click ellipses > Graph Definition (opens in the new tab)

The required device's URL ending

For example, XXXXX in ...exchangeDataSourceGraphs-XXXXX

API endpoint for retrieval:

GET /device/devices/{deviceId}/devicedatasources/{id}

Nobl9 YAML definition:

graphId

Type: string. Put its value in quotation marks

Line

Refers to the datapoint of your required graph

LogicMonitor UI location:

The Graphs tab > click ellipses > Graph Definition (opens in the new tab)

The Datapoint value of your required graph found in the Lines table or the line.label value

API endpoint for retrieval:

GET /device/devicedatasourceinstances/{instanceId}/graphs/{graphId}/data

Nobl9 YAML definition:

line

Pass its value in uppercase

Log in to your LogicMonitor dashboard.
In the navigation sidebar, click ellipses (More) > Websites.
Select your required website in the navigation tree.

Website ID

The ID of the monitored website. This value uniquely identifies the website you track within LogicMonitor

LogicMonitor UI location:

The address bar with the required website selected

The number the website's URL ends with

For example, XXX in .../treeNodes#websiteGroups-5*,websites-XXX

API endpoint for retrieval:

GET /website/websites

Use the items.id value

Nobl9 YAML definition:

websiteId

Type: string. Put its value in quotation marks

Graph Name

The identifier of the specific graph you are interested in

LogicMonitor UI location:

The Graphs tab

The name of the graph displaying the website metric

Available values: status, performance (for some graphs), responsetime

API endpoint for retrieval:

Retrieval is available using the LogicMonitor UI only

Nobl9 YAML definition:

graphName

Type: string. Put its value in quotation marks in your YAML definition

Checkpoint ID

The ID of a checkpoint associated with a website. A checkpoint is a specific monitoring location or probe used by LogicMonitor to test website availability and performance

LogicMonitor UI location:

Retrieval available with the API call only

API endpoint for retrieval:

GET /website/websites/{websiteID}

Use the checkpoints.id value

Nobl9 YAML definition:

checkpointId

Type: string. Put its value in quotation marks in your YAML definition

Line

Refers to the label of your required graph's line

LogicMonitor UI location:

Retrieval available with the API call only

API endpoint for retrieval:

GET /website/websites/{websiteID}/checkpoints/{checkpointID}/graphs/{graphName}/data

Use the line.label value

Nobl9 YAML definition:

line

Pass its value in uppercase
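
The Line lookups for devices and websites described above can be sketched as follows. The helpers only build the endpoint paths listed in this section and extract line labels from an already decoded response. Authentication, the base URL, and the HTTP call itself are left out. The function names are hypothetical, and the response shape (a "lines" list whose items carry a "label" key) is an assumption based on the line.label reference above, so verify it against an actual response.

```python
def device_graph_data_path(instance_id, graph_id):
    """Path of the device graph data endpoint."""
    return f"/device/devicedatasourceinstances/{instance_id}/graphs/{graph_id}/data"


def website_graph_data_path(website_id, checkpoint_id, graph_name):
    """Path of the website graph data endpoint."""
    return (
        f"/website/websites/{website_id}/checkpoints/{checkpoint_id}"
        f"/graphs/{graph_name}/data"
    )


def line_values(graph_data):
    """Return line labels in uppercase, as the Nobl9 `line` field expects.

    Assumes `graph_data` is a dict with a "lines" list of {"label": ...} items.
    """
    return [str(line["label"]).upper() for line in graph_data.get("lines", [])]
```

With a decoded response such as `{"lines": [{"label": "Min rtt"}]}`, `line_values` returns `["MIN RTT"]`, which can be passed as the `line` value in the SLO definition.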

LogicMonitor API rate limits

LogicMonitor limits GET requests to 500 per minute.

Read more about rate limiting in LogicMonitor.

You can optimize rate limit usage, which is particularly beneficial when you have hundreds of SLIs. To do this, keep one data source in Nobl9 per LogicMonitor instance, so that all its queries count against a single rate limit.

This makes it possible to batch queries by graphId and instanceId and to read time series from the responses per graphId, instanceId, and line (an SLI identifier). As a result, Nobl9 can query for each unique line only once, even if a graphId and instanceId pair includes multiple lines.

The maximum number of lines a graphId and instanceId pair can return varies by case and depends on how the metrics are organized.
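
The effect of batching on the request count can be illustrated with a short sketch. It is not Nobl9's actual implementation: the function name `plan_batches` and the tuple layout are hypothetical, and the sketch only shows how grouping SLIs by instanceId and graphId reduces the number of GET requests that count against the limit.

```python
from collections import defaultdict


def plan_batches(slis):
    """Group SLIs so each (instance_id, graph_id) pair is requested once.

    `slis` is an iterable of (instance_id, graph_id, line) tuples.
    Returns a dict mapping (instance_id, graph_id) to its sorted unique lines.
    """
    batches = defaultdict(set)
    for instance_id, graph_id, line in slis:
        batches[(instance_id, graph_id)].add(line)
    return {key: sorted(lines) for key, lines in batches.items()}
```

For example, four SLIs that read two lines from each of two graphs on the same instance result in two planned requests instead of four. The time series for each line is then read from the shared response.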