Splunk Observability (SignalFx)

Harness Continuous Verification (CV) integrates with Splunk Observability [SignalFx] to:

Verify that the deployed service is running safely and performing automatic rollbacks.
Apply machine learning to every deployment to identify and flag anomalies in future deployments.

This topic describes how to set up a Splunk Observability [SignalFx] health source when adding a CV step to your Continuous Deployment (CD).

important

When using Splunk Observability [SignalFx] health source, it is not possible to derive a metric by using a combination of other metrics.

Prerequisite

Ensure that Splunk Observability [SignalFx] is added as a verification provider in Harness. To learn how to add Splunk Observability [SignalFx] as a connector in Harness, go to Add Splunk Observability [SignalFX].

Set up continuous verification

To set up CV, you will need a Harness Monitored Service. In the simplest terms, a Monitored Service is basically a mapping of a Harness Service to a service monitored by your APM or logging tool.

You can set up a Monitored Service in the Verify step in a CD stage.

There are other ways to set up a Monitored Service. To learn more see: Monitored Service

In this topic we'll set up the Monitored Service as part of the Verify step.

Add Verify step

To add a Verify step to your pipeline, use one of the methods below.

Add a Verify step while building a deployment stage

If you're building a deployment stage and currently on the Execution Strategies page:

Select the Enable Verification option.

The Verify step gets added to the pipeline.
Select the Verify step.

The Verify settings page appears.

Add a Verify step to an existing deployment stage

If you already have a deployment stage:

Select the stage where you want to add the Verify step.
On the stage settings pane, select the Execution tab.
On the pipeline, hover over where you want the Verify step, select the + icon, and then choose Add Step.

The Step Library page appears. You can add a step at various points in the pipeline such as the beginning, end, in between existing steps, or below an existing step. Simply choose the location where you want to add the step and follow the prompts to add it.
In the Continuous Verification section, select Verify.

The Verify settings page appears.

Define name and time out information

In Name, enter a name for the Verification step.
In Timeout, enter a timeout value for the step. Harness uses this information to time out the verification. Use the following syntax to define timeout:
- w for weeks. For example, to define one week, enter 1w.
- d for days. For example, to define 7 days, enter 7d.
- h for hours. For example, to define 24 hours, enter 24h.
- m for minutes, For example, to define 100 minutes, enter 100m.
- s for seconds. For example, to define 500 seconds, enter 500s.
- ms for milliseconds. For example, to define 1000 milliseconds, enter 1000ms.
The maximum timeout value you can set is 53w. You can also set timeouts at the pipeline level.

Node filtering

note

Currently, this feature is behind the feature flag CV_UI_DISPLAY_NODE_REGEX_FILTER. Contact Harness Support to enable the feature.

The node filtering feature allows you to select specific nodes within your Kubernetes environment using the PodName label. This allows for focused analysis, enabling you to choose specific nodes as service instances for in-depth analysis.

Harness CV autonomously identifies new nodes as they are added to the cluster. However, the node filtering feature allows you to focus the analysis explicitly on the nodes that you want to analyze. Imagine you have a Kubernetes cluster with multiple nodes, and you want to analyze the performance of pods running on specific nodes. You want to analyze the nodes that match a certain naming pattern.

Procedure:

On the Verify settings page, expand Optional to navigate to the node filtering settings section.
(Optional) Select Use node details from CD if you want Harness CV to collect and analyze the metrics and log details for the recently deployed nodes.
Specify the Control Nodes and Test Nodes:
- Control Nodes: These are the nodes against which the test nodes are compared. You can specify the control nodes to provide a baseline for analysis.
- Test Nodes: These are the nodes that Harness CV evaluates and compares against the control nodes.
To specify the Control Nodes and Test Nodes, in one of the following ways:
- Type node names: Enter the names of specific nodes you want to include in the analysis.
- Use simple patterns (Regex): Define a regular expression pattern to match the nodes you want to filter. For example, if your nodes follow a naming convention such as "node-app-1", "node-app-2", and so on, you could use a pattern such as "node-app-*" to include all nodes with names starting with "node-app-".
Example: Let's say you want Harness CV to analyze the only nodes that have "backend" in their PodName label:
1. In the Control Nodes field, enter "backend-control-node" as the control node.
2. In the Test Nodes field, enter the pattern "backend-*" to include all nodes with names starting with "backend-".

Select a continuous verification type, sensitivity, and duration

In Continuous Verification Type, select a type that matches your deployment strategy. The following options are available:
- Auto: Harness automatically selects the best continuous verification type based on the deployment strategy.
- Rolling Update: Rolling deployment is a deployment technique that gradually replaces old versions of a service with a new version by replacing the infrastructure on which the service runs. Rolling updates are useful in situations where a sudden changeover might cause downtime or errors.
- Canary: Canary deployment involves a two-phased deployment. In phase one, new pods and instances with the new service version are added to a single environment. In phase two, a rolling update is performed in the same environment. Canary deployment helps to detect issues with the new deployment before fully deploying it.
- Blue Green: Blue-green deployment is a technique used to deploy services to a production environment by gradually shifting user traffic from an old version to a new one. The previous version is referred to as the blue environment, while the new version is known as the green environment. Upon completion of the transfer, the blue environment remains on standby in case of a need for rollback or can be removed from production and updated to serve as the template for future updates.
- Load Test: Load testing is a strategy used in lower-level environments, such as quality assurance, where a consistent load is absent and deployment validation is typically accomplished through the execution of load-generating scripts. This is useful to ensure that the application can handle the expected load and validate that the deployment is working as expected before releasing it to the production environment.
In Sensitivity, choose the sensitivity level. The available options are High, Medium, and Low. When the sensitivity is set to high, even minor anomalies are treated as verification failures. This ensures that even the slightest issue is detected and addressed before releasing the deployment to production.
In Duration, choose a duration. Harness will use the data points within this duration for analysis. For instance, if you select 10 minutes, Harness will analyze the first 10 minutes of your log or APM data. It is recommended to choose 10 minutes for logging providers and 15 minutes for APM and infrastructure providers. This helps you thoroughly analyze and detect issues before releasing the deployment to production.
In the Artifact Tag field, reference the primary artifact that you added in the Artifacts section of the Service tab. Use the Harness expression <+serviceConfig.artifacts.primary.tag> to reference this primary artifact. To learn about artifact expression, go to Harness expression.
Select Fail On No Analysis if you want the pipeline to fail if there is no data from the health source. This ensures that the deployment fails when there is no data for Harness to analyze.

Create a monitored service

Harness CV monitors the health trend deviations using logs and metrics obtained from the health source, such as APM and logging tools, via a monitored service.

To create a monitored service:

In the Monitored Service Name section, select Click to autocreate a monitored service.

Harness automatically generates a monitored service name by combining the service and environment names. The generated name appears in the Monitored Service Name field. Note that you cannot edit the monitored service name.

If a monitored service with the same name and environment already exists, the Click to autocreate a monitored service option is hidden and the existing monitored service is assigned to the Verify step by Harness.

note

If you've set up a service or environment as runtime values, the auto-create option for monitored services won't be available. When you run the pipeline, Harness combines the service and environment values to create a monitored service. If a monitored service with the same name already exists, it will be assigned to the pipeline. If not, Harness skips the Verification step.

For instance, if you input the service as todolist and the environment as dev, Harness creates a monitored service with the name todolist_dev. If a monitored service with that name exists, Harness assigns it to the pipeline. If not, Harness skips the Verification step.

Autocreate monitored service

Add a health source

A health source is an APM or logging tool that monitors and aggregates data in your deployment environment.

Define health source

To add a health source:

In the Health Sources section, select + Add New Health Source.

The Add New Health Source dialog appears.
On the Define Health Source tab, do the following:
1. In the Define Health Source section, select Splunk Observability [SignalFx] as the health source type.
2. In the Health Source Name field, enter a name for the health source.
3. In the Connect Health Source section, select Select Connector.
  The Create or Select an Existing Connector dialog appears.
4. Select a connector for the Splunk Observability [SignalFx] health source, and then select Apply Selected.
  The selected connector appears in the Select Connector dropdown.
5. Select Next.
  
  The Configuration tab appears.

note

Currently, Harness supports only Splunk Observability [SignalFx] metrics. The Splunk Observability [SignalFx] Metrics option is selected by default in the Select Feature field.

Define log configuration settings

On the Configuration tab, select + Add Metric.
The Add Metric dialog appears.
Enter the following information, and then select Submit:
- Metric name: Enter a name for the metric. For example, Memory Metric.
- Group name: If the group to which you want to add the metric already exists, select it.
  If you want to create a new group, select + Add New. In the Add Group Name dialog enter a group name, and then select Submit.
In the Add Metric dialog, select Submit.
New group and metric are created. The query specifications and mapping settings are displayed. These settings help you get the desired metric data from the Splunk Observability [SignalFx] platform and map it to Harness service. To learn about Splunk Observability [SignalFx] metrics and queries, go to https://dev.splunk.com/observability/docs/signalflow.

Define a query

In the Query box, enter your metric query, and then select Run Query.
Sample data is displayed in the Records box. The Chart box displays the graph corresponding to the sample data. This helps you verify if the query that you have built is correct.

Sample query to check the number of Kubernetes pods in ready state

Query: data("k8s.container.ready")

Query - disk usage

Assign services

In the Assign section, select the services to which you want to apply the Sumo Logic metric. Following options are available:

Continuous Verification (Applied to the pipelines in the Continuous Deployment): Select this option to use the metric data in the Continuous Deployment pipeline to ensure that the deployed service is running safely and to perform automatic rollbacks. In addition, the metric will be used to apply Machine Learning in detecting and highlighting future deployment issues.
Service Health: Select this option to use the metric data to track the changes in the health trend of your monitored service.
Service Level Indicator (SLI): Select this option to use the metric data to measure the SLI and obtain the performance of the service.

Configure risk profile

If you select Continuous Verification (Applied to the pipelines in the Continuous Deployment) or Service Health, expand the section below and follow the instructions for configuring the risk profile.

Risk Profile settings

Risk Profile

note

The Risk Profile section is only visible if you have selected Continuous Verification (Applied to the pipelines in the Continuous Deployment) or Service Health in the Assign section.

Under Risk Category, select one of the following options:

Errors
Infrastructure
Performance/Throughput
Performance/Other
Performance/Response Time

Under Deviation Compared To Baseline, select the following settings to measure your service's behavior and calculate deviations from the health source:

Higher counts = higher risk
Lower counts = higher risk

You can select multiple options.

Map service instance identifier

note

The Map service instance identifier section is visible only if you have selected Continuous Verification (Applied to the pipelines in the Continuous Deployment) in the Assign section.

In Service Instance Identifier (only needed for CV), specify the service instance identifier, which represents a dynamically created service that you deploy using Harness. The default value is _sourceHost.

Advanced (Optional)

note

The Advanced (Optional) section is only visible if you have selected Continuous Verification (Applied to the pipelines in the Continuous Deployment) in the Assign section.

Ignore Thresholds

You can select the types of events for which you want to set thresholds in CV. Metrics that match the selected rules will not be flagged as anomalous, regardless of the analysis.

To set the Ignore Thresholds for CV:

Go to the Ignore Thresholds tab and select the + Add Threshold button.
From the Metric dropdown, select the desired metric for which you want to set the rule.
In the Criteria field, choose the type of criteria you want to apply for the threshold:

Absolute Value: Select this option and enter the Greater than and Lesser than values.
Percentage Deviation: Select this option and enter the Lesser than value.

Fail-Fast Thresholds

You can select the type of events for which you want to set thresholds in CV. Any metric that matches the selected rules will be marked as anomalous and cause the Workflow state to fail.

To set fail-fast thresholds for CV, follow these steps:

Go to the Fail-Fast Thresholds tab and select the + Add Threshold button.
From the Metric dropdown, select the desired metric for which you want to set the rule.
In the Action field, select what the CV should do when applying the rule:

Fail Immediately
Fail after multiple occurrences
Fail after consecutive occurrences

In the Count field, set the number of occurrences. This setting is only visible if you have selected Fail after multiple occurrences or Fail after consecutive occurrences in the Action field. The minimum value must be two.
In the Criteria field, choose the type of criteria you want to apply for the threshold:

Absolute Value: Select this option and enter the Greater than and Lesser than values.
Percentage Deviation: Select this option and enter the Lesser than value.

Save the health source settings

After configuring all the settings, select Submit to add the health source to the Verify step.
Select Apply Changes to save the changes made to the Verify step.

Run the pipeline

To run the pipeline:

In the upper-right corner, select Run.

The Run Pipeline dialog appears.
In the dialog, do the following:
- Tag: If you did not add a tag in the** Artifact Details** settings, select it now.
- Skip preflight check: Select this option if you want to skip the preflight check.
- Notify only me about execution status: Select this option if you want Harness to alert only you about the execution status.
Select Run Pipeline.

The pipeline starts running.

View results

The Summary section displays the following details when the Verify step begins:

Metrics in violation
Log clusters in violation
Error clusters in violation

Note that it may take some time for the analysis to begin. The screenshot below shows a Verification step running in a deployment:

Verification summary

Console view

The console view displays detailed logs of the pipeline, including verification logs. To view the console, select View Details in the Summary section or turn on the Console View toggle switch in the upper-right corner.

Verification step console view

By default, the console displays logs of only the anomalous metrics and affected nodes. To see all logs, clear the Display only anomalous metrics and affected nodes check box.

Verification step console view all data

The following screenshots show successful and failed verifications in a deployment run:

Successful verification

Passed verification step

Failed verification

Failed verification step

Set a pinned baseline

note

Currently, this feature is behind the feature flag SRM_ENABLE_BASELINE_BASED_VERIFICATION. Contact Harness Support to enable the feature.

You can set specific verification in a successful pipeline execution as a baseline. This is available with Load Testing as the verification type.

Set successful verification as a baseline

To set a verification as baseline for future verifications:

In Harness, go to Deployments, select Pipelines, and find the pipeline you want to use as the baseline.
Select the successful pipeline execution with the verification that you want to use as the baseline.

The pipeline execution is displayed.
On the pipeline execution, navigate to the Verify section, and then select Pin baseline.

The selected verification is now set as the baseline for future verifications.

Replace an existing pinned baseline

To use a new baseline from a pipeline and replace the existing pinned baseline, follow these steps:

In Harness, go to Deployments, select Pipelines, and find the pipeline from which you want to remove the baseline.
Select the successful pipeline execution with the verification that you have previously pinned as the baseline.
On the pipeline execution, navigate to the Verify section, and then select Pin baseline.

A confirmation alert message appears, asking if you want to replace the existing pinned baseline with the current verification. After you confirm, the existing pinned baseline gets replaced with the current verification.

Prerequisite​

Set up continuous verification​

Add Verify step​

Add a Verify step while building a deployment stage​

Add a Verify step to an existing deployment stage​

Define name and time out information​

Node filtering​

Select a continuous verification type, sensitivity, and duration​

Create a monitored service​

Add a health source​

Define health source​

Define log configuration settings​

Define a query​

Assign services​

Configure risk profile​

Risk Profile​

Map service instance identifier​

Advanced (Optional)​

Ignore Thresholds​

Fail-Fast Thresholds​

Save the health source settings​

Run the pipeline​

View results​

Console view​

Set a pinned baseline​

Set successful verification as a baseline​

Replace an existing pinned baseline​

Prerequisite

Set up continuous verification

Add Verify step

Add a Verify step while building a deployment stage

Add a Verify step to an existing deployment stage

Define name and time out information

Node filtering

Select a continuous verification type, sensitivity, and duration

Create a monitored service

Add a health source

Define health source

Define log configuration settings

Define a query

Assign services

Configure risk profile

Risk Profile

Map service instance identifier

Advanced (Optional)

Ignore Thresholds

Fail-Fast Thresholds

Save the health source settings

Run the pipeline

View results

Console view

Set a pinned baseline

Set successful verification as a baseline

Replace an existing pinned baseline