AI SRE Onboarding Guide for Incident Responders

This guide walks you through the essentials of using Harness AI SRE as a responder or engineer.

You'll learn how to navigate the dashboard, respond to incidents, collaborate with your team, and leverage runbooks and AI-powered tools to resolve issues faster.

Your administrator has already configured the integrations and incident types — this guide focuses on what you need to know to be effective as an incident responder from day one.

Prerequisites

Before getting started, confirm the following with your administrator:

Item | Details
Harness account access | You have been added to your organization's Harness account with appropriate permissions
Slack / Teams connected | The Harness AI SRE bot is installed in your team's Slack workspace or Microsoft Teams environment
Monitoring tools configured | Your organization's monitoring tools (Datadog, New Relic, Grafana, etc.) are already integrated
On-call schedule (if applicable) | You've been added to your team's on-call rotation in PagerDuty, OpsGenie, or a similar tool

Need admin setup first?

If your organization hasn't configured AI SRE yet, share the Administrator Onboarding Guide with your platform team to get started.

1. Explore the AI SRE dashboard

Get familiar with the dashboard layout, active incidents, alerts, and key metrics at a glance.

2. Respond to an incident

Learn how to acknowledge, triage, and begin working on an incident when you're paged or alerted.

3. Create an incident manually

Sometimes you'll spot an issue before automated monitoring catches it. Learn how to declare an incident manually.

4. Use runbooks during an incident

Runbooks guide you through predefined response steps and can automate common actions during an incident.

5. Use the AI Scribe Agent

The AI Scribe Agent works alongside you during incidents to reduce manual overhead and improve post-incident learning.

  • Automatic Summaries — The AI Scribe monitors your incident channel conversations and generates real-time summaries of key decisions, actions, and findings.
  • Timeline Generation — It constructs a structured timeline of the incident based on channel activity, status changes, and runbook execution.
  • Post-Incident Reports — After resolution, the AI Scribe drafts a post-incident report pulling from the incident timeline, channel discussions, and metadata — giving you a head start on your retrospective.

To access AI Scribe outputs, navigate to the incident detail page and look for the AI Summary and Timeline sections.
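
As a rough mental model of the timeline described above, the sketch below merges events from three sources (channel activity, status changes, and runbook execution) into one chronological list. It is an illustration only and not how Harness implements AI Scribe: the `Event` type, the `build_timeline` and `render` helpers, the source labels, and the sample data are all hypothetical, and no Harness API is called.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class Event:
    """One timeline entry (hypothetical shape, not a Harness type)."""
    at: datetime
    source: str  # assumed labels: "channel", "status", or "runbook"
    text: str


def build_timeline(*streams):
    """Merge any number of event lists into one list ordered by timestamp."""
    merged = [event for stream in streams for event in stream]
    # sorted() is stable, so events with equal timestamps keep input order.
    return sorted(merged, key=lambda event: event.at)


def render(timeline):
    """Format each event as 'HH:MM [source] text'."""
    return [f"{e.at:%H:%M} [{e.source}] {e.text}" for e in timeline]


def ts(hour, minute):
    """Build a UTC timestamp on an arbitrary sample date."""
    return datetime(2024, 1, 1, hour, minute, tzinfo=timezone.utc)


# Made-up sample data for illustration.
channel = [Event(ts(10, 5), "channel", "Rolled back the latest deploy")]
status = [
    Event(ts(10, 0), "status", "Incident opened"),
    Event(ts(10, 20), "status", "Incident resolved"),
]
runbook = [Event(ts(10, 2), "runbook", "Restart step completed")]

timeline = build_timeline(channel, status, runbook)
for line in render(timeline):
    print(line)
```

The point of the sketch is that a timeline is an ordered merge of several independent event streams, which is why the Timeline section can interleave chat messages with status changes and runbook steps.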

Learn more

See the full AI Scribe Agent documentation for details on how AI-powered documentation works and how to get the most out of it.

Next steps

You're now equipped to respond to incidents effectively with Harness AI SRE. To deepen your skills and get even more out of the platform, explore:

  • Slack Commands Reference: Master the full set of slash commands for managing incidents directly from Slack.
  • Understanding Incident Types: Learn how your organization's incident types map to severity levels, responder teams, and escalation paths.
  • Browsing Runbooks: Explore the runbook library to understand the automated playbooks available to you.
  • Integration Overview: See which monitoring, communication, and ITSM tools are connected to your AI SRE environment.
  • AI Scribe Agent: Dive deeper into AI-powered incident documentation and insights.