Escalation Policies

Define who gets notified, when, and what happens if they don't respond.

How Escalation Works

When an incident triggers, NotifyHero walks through your escalation policy level by level:

Level 1 — notify immediately (e.g., on-call engineer)
Wait for acknowledgment
If no response after the timeout → Level 2 (e.g., team lead)
Continue until someone acknowledges or all levels are exhausted

Level 1: On-Call Schedule "Backend Primary" → wait 5 min
Level 2: On-Call Schedule "Backend Backup" → wait 10 min
Level 3: User "vp-engineering@your-company.com" → wait 15 min
Level 4: Repeat from Level 1

Creating an Escalation Policy

Go to Settings → Escalation Policies → New Policy.

Each level requires:

Target — a schedule, a specific user, or a team
Timeout — minutes to wait before escalating (0 = notify simultaneously with next level)

Targets

| Target Type | Description | |-------------|-------------| | Schedule | Notify whoever is currently on-call | | User | Notify a specific person | | Team | Notify all members of a team |

Multi-Level Escalation

Add as many levels as needed. Common patterns:

Standard (3-level)

L1: Primary on-call → 5 min
L2: Backup on-call → 10 min
L3: Engineering manager → 15 min

Aggressive (for critical services)

L1: Primary on-call → 2 min
L2: Primary + Backup on-call → 3 min
L3: Entire team → 5 min
L4: VP Engineering → 10 min

Flat (small team)

L1: Entire team → 0 min (everyone notified immediately)

Repeat Behavior

At the bottom of the policy, choose what happens after all levels are exhausted:

Repeat from Level 1 — loops back (recommended for critical services)
Stop escalating — no further notifications
Repeat N times — loops a set number of times, then stops

Tip: For production services, always enable repeat. An unacknowledged critical incident should never go silent.

Urgency Rules

Set urgency based on severity or time of day:

By Severity

Critical / Error → High urgency (phone + SMS)
Warning / Info → Low urgency (email + Slack)

By Time

Business hours (09:00–18:00) → Low urgency
Off-hours → High urgency

Combined

Critical at 3 AM → High urgency, aggressive escalation
Warning at 2 PM → Low urgency, standard escalation

Fallback Behavior

If an incident goes unacknowledged through all escalation levels and all repeats:

The incident is marked as Escalation Exhausted
Optionally notify a fallback target (e.g., a Slack channel or email alias)
The incident remains open and visible on the dashboard

Configure fallback at Escalation Policy → Advanced → Fallback Target.

Assigning to Services

Each service has exactly one escalation policy. Assign it at Service → Settings → Escalation Policy.

Multiple services can share the same policy. For example, all services owned by the Backend team can use the "Backend Escalation" policy.

Best Practices

Keep timeouts short for critical services — 2–5 minutes per level
Use schedules, not users — individual users go on vacation; schedules don't
Test your policies — trigger a test incident and verify the escalation chain works
Review quarterly — teams change, so should your policies