PagerDuty Blog

Finally, Have Quality Off-Call Time with On-Call Scheduling Best Practices

Anything can happen while you’re on-call. You can experience a quiet, incident-free shift or suffer a severe outage that makes your head explode. Since you don’t know what you’ll get, you always have to be prepared. Being on-call is stressful enough as it is, so we strive to make it less painful with easy scheduling and actionable alerts routed to the right person. When you’re not on-call, we’ll leave you alone so you can enjoy your personal time off.

Flexible Schedules For Your Unique Needs

Everyone wants time off to recharge. With rotating on-call shifts, you can. On-call should not be limited by location or time zone, so extend on-call to teammates who were previously out of range and share the responsibility fairly. PagerDuty on-call schedules contain Layers, each made up of team members who share the same rotation. Rotations are customizable and can be restricted to specific times of the day. If you have a global team and want everyone to be able to save their nights for romance (or whatever they’re into), create Follow-The-Sun schedules with business-hours-only layers.
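
As a rough illustration of the follow-the-sun idea, the sketch below models each layer as a daily window in UTC and checks whether the layers together cover all 24 hours. The region names and hours are hypothetical, and this is a toy model, not PagerDuty's API or data format.

```python
# Hypothetical layers: (name, start_hour_utc, end_hour_utc); end is exclusive.
# A window that wraps past midnight has start > end (for example 22 -> 6).
LAYERS = [
    ("APAC", 0, 8),
    ("EMEA", 8, 16),
    ("Americas", 16, 24),
]

def hours_covered(start, end):
    """Return the set of UTC hours (0-23) that a daily window covers."""
    if start < end:
        return set(range(start, end))
    # Window wraps around midnight.
    return set(range(start, 24)) | set(range(0, end))

def uncovered_hours(layers):
    """Return the sorted UTC hours that no layer covers."""
    covered = set()
    for _name, start, end in layers:
        covered |= hours_covered(start, end)
    return sorted(set(range(24)) - covered)

print(uncovered_hours(LAYERS))      # empty list: no gaps in this example
print(uncovered_hours(LAYERS[:2]))  # dropping the Americas layer leaves a gap
```

A check like this is handy when business-hours windows in different regions do not line up neatly, since any hour left uncovered means nobody gets paged.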

[Image: Follow-the-Sun schedules]

On-call schedules should stay consistent so there is no confusion about whether you’re on or off. When changes are needed because a teammate is sick or going on vacation, use Overrides to make a one-time change instead of redoing the entire schedule.

[Image: Overrides]

  • Layers are rotational schedules made up of a group of people who rotate through the same shift
  • Follow-The-Sun Schedules make it possible for global teams to divide 24/7 on-call responsibility by daylight hours so no one has to be on-call at 2 AM
  • Overrides are one-time changes to an on-call schedule and do not change your recurring rotation
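
To make the relationship between a recurring rotation and an override concrete, here is a minimal sketch. The names, dates, weekly shift length, and Tuesday hand-off are all hypothetical choices for illustration, and the functions are not part of any PagerDuty library.

```python
from datetime import datetime, timedelta

# Hypothetical layer: three people rotating weekly, handing off on a Tuesday.
ROTATION = ["alice", "bob", "carol"]
ROTATION_START = datetime(2024, 1, 2, 13, 0)  # Tuesday, 13:00
SHIFT_LENGTH = timedelta(weeks=1)

# Hypothetical override: dave covers two days in the middle of bob's week.
OVERRIDES = [
    (datetime(2024, 1, 10, 9, 0), datetime(2024, 1, 12, 9, 0), "dave"),
]

def scheduled_on_call(at):
    """Person the recurring rotation puts on call at `at`."""
    shifts_elapsed = (at - ROTATION_START) // SHIFT_LENGTH
    return ROTATION[shifts_elapsed % len(ROTATION)]

def on_call(at):
    """Person actually on call: an active override wins over the rotation."""
    for start, end, person in OVERRIDES:
        if start <= at < end:
            return person
    return scheduled_on_call(at)

moment = datetime(2024, 1, 11, 12, 0)
print(scheduled_on_call(moment))  # bob, per the recurring rotation
print(on_call(moment))            # dave, because the override is active
```

The override is stored separately from the rotation, so once it expires the recurring schedule resumes unchanged.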

Tip: Many customers hand off their weekly on-call duties on a weekday so both parties are in work mode. Many also noted that a lot of their company holidays fall on Mondays, so they hand off on-call shifts on Tuesdays in the middle of business hours for a smoother transition.

Have Backups for All Alerts

Even the most reliable engineer can miss an incident from time to time. To ensure that outages aren’t prolonged by missed alerts, Escalation Policies automatically re-route alerts to the standby on-call engineer. Alerts can be escalated to specific Users or to Users who are part of an on-call schedule. With Primary and Secondary On-Call Schedules, an initial owner is assigned the incident and, should anything happen, a teammate is available as a backup to catch missed alerts. Setting up Primary and Secondary On-Call Schedules follows the same process as setting up a general on-call rotation.

  • Escalation Policies re-route incidents to another person or team if they are missed by the primary incident owner
  • Primary and Secondary On-Call Schedules are on-call schedules that can be used in an Escalation Policy instead of Users

Escalation Policies should always be set in order to catch missed alerts. Even though being on a Secondary On-Call Schedule may seem to make your on-call life twice as long, you are unlikely to have to take action because you are backing up a responsive engineer. And when you do, you know you are helping out a teammate who may be stuck in an emergency.

[Image: Escalations for missed alerts]

Tip: Primary and Secondary On-Call Schedules should be staggered so the same person isn’t on-call on both schedules at the same time. After alerts have escalated past the primary and secondary on-call engineers, have a manager be the anchor. Usually, alerts will be responded to before they get to that point, but as a fail-safe in case the last person in the Escalation Policy misses the incident as well, you can program the Escalation Policy to cycle through its levels multiple times.
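
The escalation and staggering advice above can be sketched as a small simulation. This is a simplified model under stated assumptions: it ignores escalation timeouts, the `escalate` and `is_staggered` helpers and the `acknowledges` callback are hypothetical, and none of it reflects PagerDuty's actual implementation.

```python
def escalate(levels, acknowledges, max_cycles=2):
    """Page each level in order, repeating the whole policy up to max_cycles
    times. Returns (responder, pages_sent); responder is None if nobody
    acknowledges."""
    pages = []
    for cycle in range(1, max_cycles + 1):
        for person in levels:
            pages.append(person)
            if acknowledges(person, cycle):
                return person, pages
    return None, pages

def is_staggered(primary, secondary):
    """True if no shift slot has the same person on both schedules."""
    return all(p != s for p, s in zip(primary, secondary))

policy = ["primary", "secondary", "manager"]

# Hypothetical scenario: everyone misses the first pass, and the secondary
# responds on the second pass through the policy.
responder, pages = escalate(
    policy, lambda person, cycle: cycle == 2 and person == "secondary"
)
print(responder)  # secondary
print(pages)      # primary, secondary, manager, then primary, secondary

# Offsetting the secondary rotation by one slot keeps the schedules staggered.
print(is_staggered(["alice", "bob", "carol"], ["bob", "carol", "alice"]))  # True
```

Without the repeat, the same scenario would end with no responder after the manager's page, which is the gap the extra cycles are meant to close.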

Always Know When You’re On (And Off!)

When you’re off-call, you don’t want to spend personal time thinking about when you’re going to be on-call again. With PagerDuty’s Hand-Off Notifications, we’ll let you know when your shift starts and ends. Additionally, if you prefer to centralize your PagerDuty on-call schedule in Gmail or Outlook, you can Export Schedules or sync through iCal.

  • Hand-Off Notifications tell you when your on-call shift is about to begin and when it ends
  • Exporting Schedules gives you the full flexibility to choose how you like to manage your on-call calendars

How PagerDuty On-Call Scheduling and Alert Routing Work Together:

[Image: PagerDuty on-call schedule and alert routing]

  1. At 10:05 am PST, your monitoring tool detects that your database is on fire and sends an incident to PagerDuty
  2. PagerDuty routes the incident to the DBA team, which has set up a Follow-the-Sun on-call rotation with primary and secondary on-call schedules. Since it’s 10:05 am PST, the US primary on-call person is paged, and the Indian DBA team, which is off-call, is not disturbed
  3. Since the primary on-call person missed their alerts, the secondary on-call person is now notified
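
The time-zone arithmetic behind step 2 can be checked with a short sketch. It assumes fixed UTC offsets (PST as UTC-8, IST as UTC+5:30, no daylight saving) and a hypothetical 09:00 to 17:00 local business-hours window for each team; the date is arbitrary.

```python
from datetime import datetime, timedelta, timezone

# Fixed offsets, assumed for illustration; daylight saving time is ignored.
PST = timezone(timedelta(hours=-8), "PST")
IST = timezone(timedelta(hours=5, minutes=30), "IST")

# Hypothetical follow-the-sun layers: each team is on call 09:00-17:00 local.
TEAMS = {"US": PST, "India": IST}
BUSINESS_START, BUSINESS_END = 9, 17

def teams_on_call(at):
    """Teams whose local time at `at` falls inside business hours."""
    return [
        team
        for team, tz in TEAMS.items()
        if BUSINESS_START <= at.astimezone(tz).hour < BUSINESS_END
    ]

incident = datetime(2024, 1, 15, 10, 5, tzinfo=PST)
print(incident.astimezone(IST).strftime("%H:%M"))  # 23:35 in India
print(teams_on_call(incident))                     # ['US']
```

At 10:05 am PST it is 11:35 pm in India, so only the US layer is active and the page goes to the US primary.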

Have a Life, Even While On-Call

Why make on-call harder for yourself? Major outages happen and you want to know as soon as possible, but they don’t occur all the time. Unchain yourself from your desk with PagerDuty alerts: we’ll find you wherever you are. And with PagerDuty, when you’re off, you’re really off, so you can fully recharge and be ready for your next round of on-call duty.