Turn any signal into insight and action. See how PagerDuty Digital Operations Management Platform integrates machine data and human intelligence to improve visibility and agility across organizations.
Connect insights to real-time action by aligning teams through the shared language of business impact.
Check out the latest products we’ve been working on—including event intelligence, machine learning, response automation, on-call, analytics, operations health management, integrations, and more.
Digital Operations Management arms organizations with the insights needed to turn data into opportunity across every operational use case, from DevOps, ITOps, Security, Support, and beyond.
Over 300 Integrations
Discover DevOps best practices with our library of webinars, whitepapers, reports, and much more.
Learn best practices and get support help with resources from our award-winning support team.
See how PagerDuty works with our live product demo — twice a week, every week.
We've created a maturity model to assist on the journey to digital operations excellence. Take our short assessment to find out where your team falls!
Interactive, simple-to-use API and technical documentation enables users to easily try updates and extend PagerDuty.
Engage with users and PagerDuty experts from our global community of 200k+ users. Become a member, connect, and share insights for success.
Get all your PagerDuty-related questions answered by exploring our in-depth support documentation and community forums.
Using Data to Dismantle a Criminal Industry Human trafficking is a $150 billion dollar criminal industry that denies freedom to over 40 million people globally—and...
PagerDuty helps organizations transform their digital operations. Learn more about PagerDuty's mission and what we do.
Meet our experienced and passionate executive team.
We are risk-taking innovators dedicated to delivering amazing products and delighting customers. Join us and do the best work of your career.
With the PagerDuty Foundation, we are committed to doing our part in giving back to the community.
Many solutions offer email alerts to notify customers of an issue. Email alerts are effective if you’re in front of your inbox all day, but the reality is we usually aren’t. Missed alerts extend outages and impact your company’s revenue and customer loyalty. To know about issues quickly, thousands of customers have chosen PagerDuty for effective incident alerting. This post will explain PagerDuty alerting concepts and best practices around how to set them up so you can increase uptime.
Make Alerts Work For You
Each PagerDuty User can customize their Contact Methods and Notification Rules to get alerted how you want. If the primary on-call engineer misses alerts, alerts can be sent to other teammates until it is responded to based on Escalation Policies.
We recommend for all users to set up at least 3 Contact Methods and 3 Notification Rules to ensure they never miss alerts. By default, there is a Notification Rule to notify the incident owner immediately via email when the incident is assigned to them.
Tip: Depending on the type of incidents that occur in your system, set up alerts based on your cost of downtime and customer service-level agreements (SLAs).
Escalation Policies are safety nets for missed incidents, and they automatically re-route alerts to specific Users or On-Call Schedules:
We recommend Escalation Policies for every incident. If you typically have high severity incidents, dispatch incidents to another person sooner rather than later to ensure that it gets addressed quickly.
Note: Escalation Policies override personal Notification Rules, so each User should make their Notification Rules tighter than their Escalation Policies. If you escalate issues after 30-minutes, have all your personal alerts completed within that timeframe. This helps to ensure you receive all your alerts and have the chance to respond before it is escalated to another teammate.
Default PagerDuty Safety Nets
Alerts can be acknowledged, re-assigned or resolved. In case an acknowledged alert is forgotten, all Services are set with a default 30-minute Incident Acknowledge Timeout. This returns an incident to Trigger state and alerts will be re-started. Additionally, if an incident accidentally left open, by default, PagerDuty will Auto-Resolve Incidents that are open for 4 hours.
Reduce Alert Fatigue
Now that you have told us how you’d like to be contacted when incidents occur, PagerDuty helps decrease alerting headaches by de-duping, bundling, and appending alerts. Incidents from API-based integrations are de-duped, bundled, and appended automatically. With email-based integrations, you can set specific filters to reduce alert fatigue.
During an outage, multiple alerts for the same issue make it difficult to get to the root of the problem. Spend less time diagnosing and more time fixing with PagerDuty. These three features make it easier for users to be aware of critical issues, faster. With PagerDuty, you can decrease the alerting noise and decrease downtime.
Put PagerDuty Alerting Concepts Into Action
1. When PagerDuty receives an alert from your monitoring system, an incident is created in PagerDuty. If there are multiple alerts for the same issue, PagerDuty will de-dupe the alerts into one incident to reduce alerting noise.
2. Multiple on-call teams can be connected to PagerDuty and PagerDuty routes alerts to the right on-call person to fix it. Teams set Escalation Policies to determine who should be notified if the primary person misses their alerts.
3. Once the primary on-call person is found, alerts will be sent in the combination of their choosing. Based upon the team’s Escalation Policies, if the primary person doesn’t respond, the next on-call superhero is called into action.
4. When Users receive alerts, they can choose to acknowledge, resolve or reassign the incident with a SMS or phone call reply, or within the mobile app or web UI.
This is a guest post by Ilan Rabinovitch, Director of Product Management at Datadog. The convergence of rapid feature development, automation, continuous delivery, and the shifting...
Dynamic Notifications are now out in the wild! With our launch today, we give PagerDuty users the power to dynamically adjust how they are notified...
600 Townsend St., #200
San Francisco, CA 94103
905 King Street West, Suite 600
Toronto, ON, M6K 3G9, Canada
1416 NW 46th St., St. 301
Seattle, WA 98107
5 Martin Place
1 Fore St,
London EC2Y 9DT
© 2009 - 2018