| In Alerting

One of the great things about PagerDuty is our API. With our API, you can integrate with a wide variety of partners, and also extend and customize your PagerDuty experience. Our customers have done a number of cool things, including creating custom reports and dashboards, creating status pages to let customers and internal stakeholders know about incidents, and automating the details of their incident response. The PagerDuty API helps you respond to incidents more efficiently. But where do you get started? We highlight some examples of cool tools.

As IT professionals, we have ever-expanding access to more accurate Ops telemetry. With this data, we have an incredible amount of visibility into what’s going on. However, more information isn’t always a good thing when it comes to alerting. You can definitely have too many alerts, and alert fatigue is a growing problem among Operations teams. More detailed telemetry isn’t bad; it’s just that much of this information is better suited to forensics than to alerting. Event Enrichment and PagerDuty team up to help you battle alert fatigue.

| In Alerting

Want to improve your email alerts? Think again. Monitoring systems can help you better manage your uptime, but even though you may spend a lot of time configuring checks and thresholds to identify problems early, your alerts are only as good as your incident response processes. Here are 5 reasons why you should ditch email alerts if you’re still using them.

| In Alerting, DevOps, Operations Performance

As the pace of development and business continues to accelerate, teams need an agile and collaborative work environment to succeed. Moving to a DevOps model is a critical part of setting up your engineering teams for success, but making the transition can be challenging for many companies. In this post, we share some strategies for making the transition.

| In Alerting

PagerDuty was built around one simple idea: waking up the right people when things break. When an event is triggered, PagerDuty makes magic to notify…

| In Alerting, Operations Performance

A little while back, we blogged on key performance metrics that top Operations teams track. Mean time to resolution (MTTR) was one of those metrics….

| In Alerting, Operations Performance

This is the third post in our series on using data to improve your IT operations. The second post on making your metrics meaningful is…

| In Alerting, Operations Performance

New Zealand is located on the southern tier of the Pacific “Ring of Fire”, which makes it no stranger to seismic activity. On average, there…

This post is the second in our series about how you can use data to improve your IT operations. Our first post was on alert fatigue….

| In Alerting, Operations Performance

This is the first post in our series on how you can use data to improve your IT operations. The second post is about best…

| In Alerting, Operations Performance

Living in a data-rich world is a blessing and a curse. Flexible monitoring systems, open APIs, and easy data visualization resources make it simple to…

Building is second nature for many engineers. Naturally you could build a solution to solve the problems of: Having multiple monitoring tools for your infrastructure…

| In Alerting, Operations Performance

Updated 7/24/2014: This blog post was updated to more accurately reflect Arup’s talk. Arup Chakrabarti, PagerDuty’s operations engineer manager, stopped by Heavybit Industries’ HQ to discuss…

Keeping our customers happy is a source of pride at PagerDuty. While having a reliable product that is loved by our customers makes our lives…

| In Alerting, Operations Performance

“With PagerDuty we have been able to consolidate our alert stream” – Chris Peters, Operations Lead at Expensify “I can’t imagine life without PagerDuty. Having…

| In Alerting, DevOps, Operations Performance

This is the final post of our series about transitioning to a DevOps culture (for now). To start from the beginning, check out Why You…

| In Alerting, DevOps, Operations Performance

This is the third post in a series to help your engineering team transition into a DevOps organizational model. Here we’ll discuss how to scale…

| In Alerting, Operations Performance

This is the second post in a series to help your engineering team transition into a DevOps organizational model. Here we’ll discuss how to start…
