| In Alerting, Operations Performance

Since our first on-call best practices post back in March 2011, on-call scheduling methods have remained mostly unchanged. Many teams start off with sending email…

| In Alerting, Operations Performance

Earlier this month at Nagios World Conference North America, Arup Chakrabarti, Operations Engineering Team Lead @ PagerDuty gave a talk on “What You Should Monitor and…

| In Alerting, Operations Performance

Whether you’re a PagerDuty customer or discovering us for the first time, we’ve put together this post to highlight some of hacked workarounds that inspired…

| In Alerting, Operations Performance

TL;DR; We brought our deploy time down from 10 minutes to 50 seconds. When I joined PagerDuty over a year ago, our application consisted of…

| In Alerting, Operations Performance

Hiring software engineers is hard.  We all know this.  If you get past the problem of sourcing and landing good candidates (which is hard in…

| In Alerting, Operations Performance

Customers always expect great support from every business, and they ought to if they pay a premium for it. Providing awesome support is a lot…

| In Alerting, Operations Performance

Monitoring your infrastructure. It can be challenging, but that’s why you have all of the tools in place to make sure you don’t miss a…

| In Alerting, Operations Performance

I get a lot of requests to handle & escalate phone calls as well as alerts from monitoring systems. Here’s a code sample that lets…

This is Part 1 in a multi-part series dealing with tips for being on-call.