Don’t let the hardboiled-sounding name of our latest integration scare you off, because this monitoring service is a great way to get notified when one of your mission-critical scheduled tasks suddenly sleeps with the fishes. Dead Man’s Snitch is an uptime-monitor for cron or periodic jobs like backups or batch processing, and it alerts you when your jobs don’t run so you can investigate before it becomes a problem.

| In Alerting, Announcements, Community, Features

After getting our exciting entries from you guys and narrowing it down to five stellar finalists, you guys all helped #pickyourpage and now, we have our alert sound contest winner.

| In Announcements, Community, Partnerships

StatusCast is an application status page tool that allows you to proactively communicate uptime status to your end-users to improve customer satisfaction and loyalty. You can use StatusCast with PagerDuty to build a hosted status page in minutes and keep users informed with status updates via email, SMS, Twitter, Slack, HipChat etc.

| In Announcements, Community, Events

We’re excited to announce our first-ever custom alert sound contest! Beginning September 21, 2015, we will accept submissions for a chance to be included as an alert sound in our mobile app. We have a great community, and we want to see them get creative. Or ironic. Or immature. Songs, clever noises, avant-garde recordings of one hand clapping – all are welcome. Send your best creation to pickyourpage@pagerduty.com.

As indicated in a survey conducted by Forrester Research, a well-constructed IT Operations management system provides fast alert notification, keeps business-critical incidences from occurring at a minimum, and focuses on automation as a way of addressing issues. What we are actually seeing in the field today, however, doesn’t seem to line up with this approach. According to a recent Forrester thought leadership paper, incident resolution practices today are tactical, reactive, and harm commercial success. Listed below are some observations we are seeing with IT Organizations in the Enterprise.

| In Alerting, Announcements, Community, Features, On-Call Life

We’re pleased to announce our fourth major mobile release, which brings some significant improvements to the performance and usability of key parts of the app. With all these changes, it’s faster and easier than ever to see, investigate, and take action on problems in your system — driving down resolution time and helping your team improve your operations performance.

| In Announcements, Community, Reliability

We are delighted to announce that our Customer Support and Advocacy team won the Silver Stevie® Award in the Customer Service Department of the Year category in the 2015 International Business Awards. The award demonstrates PagerDuty’s commitment to its customers, as evidenced by a satisfaction rating that averaged 98.3 percent throughout 2014.

Etsy occasionally runs an engineer exchange program, where they trade engineers with another tech company to give both organizations insight into what the other does differently. PagerDuty was their most recent participant, and in May, I had the pleasure of spending a week at Etsy’s office in Brooklyn. I learned from their practices, observed what they were doing well, and gained insight into their team dynamics. Etsy has an amazing culture, and I observed the customs they put into place to maintain their environment of empathy, autonomy, and learning. It was a great example of the traditions a company can foster to maintain a productive and happy work environment.

| In Announcements, Community, Partnerships

Opsmatic provides real-time visibility of any change to the live state of your infrastructure and intelligently alerts you before trouble begins. The recent addition of Assertions gives you a precise way to check and enforce policy across all your hosts. It’s only natural that Opsmatic has partnered with PagerDuty to ensure flawless alerting and effective incident collaboration. PagerDuty’s operations performance platform ensures that the right people on your team get alerted and can resolve incidents before they become emergencies.

| In Community

Long-time PagerDuty customers Dropbox, Flipboard, and Splunk spoke about their hard-won experience, shared war stories, and discussed what they’ve learned about operations at scale. They also had advice about how what they’ve learned can be applied to other teams. We were delighted to talk with customers, partners, and the extended community about what it means to be operationally mature. Here is what was said about Operational Maturity.

This is a guest blog post written by Anthony Gibbons, the Operations Manager at Airhead Education. Anthony gives his perspective as a startup setting up PagerDuty as their IT Operations Software: “With the advent of cloud services and companies willing to integrate with each other, it is now entirely possible for a small startup to use the same monitoring tools as industry stars such as Airbnb, Pinterest and Path… It probably took me an hour to integrate all of my services with PagerDuty.”

| In Announcements, Community, Events

Five (count ‘em) PagerDuty engineers/product managers were chosen to speak at Velocity Santa Clara next week.
BOOM! What a beautiful world we live in. But what are they going to be speaking about? We’re glad you asked.

| In Announcements, Community, Partnerships

With CloudMonix’s core objective of simplifying, streamlining and automating routine or complex tasks for Cloud System Administrators and IT Professionals – we are always on look to improve the way we deliver our services. That’s why we have partnered up with PagerDuty, to deliver instant alerts and notifications on PagerDuty’s leading Incident Management platform.

ZooKeeper, for those who are unaware, is a well-known open source project which enables highly reliable distributed coordination. It is trusted by many around the world, including PagerDuty. It provides high availability and linearizability through the concept of a leader, which can be dynamically re-elected, and ensures consistency through a majority quorum. The leader election and failure detection mechanisms are fairly mature, and typically just work… until they don’t. How can this be? Well, after a lengthy investigation, we managed to uncover four different bugs coming together to conspire against us, resulting in random cluster-wide lockups. Two of those bugs laid in ZooKeeper, and the other two were lurking in the Linux kernel. This is our story.

Application Performance Monitoring (APM) systems like AppDynamics can provide incredibly rich information about what’s happening with your IT infrastructure, and can identify performance issues before they create big problems. However, this information is only as good as your ability to respond to it. PagerDuty can extend the capabilities of AppDynamics Alert & Respond policies to ensure incidents are noticed, responded to, and fixed quickly.

Today we’re announcing the integration of PagerDuty with Webmon, a website monitoring and escalation service that lets you be the first to know when an online service goes down.

| In Community, Events, On-Call Life

We hosted our first user group last week at PagerDuty HQ! Not only did we gather our awesome customers and enjoy the taco bar and cervezas, but we got to learn a lot from our them, share our roadmap – and our customers learned from each other, too. We really value user feedback as part of how and why we build our product. We wanted to share some key takeaways from our sessions during the event.

| In Announcements, Community, Partnerships, Product

Streamline AWS Security Management with PagerDuty and Evident.io This is a guest blog post by John Martinez, Principal Solution Architect at Evident.io. At Evident.io, one…