Turn any signal into insight and action. See how PagerDuty Digital Operations Management Platform integrates machine data and human intelligence to improve visibility and agility across organizations.
Learn how PagerDuty can accelerate your cloud migration.
Check out the latest features we’ve been working on—including event intelligence, machine learning, response automation, on-call, analytics, operations health management, integrations, and more.
Digital Operations Management arms organizations with the insights needed to turn data into opportunity across every operational use case, from DevOps, ITOps, Security, Support, and beyond.
Over 200 Integrations
Discover DevOps best practices with our library of webinars, whitepapers, reports, and much more.
Learn best practices and get support help with resources from our award-winning support team.
See how PagerDuty works with our live product demo — twice a week, every week.
Join live and on-demand webinars for product deep dives, industry trends, configuration training, and use case-specific best practices.
Interactive, simple-to-use API and technical documentation enables users to easily try updates and extend PagerDuty.
Engage with users and PagerDuty experts from our global community of 200k+ users. Become a member, connect, and share insights for success.
Get all your PagerDuty-related questions answered by exploring our in-depth support documentation and community forums.
In part one of this two-part series, I went over focusing on low-effort tasks that produce the highest value and ways to increase leverage in...
PagerDuty helps organizations transform their digital operations. Learn more about PagerDuty's mission and what we do.
Meet our experienced and passionate executive team.
We are risk-taking innovators dedicated to delivering amazing products and delighting customers. Join us and do the best work of your career.
With the PagerDuty Foundation, we are committed to doing our part in giving back to the community.
GREE is the global leader in free-to-play games. They leverage the power of gaming by combining the power of mobile and making it social. When you can play the games you want, when and where you want, it’s easy to face off against friends, discover new games and grab the top spot among your friends on the leaderboard.
Each day millions of GREE gamers open their mobile devices and expect to find their favorite games waiting to be played.
GREE has been a successful gaming company in Japan for almost a decade, but when the company opened its doors in the US two years ago their team was faced with several technical challenges to keep their games online and available for their dedicated and growing fan base.
At first, GREE planned to build a Network Operation Center (NOC) team to monitor, respond to and resolve incidents that occurred within their infrastructure. However, they quickly realized that using a NOC would be very slow and the on-call teams would be prone to making errors.
Inevitably, NOC-created incidents made it difficult to account for the human factor of escalating issues. Without a proper escalation policy in place it’s easy to brush an incident off until the morning because you don’t want to wake up one of your team members, especially if you were unsure who the right person for the job would be. Incidents may be more serious than they initially appear and cause a game to be offline for hours.
Often an incident would cause all-too-familiar chaos:
“Where’s the runbook? Is this an app bug or a system issue? Who’s the dev for this game?… Anyone have their phone number? …. I can’t find the dev? Anyone know who his manager is?”
For GREE, this human factor of escalation caused several delays and decrease their overall Mean Time to Repair (MTTR).
Before PagerDuty, GREE’s escalation procedures were a very slow and manual process. Ops Engineers were conservative when escalating issues to developers or their managers because the gravity of an alert wasn’t always immediately known.
With PagerDuty, GREE quickly moved from a structure of only having Ops Engineers on-call and transitioned to a DevOps model for on-call management and alerting.
“Getting devs on-call means we are targeting the appropriate teams with actionable alerts.”
Using PagerDuty has allowed GREE to make sure issues are funneled to the right team members for the job. Each Monday morning, GREE syncs up to review alerts from the previous week and goes over the schedule and on-call rotation for the new week. It’s imperative that each team member is made aware of who is the primary, secondary and manager on-call for the week.
“With PagerDuty it’s clear when our developers are supposed to be on-call, who is the primary and secondary Ops on-call, and who the manager for the week is.”
In their meeting, alerts that are not actionable are downgraded so they will not reoccur during the next on-call rotation.
PagerDuty also gives GREE visibility into each of their games’ system. GREE uses the PagerDuty API to sync the availability of each of their games with their status dashboard.
This dashboard offers a visual representation of their games’ statuses that everyone in the company can seen.
When an alert is received for a game, its corresponding icon on the dashboard will flash red. After the incident is resolved the games icon will indicate that they are fully back online.
To further keep their team in the loop, GREE integrates PagerDuty with Skypebot. When an incident is triggered in PagerDuty and alert will appear within the dedicated chat room for the game impacted. These dedicated chat room include the game’s developers, product managers and VPs to keep all of a game’s stakeholders in the loop as well. GREE can also trigger a PagerDuty incident for the on-call engineer directly from the chat window.
GREE reports that they are fixing incidents in their system much faster than they were before using PagerDuty because the right people are getting notified to resolve incidents directly.
“With PagerDuty, it’s clear which person in Ops and Engineering is on-call at each moment, for each of our games.”
600 Townsend St., #200
San Francisco, CA 94103
260 Queen St W #300,
Toronto, ON M5V 1Z8, Canada
1416 NW 46th St., St. 301
Seattle, WA 98107
5 Martin Place
1 Fore St,
London EC2Y 9DT
© 2009 - 2018