PagerDuty Blog

Real-time Visibility for ITOps

Businesses today increasingly rely on the Internet and on their IT infrastructure for their business processes. One outcome of this growing reliance is that IT infrastructure outages carry greater risk and a heavier revenue impact, which demands that companies both large and small become adept at IT Operations.

The response to IT Operations incidents today is highly sophisticated, particularly at enterprise business scale. An army of responders and subject matter experts is needed to deal with the complexity, pace of change, distributed nature of teams, speed of delivery, and the potential for profound business impact. Transparent communications and streamlined interfaces between functions across the entire enterprise are paramount in resolving IT Operations issues at scale. PagerDuty manages the contextual ecosystem of data and human response information for both centralized (e.g. SOC/NOC/TOC) and distributed (e.g. DevOps) IT Operations teams to ensure the right people and workflows are orchestrated to accelerate incident response and remediation.

To assist the heroic efforts of IT Operations teams around the world in their battles to minimize the business impact of incidents, we’ve launched Operations Command Console and Intelligence Applications.

Introducing the Operations Command Console

The PagerDuty Operations Command Console provides a new way to understand and minimize the impact of service outages throughout your IT infrastructure. It significantly improves IT Operations workflow and orchestration capabilities, and it is suitable for both Enterprise Central Operations and DevOps at scale.

The Operations Command Console consists of modular, configurable applications, each of which provides a lens into a particular area of IT Operations workflow, and together they give your business shared context for incidents. One of its core features is the ability to decrease cognitive load on operations teams by converting continuous flows of data into actionable context and information. IT Operations teams using the Operations Command Console benefit from a much greater understanding of incident blast radii and improved orchestration of incident response.

The Operations Command Console can be enabled in a matter of minutes, using the information your company already sends to PagerDuty.

The initial release of the Operations Command Console is accompanied by four Intelligence Applications.

Service Health Application

The Service Health Application provides a view of the services across your IT infrastructure that have active incidents associated with them. You can sort the view in several ways, including by Recent Activity and by Most Impacted (by open incidents).
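To make the Most Impacted sort concrete, here is a minimal Python sketch of one way such a ranking could be computed. It is not PagerDuty's implementation: the `Incident` record, its fields, the status values, and the `most_impacted` helper are all hypothetical names chosen for illustration.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Incident:
    # Hypothetical record for illustration; not PagerDuty's data model.
    service: str
    status: str  # assumed values: "triggered", "acknowledged", "resolved"


def most_impacted(incidents):
    """Return (service, open_incident_count) pairs, highest count first."""
    open_counts = Counter(
        incident.service for incident in incidents if incident.status != "resolved"
    )
    # Services without open incidents never enter the counter, so they are
    # left out of the result, mirroring a view limited to active incidents.
    # Ties are broken alphabetically by service name (an arbitrary choice).
    return sorted(open_counts.items(), key=lambda pair: (-pair[1], pair[0]))


incidents = [
    Incident("checkout", "triggered"),
    Incident("checkout", "acknowledged"),
    Incident("search", "triggered"),
    Incident("billing", "resolved"),
]
ranking = most_impacted(incidents)
print(ranking)
```

In this sample, the service with only a resolved incident drops out of the ranking entirely, which is the behavior you would expect from a view that focuses on actionable information.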

Major Incidents Application

The Major Incidents Application provides a view of open incidents in your infrastructure sorted by Duration and Priority. Just as with the Service Health Application, only active incidents are shown, in keeping with the core Operations Command Console philosophy of focusing on actionable information.

Responders Application

The Responders Application provides an overview of the Responders associated with the open incidents. Both active and available on-call responders for each Service are visible in this application.

Infrastructure Health Application

The Infrastructure Health Application provides a visual overview of the alert clusters across the services and hosts in your infrastructure.

As leading-edge indicators of potential incidents are identified, the Infrastructure Health Application aggregates alert clusters across services. These clusters appear on the right side of the view and grow as more alerts cluster into the same services. This mechanism visualizes the blast radius of an incident across your services and is a powerful tool for accelerating incident response.
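As a rough illustration of how alerts grouping into the same services can reveal a blast radius, here is a minimal Python sketch. It is an assumption-laden simplification, not how the Infrastructure Health Application works internally: the alert dictionaries, the `cluster_alerts` and `blast_radius` helpers, and the `min_alerts` threshold are all hypothetical.

```python
from collections import defaultdict


def cluster_alerts(alerts):
    """Group alert summaries by the service they were raised against."""
    clusters = defaultdict(list)
    for alert in alerts:
        clusters[alert["service"]].append(alert["summary"])
    return dict(clusters)


def blast_radius(clusters, min_alerts=2):
    """List (service, alert_count) for clusters at or above min_alerts.

    min_alerts is a hypothetical threshold used only in this sketch.
    Results are ordered largest cluster first, then by service name.
    """
    hot = [
        (service, len(summaries))
        for service, summaries in clusters.items()
        if len(summaries) >= min_alerts
    ]
    return sorted(hot, key=lambda item: (-item[1], item[0]))


# Hypothetical alert feed; field names are assumptions for illustration.
alerts = [
    {"service": "database", "summary": "replication lag"},
    {"service": "api", "summary": "5xx rate rising"},
    {"service": "database", "summary": "connection pool exhausted"},
    {"service": "database", "summary": "slow queries"},
    {"service": "api", "summary": "latency above threshold"},
    {"service": "cache", "summary": "eviction spike"},
]
clusters = cluster_alerts(alerts)
radius = blast_radius(clusters)
print(radius)
```

Each new alert for a service makes that service's cluster larger, so the services at the top of the list are the ones most likely to be at the center of a developing incident.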

The Operations Command Console and Intelligence Applications leverage the data already delivered to PagerDuty, so setup and onboarding occur in a matter of minutes. Once provisioned, the Operations Command Console allows IT teams to customize application layouts to best fit their workflows. Options include hiding and expanding content, pivoting and filtering data, and arranging console views to give teams the operations awareness they need for their business or infrastructure services.

Focusing on any element in a given application highlights relevant context in the other applications, providing enhanced operations awareness across your services and infrastructure. By showing related data across all relevant applications, the Operations Command Console exposes relationships and dependencies across the various facets of your IT Operations.

Together, these new products enable IT Operations teams large and small to optimize their workflows and slash their Mean Time To Recovery. Minimizing business impact is the most important goal of IT Operations, and with the Operations Command Console and Infrastructure Health, that goal has never been more attainable.

Sign up for your trial today and accelerate your incident response using our new Operations Command Console and Intelligence Applications!