Technical Enterprise Incident Manager

Job Locations: US
Requisition ID: 2026-167566
Position Category: Engineering
Clearance: Agency Clearance

Responsibilities

We are seeking a highly motivated and technically skilled Technical Enterprise Incident Manager with strong Cloud Platform DevSecOps Engineering and application experience to lead enterprise incident response, service restoration efforts, and operational reliability initiatives. This individual will serve as the central point of coordination during major incidents, ensuring rapid resolution, clear communication, and continuous service improvement across enterprise infrastructure and applications.

The ideal candidate possesses a strong operational background, excellent communication skills, and hands-on technical expertise in infrastructure, cloud technologies, monitoring, automation, and IT service management processes. This role requires the ability to drive incident response while also identifying systemic reliability improvements.

This position may require participation in an after-hours and weekend on-call rotation supporting enterprise production incidents and critical outage management activities.

Key Responsibilities:

Enterprise Incident Management

  • Lead and coordinate incident bridge calls involving infrastructure, application, network, cloud, security, and vendor teams.
  • Drive rapid service restoration while maintaining accurate timelines, communications, and executive updates.
  • Ensure incidents are prioritized appropriately based on business impact and operational risk.
  • Manage escalation procedures and engage leadership when required.
  • Monitor SLA compliance and ensure incident response metrics are consistently achieved.

Cloud Platform DevSecOps Engineering

  • Improve platform reliability, availability, observability, and operational maturity.
  • Work with application teams to facilitate issue resolution and implement root-cause remediations.
  • Develop and enhance monitoring, alerting, and dashboarding capabilities.
  • Analyze trends, KPIs, and operational metrics to proactively identify reliability risks.
  • Support implementation of resiliency strategies including redundancy, failover, capacity planning, and performance optimization.
  • Create and maintain cloud architecture and service dependency diagrams using Cloudcraft.
  • Utilize Datadog for monitoring, alert correlation, dashboards, incident investigation, and performance analysis.
  • Assist with production readiness reviews and operational acceptance activities.
  • Participate in after-hours on-call incident management rotation as required.

Operational Excellence

  • Develop and maintain incident management procedures, runbooks, and knowledge articles.
  • Ensure accurate ticket documentation within ServiceNow.
  • Drive continual service improvement initiatives aligned with ITIL and SRE best practices.
  • Collaborate with cross-functional teams to improve communication, escalation paths, and operational workflows.
  • Support audit, compliance, and operational reporting requirements.

Qualifications

Required Qualifications:

  • Bachelor’s degree and 5 years of experience, or a high school diploma and 9 years of experience.
  • At least 5 years of experience in Cloud Incident Management, Operations Engineering, NOC, SRE, Application, or Production Support environments.
  • Experience leading enterprise Major Incident response efforts in a 24x7 operational environment.
  • Strong understanding of ITIL Incident and Problem Management processes.
  • Hands-on experience with infrastructure technologies including:
    • Windows/Linux Servers
    • Networking concepts
    • Cloud platforms (AWS, Azure, or GCP)
    • Load balancers, proxies, DNS, and firewalls
  • Experience with monitoring and observability platforms such as:
    • Datadog
    • Cloudcraft
    • CloudWatch
  • Experience using Cloudcraft to document and visualize cloud environments and application dependencies.
  • Experience using ServiceNow or similar ITSM platforms.
  • Strong analytical, troubleshooting, and organizational skills.
  • Excellent written and verbal communication skills, with the ability to facilitate meetings and to brief technical teams and executive leadership.
  • Must be a US Citizen.
  • Must be able to obtain and maintain the required agency clearance.

Preferred Qualifications:

  • Experience in a Site Reliability Engineering (SRE) or Cloud Platform DevOps environment.
  • Familiarity with CI/CD pipelines and Infrastructure as Code (IaC).
  • Experience supporting federal, healthcare, financial, or other highly regulated environments.
  • ITIL Foundation certification.
  • SRE, cloud, or operational certifications are a plus.

Peraton Overview

Peraton is a next-generation national security company that drives missions of consequence spanning the globe and extending to the farthest reaches of the galaxy. As the world’s leading mission capability integrator and transformative enterprise IT provider, we deliver trusted, highly differentiated solutions and technologies to protect our nation and allies. Peraton operates at the critical nexus between traditional and nontraditional threats across all domains: land, sea, space, air, and cyberspace. The company serves as a valued partner to essential government agencies and supports every branch of the U.S. armed forces. Each day, our employees do the can’t be done by solving the most daunting challenges facing our customers. Visit peraton.com to learn how we’re keeping people around the world safe and secure.

Target Salary Range

$86,000 - $138,000. This represents the typical salary range for this position. Salary is determined by various factors, including but not limited to, the scope and responsibilities of the position, the individual’s experience, education, knowledge, skills, and competencies, as well as geographic location and business and contract considerations. Depending on the position, employees may be eligible for overtime, shift differential, and a discretionary bonus in addition to base pay.

EEO

EEO: Equal opportunity employer, including disability and protected veterans, or other characteristics protected by law.
