We currently have a career opportunity for an Incident Recovery Manager to join our team located in Charlotte, NC.
Job Overview:
The Incident Recovery Manager will lead the recovery of critical incidents following major disruptions and drive strategies to ensure effective, timely restoration of services. This role is responsible for managing major incident recovery, leading post-mortem reviews, developing SOPs and runbooks, and implementing proactive measures to prevent future failures through the adoption of SRE principles.
As a dynamic and motivated leader, you will play a key role in shaping and enhancing production support and operational capabilities. You will help identify and implement best-fit solutions that improve stability, reliability, and efficiency across the organization.
This is an opportunity to drive meaningful change and enhance the end user experience. If you are passionate about production operations, stability, SRE, and observability, and have a proven track record of success, we invite you to join us in advancing resilience and operational excellence.
Perficient is always looking for the best and brightest talent and we need you! We’re a quickly growing, global digital consulting leader, and we’re transforming the world’s largest enterprises and biggest brands. You’ll work with the latest technologies, expand your skills, and become a part of our global community of talented, diverse, and knowledgeable colleagues.
Responsibilities
- Major Incident Support: Drive cross-functional teams to resolve critical incidents and attend post-mortem/post-incident reviews.
- Root Cause Analysis (RCA): Investigate underlying causes of major incidents using techniques such as 5-Why, Fishbone, and Blameless RCA
- Recovery Strategy & Planning: Develop and test incident recovery plans, and establish SOPs, a knowledge base, and mock drills
- Self Sufficiency: Develop playbooks in coordination with domain owners to increase team self-sufficiency and diagnostic accuracy
- Process Improvement: Identify opportunities to improve IT service reliability and reduce operational risks related to people, process and technology
- Feedback Loop: Provide continuous feedback to Observability, Automation, Resiliency and Domain teams on improving observability posture and automation, and on addressing single points of failure and architectural and design gaps
- Training and Development: Mentor, train, and develop other team members. Stay current with industry best practices and technologies, fostering a culture of continuous learning and professional growth.
- Performance Monitoring & Analytics: Utilize analytical and technical skills to assess system performance, monitor incident trends, and drive continuous improvement initiatives.
- Cross-Functional Collaboration: Collaborate closely with engineering teams and third-party vendors during major incidents and on system design, feasibility, and architecture to improve stability and meet resilience objectives.
Qualifications
- Progressive, proven experience and expertise in Production Services, Recovery and Problem Management, SRE, DevOps, or related fields; a development background is preferred
- Strong understanding of foundational technology components across infrastructure, cloud, and applications, sufficient to diagnose issues, ask the right questions, and effectively lead recovery of a critical incident
- Hands-on experience with observability tools, logs, and diagnostics, sufficient to troubleshoot issues and coach others
- Experience with cloud platforms (e.g., Azure, AWS, GCP) including high availability and disaster recovery architectures
- Experience with incident management and ITSM tools (e.g., ServiceNow, PagerDuty, Opsgenie) and automated workflows
- Demonstrated success in complex, large-scale enterprise production support and operations environments, including experience working with large geographically distributed teams
- Understanding of CI/CD pipelines and deployment strategies (e.g., blue-green, canary) and their impact on production stability
- Excellent communication and interpersonal skills, with a focus on collaboration and relationship-building
- Able to communicate effectively with CXOs and translate complex technical details into business terms
- Ability to influence and drive change across the organization
- Analytical mindset with the ability to translate data into actionable insights
- Experience in analyzing incident trends and implementing process improvements to enhance operational efficiency
- Strong decision-making skills under high-pressure incident scenarios with the ability to balance speed and risk
- Proactive mindset with a focus on prevention, continuous improvement, and operational excellence
- Strong ownership and accountability with a bias toward action and results
- Ability to mentor, coach, and elevate technical and operational capabilities across teams
ABOUT THE TEAM
Our App Modernization team helps businesses transform legacy systems and build future-ready applications. We deliver end-to-end solutions—combining cloud migration, custom application development, multi-cloud strategies, and modern UI and API integration. With expertise in DevSecOps, modern frameworks, and enterprise platforms, our team of engineers, architects, and project leaders partner with leading brands to drive innovation, accelerate delivery, and create lasting business impact. We also integrate AI-driven capabilities—such as intelligent automation, predictive analytics, and generative development tools—to enhance scalability, performance, and user experience.
ADDITIONAL INFORMATION
Perficient, Inc. proudly provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, national origin, age, disability, genetic information, marital status, amnesty, or status as a protected veteran in accordance with applicable federal, state and local laws. Perficient, Inc. complies with applicable state and local laws governing nondiscrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training. Perficient, Inc. expressly prohibits any form of unlawful employee harassment based on race, color, religion, gender, sexual orientation, national origin, age, genetic information, disability, or covered veterans. Improper interference with the ability of Perficient, Inc. employees to perform their expected job duties is absolutely not tolerated.
Disability Accommodations: Perficient is committed to providing a barrier-free employment process with reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or accommodation due to a disability, please contact us.
Applications will be accepted until the position is filled or the posting is removed.
The salary range for this position takes into consideration a variety of factors, including but not limited to skill sets, level of experience, applicable office location, training, licensure and certifications, and other business and organizational needs. The new hire salary range displays the minimum and maximum salary targets for this position across all US locations, and the range has not been adjusted for any specific state differentials. It is not typical for a candidate to be hired at or near the top of the range for their role, and compensation decisions are dependent on the unique facts and circumstances regarding each candidate. A reasonable estimate of the current salary range for this position is $81,978 to $149,880. Please note that the salary range posted reflects the base salary only and does not include benefits or any potential variable compensation programs. Information regarding the benefits available for this position is in our benefits overview.
About Us
Perficient is the global AI and technology consulting firm disrupting the traditional consulting model. Powered by our 7,000+ advisors, engineers, and designers, Perficient implements AI-first solutions that break conventions and deliver outcomes that matter. Proudly serving clients that represent the world’s most innovative brands, and in collaboration with our powerful technology partner ecosystem, we bring deep industry expertise and data-driven design to redefine how businesses run and succeed. Perficient is different. For real. Learn more at perficient.com.
What you need to know about the Charlotte Tech Scene
Key Facts About Charlotte Tech
- Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
- Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
- Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
- Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
- Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus


