Available Locations: Hybrid - Austin, Washington D.C.
Cloudflare's Edge Architecture was designed for massive redundancy and resilience. Over the past few years, Cloudflare has also invested heavily in upleveling the resiliency and fault tolerance of the Control Plane, Logs and Analytics that our customers use to configure, monitor and manage their Cloudflare services. The investments that have landed are incorporated into Cloudflare's Service Delivery Business Continuity and Disaster Recovery policies.
But there are still significant advances remaining for our Business Continuity and Disaster Recovery policies. We are expanding the scope of services covered to include new customer-facing services that have new and unique architectural and operational characteristics as well as all key operational / supporting services. We are also including critical new risk vectors. We will uplift from discrete Service Delivery BCP and Business System BCPs into a converged Enterprise Business Continuity plan across all critical business functions.
This role is for the BCP/DR Engineer who will work closely with the BCP/DR/HA Engineering Leader to contribute to the overall BCP/DR strategy, translate that strategy into prioritized actions, work closely across the organization to enlist and engage critical teams to drive solutions and processes that advance Cloudflare's resilience and posture as Critical Infrastructure for our customers.
Responsibilities:
- Working closely with the BCP/DR TPM, provide engineering leadership for the Infrastructure Resilience investment response to Code Orange. Complete the planned investment plan and incorporate into our BCP.
- Drive shared definition and formalize Cloudflare's preparedness to respond to alternate types of attacks. This likely takes a multi-phase approach:
- Current state Response Model: Define people, processes and options and decision tree for how we would respond with current architecture and risk profile
- Identify and lead prioritization of investment opportunities to mitigate risk. Drive recommendation to approved plan, manage investment plan to implementation.
- Revise Response Model based on changing risk landscape.
- Customer facing contributions: Create appropriate customer collateral showing Cloudflare's approach, reducing deal friction. Participate in customer calls on BCP/DR/HA.
- Engineer 'Prove It' approach and evidence collection. A critical component of resiliency is ensuring that as-operating meets the objectives of as-designed. This requires consistent, frequent in-production validations to avoid drift and ensure fault tolerance. This engineer will play a leadership role in defining, prioritizing and work with the engineering teams executing the prioritized validations, managing a growing portfolio of Chaos testing.
Examples of Desirable Skills, Knowledge, and Experience:
- Ability to analyze threat vectors and map them through our distributed systems to develop remediation plans and test scenarios to validate desired response patterns
- Deep understanding of BCP concerns such as HA, DR and Ransomware
- Ability to negotiate across engineering organizations to develop and agree upon product requirements to meet business continuity objectives.
- Ability to prioritize to deliver the best outcome for the investment available
- Ability to abstract complex technical solutions into customer-consumable information
- Strong communication skills across both internal and external, technical and non-technical audiences.
Bonus Points:
- Deep understanding of Cloudflare's Edge and Core architectures
- Distributed systems experience
- Publishing externally facing, customer consumable materials
- Audit participation and remediation experience
Compensation
Compensation may be adjusted depending on work location.
- For Colorado-based hires: Estimated annual salary of $137,000 - $167,000
- For New York City, Washington, and California (excluding Bay Area) based hires: Estimated annual salary of $154,000 - $188,000
Equity
This role is eligible to participate in Cloudflare's equity plan.
Benefits
Cloudflare offers a complete package of benefits and programs to support you and your family. Our benefits programs can help you pay health care expenses, support caregiving, build capital for the future and make life a little easier and fun! The below is a description of our benefits for employees in the United States, and benefits may vary for employees based outside the U.S.
Health & Welfare Benefits
- Medical/Rx Insurance
- Dental Insurance
- Vision Insurance
- Flexible Spending Accounts
- Commuter Spending Accounts
- Fertility & Family Forming Benefits
- On-demand mental health support and Employee Assistance Program
- Global Travel Medical Insurance
Financial Benefits
- Short and Long Term Disability Insurance
- Life & Accident Insurance
- 401(k) Retirement Savings Plan
- Employee Stock Participation Plan
Time Off
- Flexible paid time off covering vacation and sick leave
- Leave programs, including parental, pregnancy health, medical, and bereavement leave
Top Skills
Similar Jobs at Cloudflare
What you need to know about the Charlotte Tech Scene
Key Facts About Charlotte Tech
- Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
- Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
- Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
- Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
- Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus