ServiceNow

Director, Software Engineering - Observability - Monitoring & Alerting

Posted 8 Days Ago
Remote or Hybrid
Hiring Remotely in Orlando, FL
170K-297K Annually
Expert/Leader
Lead the development of observability applications, managing a high-performing engineering team and focusing on proactive insights and incident resolution.
The summary above was generated by AI
Company Description
It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But this is just the beginning of our journey. Join us as we pursue our purpose to make the world work better for everyone.
Job Description
ServiceNow is seeking a Director of Engineering to lead the development of our next-generation Observability applications, supporting our global cloud services organization. In this highly technical and strategic leadership role, you will shape the vision and execution of observability for cloud-native systems at scale, enabling proactive insight, faster incident resolution, and enhanced customer experience.
You'll lead a high-impact engineering organization responsible for delivering cohesive, intelligent observability solutions - including metrics, logs, traces, and events - through self-service, intuitive user experiences.
What you get to do in this role:
  • Define and execute the Observability vision, strategy, and roadmap, aligned with organizational reliability and performance goals.
  • Lead, scale, and mentor a high-performing, cross-functional engineering team (including managers, engineers, and tech leads) & drive AI-first development.
  • Design, deliver, and operate both open-source and third-party observability products to monitor health & availability across infrastructure, applications, and services.
  • Implement AI/ML-powered observability for anomaly detection, predictive alerting, noise reduction, and root cause analysis.
  • Develop self-service frameworks that empower developers & customers to onboard services with best-in-class, AI-powered observability.
  • Partner with SRE, Engineering and Product teams to improve reliability, self-healing, and auto-remediation using observability insights.
  • Foster a culture of engineering excellence, including strong practices around code quality, testing, CI/CD & operational rigor.
  • Foster innovation and experimentation with emerging technologies (e.g. AIOps, agentic workflows, LLMs, eBPF).
  • Define operational excellence KPIs, conduct postmortems, and continuously evolve the observability and performance review culture.

Qualifications
To be successful in this role you have:
  • 10+ years of experience building large-scale distributed systems, cloud platforms, or infrastructure.
  • 8+ years of engineering leadership experience managing & scaling high-performing teams and leaders.
  • 2+ years of hands-on experience leading AI-native initiatives in engineering.
  • Proven success in building and operating scalable observability & alerting solutions (based on logging, metrics, tracing, etc.) in production.
  • Strong understanding of OpenTelemetry, TSDB (VictoriaMetrics/Prometheus), Kafka, Grafana, and other modern observability tools.
  • Strong understanding of Observability across the full stack - from frontend user interactions down to the infrastructure and networks supporting the system.
  • Experience with network observability technologies & products is preferred (flow monitoring, ThousandEyes, gNMI, etc.).
  • Outstanding communication and leadership skills with the ability to influence across all levels of the organization.
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • Strong grasp of cloud-native technologies such as Kubernetes, and of container observability.
  • Passion for operational excellence, reliability engineering, and measurable outcomes.
  • Experience with multi-cloud platforms (AWS, Azure, Google Cloud) and their observability ecosystems is a plus.

GCS-23
For positions in this location, we offer a base pay of $169,600 - $296,800, plus equity (when applicable), variable/incentive compensation and benefits. Sales positions generally offer a competitive On Target Earnings (OTE) incentive compensation structure. Please note that the base pay shown is a guideline, and individual total compensation will vary based on factors such as qualifications, skill level, competencies, and work location. We also offer health plans, including flexible spending accounts, a 401(k) Plan with company match, ESPP, matching donations, a flexible time away plan and family leave programs. Compensation is based on the geographic location in which the role is located and is subject to change based on work location.
Additional Information
Work Personas
We approach our distributed world of work with flexibility and trust. Work personas (flexible, remote, or required in office) are categories that are assigned to ServiceNow employees depending on the nature of their work and their assigned work location. To determine eligibility for a work persona, ServiceNow may confirm the distance between your primary residence and the closest ServiceNow office using a third-party service.
Equal Opportunity Employer
ServiceNow is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, national origin or nationality, ancestry, age, disability, gender identity or expression, marital status, veteran status, or any other category protected by law. In addition, all qualified applicants with arrest or conviction records will be considered for employment in accordance with legal requirements.
Accommodations
We strive to create an accessible and inclusive experience for all candidates. If you require a reasonable accommodation to complete any part of the application process, or are unable to use this online application and need an alternative method to apply, please contact [email protected] for assistance.
Export Control Regulations
For positions requiring access to controlled technology subject to export control regulations, including the U.S. Export Administration Regulations (EAR), ServiceNow may be required to obtain export control approval from government authorities for certain individuals. All employment is contingent upon ServiceNow obtaining any export license or other approval that may be required by relevant export control authorities.
From Fortune. ©2025 Fortune Media IP Limited. All rights reserved. Used under license.

Top Skills

AWS
Azure
GCP
Grafana
Kafka
Kubernetes
OpenTelemetry
Prometheus
TSDB
VictoriaMetrics
