Vanguard

Senior Reliability Engineer

Reposted 2 Days Ago
In-Office
2 Locations
Mid level
The Observability and Resiliency Engineer will improve engineering practices, support resilient application design, implement OpenTelemetry for JavaScript, and contribute to architectural solutions.
The summary above was generated by AI
Shape the Future of Observability at Vanguard

Join PITech's Site Reliability Engineering team and lead cutting-edge SRE initiatives that impact hundreds of applications and millions of investors. You'll architect and build enterprise-scale resiliency solutions, driving our ambitious 2026 roadmap. This is an opportunity to combine deep technical expertise with strategic influence: designing OpenTelemetry integrations, implementing distributed tracing at scale, automating incident response, and pioneering AI-enhanced diagnostics and analysis. Work alongside a collaborative, technically focused team where your innovations in resilience engineering will shape Vanguard's next generation of client experiences.

At Vanguard, we pride ourselves on delivering an exceptional client experience to all investors; at the core of this experience are systems that reside in a technically complex and constantly evolving resiliency landscape. Passionate, technically skilled engineers are at the center of our resiliency operations, and we are looking to grow our team.

We are seeking an experienced engineer with broad, end-to-end software development experience, including operating applications in a microservices environment in production at scale. This role goes beyond feature implementation: it requires someone who can design, build, and support resilient systems from the ground up.

As a Senior Reliability Engineer at Vanguard, you will play a critical role in solving impactful operational problems. You are curious and take a proactive approach to identifying problems and making improvements. You balance innovative thinking with pragmatism and understand the long-term impacts of technical decisions. You communicate complex ideas clearly and collaborate effectively to deliver scalable solutions.

Core Responsibilities
  • Improve resiliency engineering practices across platforms and applications, including resilient application design patterns, system observability, and deployment strategies.
  • Detect, troubleshoot, and resolve incidents.
  • Develop automation for incident response and infrastructure management.
  • Develop and support OpenTelemetry integrations for multiple application platforms (browser, ECS, Lambda, etc.) and languages (JavaScript, Java).
  • Contribute to architectural decisions and support implementation of solutions.
Skills and Qualifications
  • Expertise in JavaScript (server-side and client-side execution environments) or Java.
  • Working knowledge of Python (or a similar scripting language).
  • Strong knowledge of resiliency engineering techniques for both platforms and applications.
  • Experience troubleshooting complex production issues and implementing effective mitigations.
  • Hands-on experience with AWS services and cloud infrastructure.
  • Familiarity with the OpenTelemetry specification and core APIs.
  • Practical experience developing and operating software in distributed systems environments.

Special Factors

Sponsorship

Vanguard is not offering visa sponsorship for this position.

About Vanguard

At Vanguard, we don't just have a mission—we're on a mission.

To work for the long-term financial wellbeing of our clients. To lead through products and services that transform our clients' lives. To learn and develop our skills as individuals and as a team. From Malvern to Melbourne, our mission drives us forward and inspires us to be our best.

How We Work

Vanguard has implemented a hybrid working model for the majority of our crew members, designed to capture the benefits of enhanced flexibility while enabling in-person learning, collaboration, and connection. We believe our mission-driven and highly collaborative culture is a critical enabler to support long-term client outcomes and enrich the employee experience.

Top Skills

AWS
JavaScript
Microservices
OpenTelemetry

Vanguard Charlotte, North Carolina, USA Office

Two North Falls Plaza, Charlotte, NC, United States, 28217


