Senior Site Reliability Engineer - Observability

Dimensional Fund Advisors

Reposted 4 Days Ago

In-Office

Charlotte, NC, USA

Senior level

Own reliability and scalability of on-prem observability platforms (ELK, Grafana); handle production escalations, capacity planning, SLOs, onboarding, automation, IaC (Terraform/Helm/Ansible), upgrades, security hardening, and platform modernization.

The summary above was generated by AI

Job Description:

About the Role:

We are looking for a Senior SRE to join our Platform Engineering team as the operations owner of our observability platforms. You’ll be responsible for the reliability, scalability, and continued evolution of the tools that give our engineering organization visibility into everything they build and run. The current observability platform consists primarily of an on-premises ELK (Elasticsearch, Logstash, Kibana) Stack and Grafana, with some exposure to New Relic and SolarWinds.

This is a hybrid role: roughly half your time will be spent on steady-state operations and platform support, and the other half on engineering projects that meaningfully advance the platforms you support. It’s a great fit for someone who is genuinely motivated by the pursuit of excellence: not just sustaining what works, but relentlessly refining it. You take pride in the platforms you own, and that pride drives you to keep improving them, whether that means tightening an SLO, eliminating a source of toil, or building something that gives teams faster insight into their systems.

What You’ll Work On:

Operations & Reliability (~ 50%)

Serve as a primary escalation point for production support involving the ELK Stack, Grafana, and New Relic
Own platform health, capacity planning, and performance tuning for on-premises observability infrastructure – including Elasticsearch cluster management, index lifecycle policies, and retention strategies
Monitor and maintain SLOs for the observability platforms, ensuring the tools engineers depend on are highly available and performant
Support engineering teams in onboarding to observability platforms – helping teams instrument their applications, build dashboards, and define meaningful alerts
Manage patching, upgrades, and configuration management across the observability stack
Collaborate with security to harden platform configurations and manage software vulnerabilities
Contribute to on-call rotations and maintain runbooks and escalation procedures

Platform Engineering (~ 50%)

Design and build tooling/automation to reduce toil and improve the experience for teams using observability platforms
Lead or contribute to platform modernization initiatives – e.g., improving ingestion pipelines, scaling platform capacity, standardizing Grafana dashboard and alerting patterns, or evaluating new capabilities within the existing stack
Develop and maintain infrastructure-as-code (Terraform, Helm, Ansible, etc.) for platform components
Build and enforce standards around logging, metrics, and alerting that help engineering teams adopt observability best practices at scale
Participate in design reviews and contribute to the overall platform roadmap

What We’re Looking For:

Bachelor’s degree in a technical field or equivalent practical experience
5+ years of experience in SRE, DevOps, or platform engineering roles
Deep hands-on experience with the ELK Stack – Elasticsearch cluster operations, Logstash pipeline development, Kibana, and index lifecycle management
Strong experience with Grafana, including data source integrations, dashboard design, and alerting
Solid understanding of observability principles
Experience operating on-premises infrastructure, including capacity planning, server management, and the operational tradeoffs relative to managed cloud services
Proficiency in Python for automation and tooling; familiarity with shell scripting
Strong Linux systems knowledge and comfort working with configuration management tools (e.g., Ansible, Chef, or Puppet)
Demonstrated ability to drive incidents to resolution and communicate clearly under pressure
A bias toward automation and a low tolerance for repetitive manual work

Nice to Have:

Experience with Prometheus
Experience with New Relic administration or APM instrumentation
Familiarity with log shipping agents and pipeline tools such as Beats, Fluentd, or Fluent Bit
Experience with distributed tracing tools like OpenTelemetry
Exposure to cloud-based observability offerings and experience thinking through hybrid strategies
Prior experience building or governing observability standards across a large engineering organization

Dimensional offers a variety of programs to help take care of you, your family, and your career, including comprehensive benefits, educational initiatives, and special celebrations of our history, culture, and growth.

It is the policy of the Company to provide equal opportunity for all employees and applicants. The Company recruits, hires, trains, promotes, compensates, and administers all personnel actions without regard to actual or perceived race, color, religion, religious practice, creed, sex, sex stereotyping, pregnancy (which includes pregnancy, childbirth, and medical conditions related to pregnancy, childbirth, or breastfeeding), caregiver status, gender, gender identity, gender expression, transgender identity, national origin, age, mental or physical disability, ancestry, medical condition, marital status, familial status, domestic partnership status, military or veteran status or service, unemployment status, citizenship status or alienage, sexual orientation, status as a victim of domestic violence, status as a victim of stalking, status as a victim of sex offenses, genetic information, political activities or recreational activities, arrest or conviction record, salary history, natural hairstyle or any other status protected by applicable law except as otherwise required or permitted by law or regulation applicable to the Company or its affiliates.
