Optum Jobs

Site Reliability Engineer - Remote

Optum

Site Reliability Engineer - Remote

Posted Yesterday

Be an Early Applicant

In-Office or Remote

Hiring Remotely in Eden Prairie, MN

73K-130K Annually

Mid level

In-Office or Remote

Hiring Remotely in Eden Prairie, MN

73K-130K Annually

Mid level

Architect, build, and operate AWS commercial and government cloud infrastructure and platform services. Implement IaC, Kubernetes (EKS/AKS) management, observability, automation, incident response, and compliance (FedRAMP/NIST). Participate in on-call rotations and support production resiliency, performance, and security.

The summary above was generated by AI

Requisition Number: 2371543
Opportunities with Logistics Health Incorporated (LHI), part of the Optum family of business. We're dedicated to simplifying the logistics of complex workforce health programs with cost-effective solutions and a seamless distribution process. With offices in La Crosse, Wis., a satellite office in Chicago and remote employees throughout the country, we have a variety of rewarding career opportunities for you. Elevate your career as you help us create a healthier tomorrow for everyone and discover the meaning behind Caring. Connecting. Growing together.
The Site Reliability Engineer will architect, develop, and maintain Optum Serve's cloud environment in both the commercial and government AWS cloud. The role will work closely with software engineers, architects, and DevOps engineers to architect and maintain a secure, resilient and high performance cloud infrastructure.
To support this mission, OSIT has initiated a multi year modernization program aimed at updating and enhancing enterprise technology systems in accordance with modern design standards.
You'll enjoy the flexibility to work remotely * from anywhere within the U.S. as you take on some tough challenges. For all hires in the Minneapolis or Washington, D.C. area, you will be required to work in the office a minimum of four days per week.
Primary Responsibilities:

Build, maintain, and operate IaaS and PaaS infrastructure in AWS commercial and government clouds
Work closely with dev teams to identify and measure SLOs, SLAs and SLIs
Act a solid contributor to development of platform services including architecture, provisioning, configuration, deployment, and support
Perform integrations with central logging, metrics dashboards, instrumentation, incident monitoring and management
Build/integrate/administer systems and tools that enable engineering teams to observe their applications in production with autonomy (Dashboards, APMs)
Support software and/or cloud-infrastructure in an on-call rotation basis
Assist with identification and remediation of technical problems at the root cause by continuously implementing automation, self-healing, and real-time monitoring to production systems
Maintain and improve operational tooling, frameworks
Build frameworks that test the performance and resiliency of our platform services/tools
Automate alerts for metrics on performance, cost, vulnerabilities, risk, compliance violations
Improve processes and champion automation of any manual items around support.

You'll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in.
Required Qualifications:

3+ years of experience working within a cloud engineer/SRE role
Proven solid knowledge of AWS services (ex. VPC, EC2, S3, ECS, Cloudformation, Lambda, EKS, RDS, ELB, Route53, RedShift)
Proven expert knowledge and hands on production experience in Kubernetes (EKS or AKS) cluster setup and management required.
Experience with infrastructure as code (IaC) tools like Terraform
Experience with Kubernetes deployment tools like Helm, ArgoCD, Flux
Demonstrated solid awareness of networking and internet protocols
Proven understanding of identity and access management (IAM)
Experience supporting infrastructure in production cloud environments
Proven knowledge of Encryption (KMS), Public Key Infrastructure (PKI), understanding of OWASP
Experience working with RESTful services
Experience supporting environments adhering compliance standards like FedRAMP and NIST (800-171|53)
Experience with monitoring tools (CloudWatch, VPC Flow Logs, Splunk, Dynatrace, Graphana, Prometheus)
Demonstrated familiarity with IDEs and Source Control tools like Azure DevOps, Github or Gitlab.
Ability to participate in 24/7 on-call rotation
United States Citizenship
If you are offered this position, you will be required to provide extensive personal information to obtain and maintain a suitability or determination of eligibility for a Confidential/Secret or Top Secret security clearance as a condition of your employment

Preferred Qualifications:

Bachelor's Degree in Computer Science, Information Technology, Software Engineering, Math, Physics
Master's Degree with coursework focused on advanced algorithms, mathematics in computing, data structures or related field
Expert knowledge of deploying Production grade applications in AWS
Demonstrated passion about infrastructure automation

*All employees working remotely will be required to adhere to UnitedHealth Group's Telecommuter Policy
Pay is based on several factors including but not limited to local labor markets, education, work experience, certifications, etc. In addition to your salary, we offer benefits such as, a comprehensive benefits package, incentive and recognition programs, equity stock purchase and 401k contribution (all benefits are subject to eligibility requirements). No matter where or when you begin a career with us, you'll find a far-reaching choice of benefits and incentives. The salary for this role will range from $72,800 to $130,000 annually based on full-time employment. We comply with all minimum wage laws as applicable.
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.
UnitedHealth Group is an Equal Employment Opportunity employer under applicable law and qualified applicants will receive consideration for employment without regard to race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status, or any other characteristic protected by local, state, or federal laws, rules, or regulations.
UnitedHealth Group is a drug - free workplace. Candidates are required to pass a drug test before beginning employment.

Similar Jobs at Optum

Optum

Site Reliability Engineer

8 Days Ago

In-Office or Remote

Expert/Leader

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics

Define and scale SRE standards across teams, implement SLOs/SLIs/error budgets, build observability and resiliency patterns, drive automation and AIOps, improve reliability for large-scale Azure cloud systems, and influence engineering and platform teams.

Top Skills: Ai/MlAiopsAutomationAzureError BudgetsIncident ManagementLogsObservability (MetricsOpentelemetrySlisSlosTracing)

Optum

Site Reliability Engineer

13 Days Ago

In-Office or Remote

73K-130K Annually

Mid level

73K-130K Annually

Mid level

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics

The Site Reliability Engineer will design, develop, and support a secure cloud infrastructure while collaborating with development and DevOps teams, ensuring high performance and reliability of systems.

Top Skills: AWSAzureDynatraceGrafanaKubernetesPrometheusPulumiSplunkTerraform

Optum

Senior Site Reliability Engineer

13 Days Ago

In-Office or Remote

92K-164K Annually

Senior level

92K-164K Annually

Senior level

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics

The Senior Site Reliability Engineer will architect and maintain cloud infrastructure, collaborating with software and DevOps engineers while ensuring security and performance.

Top Skills: ArgocdAWSAzureAzure MonitorDynatraceFluxGraphanaHelmKubernetesPrometheusPulumiRestful ServicesSplunkTerraform

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus