Optum

Site Reliability Engineer - Remote

Posted Yesterday
In-Office or Remote
Hiring Remotely in Eden Prairie, MN
73K-130K Annually
Mid level
Architect, build, and operate AWS commercial and government cloud infrastructure and platform services. Implement IaC, Kubernetes (EKS/AKS) management, observability, automation, incident response, and compliance (FedRAMP/NIST). Participate in on-call rotations and support production resiliency, performance, and security.
The summary above was generated by AI
Requisition Number: 2371543
Opportunities with Logistics Health Incorporated (LHI), part of the Optum family of businesses. We're dedicated to simplifying the logistics of complex workforce health programs with cost-effective solutions and a seamless distribution process. With offices in La Crosse, Wis., a satellite office in Chicago and remote employees throughout the country, we have a variety of rewarding career opportunities for you. Elevate your career as you help us create a healthier tomorrow for everyone and discover the meaning behind Caring. Connecting. Growing together.
The Site Reliability Engineer will architect, develop, and maintain Optum Serve's cloud environment in both the commercial and government AWS clouds. The role will work closely with software engineers, architects, and DevOps engineers to architect and maintain a secure, resilient, and high-performance cloud infrastructure.
To support this mission, OSIT has initiated a multi-year modernization program aimed at updating and enhancing enterprise technology systems in accordance with modern design standards.
You'll enjoy the flexibility to work remotely* from anywhere within the U.S. as you take on some tough challenges. For all hires in the Minneapolis or Washington, D.C. area, you will be required to work in the office a minimum of four days per week.
Primary Responsibilities:
  • Build, maintain, and operate IaaS and PaaS infrastructure in AWS commercial and government clouds
  • Work closely with dev teams to identify and measure SLOs, SLAs and SLIs
  • Act as a solid contributor to the development of platform services, including architecture, provisioning, configuration, deployment, and support
  • Perform integrations with central logging, metrics dashboards, instrumentation, incident monitoring and management
  • Build/integrate/administer systems and tools that enable engineering teams to observe their applications in production with autonomy (Dashboards, APMs)
  • Support software and/or cloud infrastructure on an on-call rotation basis
  • Assist with identification and remediation of technical problems at the root cause by continuously implementing automation, self-healing, and real-time monitoring for production systems
  • Maintain and improve operational tooling and frameworks
  • Build frameworks that test the performance and resiliency of our platform services/tools
  • Automate alerts for metrics on performance, cost, vulnerabilities, risk, and compliance violations
  • Improve processes and champion automation of any manual items around support

You'll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in.
Required Qualifications:
  • 3+ years of experience working within a cloud engineer/SRE role
  • Proven solid knowledge of AWS services (e.g., VPC, EC2, S3, ECS, CloudFormation, Lambda, EKS, RDS, ELB, Route 53, Redshift)
  • Proven expert knowledge and hands-on production experience in Kubernetes (EKS or AKS) cluster setup and management
  • Experience with infrastructure as code (IaC) tools like Terraform
  • Experience with Kubernetes deployment tools like Helm, ArgoCD, Flux
  • Demonstrated solid awareness of networking and internet protocols
  • Proven understanding of identity and access management (IAM)
  • Experience supporting infrastructure in production cloud environments
  • Proven knowledge of Encryption (KMS), Public Key Infrastructure (PKI), understanding of OWASP
  • Experience working with RESTful services
  • Experience supporting environments adhering to compliance standards like FedRAMP and NIST (SP 800-171, SP 800-53)
  • Experience with monitoring tools (CloudWatch, VPC Flow Logs, Splunk, Dynatrace, Grafana, Prometheus)
  • Demonstrated familiarity with IDEs and source control tools like Azure DevOps, GitHub or GitLab
  • Ability to participate in 24/7 on-call rotation
  • United States Citizenship
  • If you are offered this position, you will be required to provide extensive personal information to obtain and maintain a suitability or determination of eligibility for a Confidential/Secret or Top Secret security clearance as a condition of your employment

Preferred Qualifications:
  • Bachelor's Degree in Computer Science, Information Technology, Software Engineering, Math, or Physics
  • Master's Degree with coursework focused on advanced algorithms, mathematics in computing, data structures or related field
  • Expert knowledge of deploying production-grade applications in AWS
  • Demonstrated passion for infrastructure automation

*All employees working remotely will be required to adhere to UnitedHealth Group's Telecommuter Policy
Pay is based on several factors including but not limited to local labor markets, education, work experience, certifications, etc. In addition to your salary, we offer benefits such as a comprehensive benefits package, incentive and recognition programs, equity stock purchase, and 401k contribution (all benefits are subject to eligibility requirements). No matter where or when you begin a career with us, you'll find a far-reaching choice of benefits and incentives. The salary for this role will range from $72,800 to $130,000 annually based on full-time employment. We comply with all minimum wage laws as applicable.
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone, of every race, gender, sexuality, age, location and income, deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes, an enterprise priority reflected in our mission.
UnitedHealth Group is an Equal Employment Opportunity employer under applicable law and qualified applicants will receive consideration for employment without regard to race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status, or any other characteristic protected by local, state, or federal laws, rules, or regulations.
UnitedHealth Group is a drug-free workplace. Candidates are required to pass a drug test before beginning employment.


