Rockwell Automation Logo

Rockwell Automation

Senior Cloud Operations Engineer – Plex

Reposted Yesterday
In-Office or Remote
5 Locations
Senior level
In-Office or Remote
5 Locations
Senior level
The Senior Cloud Operations Engineer will maintain and scale our Kubernetes platform, implement infrastructure as code, and collaborate with teams for automation and incident response.
The summary above was generated by AI

Rockwell Automation is a global technology leader focused on helping the world’s manufacturers be more productive, sustainable, and agile. With more than 28,000 employees who make the world better every day, we know we have something special. Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale, and focus on clean water and green mobility - our people are energized problem solvers that take pride in how the work we do changes the world for the better.

We welcome all makers, forward thinkers, and problem solvers who are looking for a place to do their best work. And if that’s you we would love to have you join us!

Job Description

Position Summary:

We are looking for a Senior Cloud Operations Engineer with a focus on Kubernetes and Automation to join our Plex Cloud Operations team. You will support the application tier both in our private and public cloud data centers. You will maintain and assist scaling our Kubernetes-based platform to ensure high availability, security, and performance. You will work closely with platform, development, security, and infrastructure teams to automate operations and improve multi-cluster management. You will also participate in an on-call rotation to support critical operations. You will report to the Cloud Operations Manager.

Your Responsibilities:
  • Maintain and improve our Kubernetes platform, ensuring high availability and scalability.
  • Implement infrastructure/configuration as code to automate operations. (Terraform, Ansible, Helm, Flux, Kustomize)
  • Enhance observability and logging using OpenTelemetry and Elastic.
  • Building automated solutions that enable resiliency and self-healing of applications.
  • Managing Server Operating Systems (Windows and Linux).
  • Managing Web Servers (IIS 10).
  • Troubleshoot production incidents, perform root cause analysis, and drive reliability improvements.
  • Evaluate and implement cloud-native technologies to enhance platform efficiency.
  • Collaborate with security teams to ensure best practices for container security and compliance.
  • Work with multi-cluster management solutions such as Rancher, Cluster API (CAPI), or other Kubernetes fleet management tools.
  • Manage Kubernetes infrastructure on Azure and vSphere.
  • Participate in an on-call rotation to support platform operations and respond to incidents.
The Essentials - You Will Have:
  • Bachelor's Degree or equivalent years of relevant work experience.
  • Legal authorization to work in the U.S. We will not sponsor individuals for employment visas, now or in the future, for this job opening.
The Preferred - You Might Also Have:
  • Typically requires 5+ years of relevant professional experience in a cloud infrastructure, platform engineering, or operations role.
  • 3+ years working with Kubernetes in a production environment.
  • Proficiency with Terraform and Ansible.
  • Load balancer experience (F5 LTM, Azure Load Balancer)
  • Public Cloud experience (Microsoft Azure or Amazon Web Services)
  • Experience with Linux administration and container runtimes (Docker, containerd)
  • Familiarity with observability tools (OpenTelemetry, Elastic, PRTG, and Dynatrace).
  • Experience managing multi-cluster Kubernetes environments. (Rancher & Cluster API).
  • Solid understanding of RBAC, security policies, and secrets management in Kubernetes.· Hands-on experience with Azure and vSphere as Kubernetes infrastructure providers.
  • Capability to analyze packet captures using tools such as Wireshark.
  • Strong understanding of IPv4/IPv6, FTP, HTTP, SSL/TLS, HTML, XML
  • Knowledge of .Net website functionality.
  • The ability to participate in an on-call rotation for platform support.
  • Prior experience in SRE or Platform Engineering roles.
  • Degree in Computer Science or related area.
What We Offer:
  • Health Insurance including Medical, Dental and Vision
  • 401k
  • Paid Time off
  • Parental and Caregiver Leave
  • Flexible Work Schedule where you will work with your manager to enjoy a work schedule that can be flexible with your personal life.
  • To learn more about our benefits package, please visit at www.raquickfind.com.

This position is part of a job family. Experience will be the determining factor for position level and compensation.

At Rockwell Automation we are dedicated to building a diverse, inclusive and authentic workplace, so if you're excited about this role but your experience doesn't align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right person for this or other roles.

#LI-Remote

#LI-LifeAtROK

#LI-MG4

We are an Equal Opportunity Employer including disability and veterans. 

If you are an individual with a disability and you need assistance or a reasonable accommodation during the application process, please contact our services team at +1 (844) 404-7247.

Top Skills

Amazon Web Services
Ansible
Azure
Docker
Elastic
F5 Ltm
Flux
Helm
Kubernetes
Kustomize
Azure
Opentelemetry
Terraform
Vsphere

Similar Jobs

Mid level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Safety Modeling Engineer will develop and analyze models to assess collision outcomes and severity for automated driving systems, using statistical and machine learning methods.
Top Skills: Ci/CdDockerGitJenkinsJIRAKubernetesPoetryPythonSQLTerraform
Mid level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Lead the development of driver behavior models for automated driving systems, integrating statistical and machine learning models to analyze human performance in safety-critical scenarios.
Top Skills: DockerGitJenkinsJIRAKubernetesPythonSQLTerraform
4 Hours Ago
Remote or Hybrid
United States
Senior level
Senior level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The AV Safety Analytics Engineer will develop data analytics infrastructure for automated vehicle safety, utilizing cloud processing and statistical methods. Responsibilities include creating data visualizations, monitoring metrics, and ensuring data integrity across systems.
Top Skills: DockerGitJenkinsJIRAKubernetesNumpyPandasPlotly/DashPower BIPythonShinySQLTableauTerraform

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account