TensorWave Logo

TensorWave

Site Reliability Engineer

Reposted 9 Days Ago
Remote
Hiring Remotely in USA
Senior level
Remote
Hiring Remotely in USA
Senior level
The Senior SRE Engineer will design, build, and maintain resilient infrastructure systems, manage infrastructure-as-code, and write tooling in various languages.
The summary above was generated by AI

Our mission at Tensorwave Cloud is to build seamless, secure, reliable, and resilient AI infrastructure at scale, eliminating barriers and challenging the status quo to empower builders and support AI innovation.

About the role

We are seeking a Site Reliability Engineer with a strong background in software engineering to build and maintain highly scalable, secure, and resilient infrastructure.

You’ll play a critical role in designing low-level systems, automating infrastructure with modern tooling, and ensuring platform reliability.

This role is ideal for someone who’s comfortable working at the intersection of systems programming and DevOps - writing code in Go, Javascript, Rust, C, or Zig while also managing infrastructure with NixOS, Kubernetes, and Terraform.

Responsibilities

  • Design, build, and maintain infrastructure systems using Linux and NixOS

  • Manage infrastructure-as-code with Terraform to provision and scale resources

  • Architect and operate Kubernetes clusters with a focus on performance, security, and automation

  • Write high-performance tooling and internal utilities in Go or Rust

  • Develop and maintain CI/CD pipelines for infrastructure and code deployments

  • Monitor system performance, resolve issues, and improve reliability through observability tooling

  • Collaborate closely with engineering teams to support deployment strategies and development workflows

Required Experience

  • Bachelor of Science in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience

  • 5+ years in DevOps, Site Reliability, or Infrastructure Engineering roles

  • Proficiency in one or more low-level languages Rust or Go

  • Deep experience with Linux systems and configuration management

  • Hands-on experience with Terraform, Kubernetes, and containerized environments

  • Strong understanding of systems programming, performance tuning, and operating system internals

  • Familiarity with CI/CD practices and infrastructure monitoring/alerting tools

What We Bring

  • Mission driven company

  • Competitive Salary

  • Stock Options

  • 100% paid Medical, Dental, and Vision insurance

  • Flexible PTO

  • Paid Holidays

  • 401(k)

  • Parental Leave

  • Flexible Spending Account

  • Short Term Disability Insurance

  • Life and Voluntary Supplemental Insurance

  • Mental Health Benefits through Spring Health

We’re looking for resilient, adaptable people to join our team, people who believe in the mission and think at massive scale. The solutions that worked on a handful of devices will not work at Exascale. Be prepared to be pushed daily, to learn a lot, and literally build the future.

Tensorwave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, national origin, or veteran status.

Top Skills

C
Go
JavaScript
Kubernetes
Nixos
Rust
Terraform
Zig

Similar Jobs

Yesterday
Remote or Hybrid
New York, NY, USA
130K-180K Annually
Senior level
130K-180K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
The Staff Software Engineer will oversee SAP BTP CPI applications' operational support, manage incidents, collaborate with various teams, and ensure high system performance.
Top Skills: AbapCloud ApplicationsCpiErp SystemsIdocJSONOdataRestSap AribaSap BtpSap C4CSap CallidusSap Success FactorsSfapiSftpSoapWorkdayXML
6 Days Ago
Easy Apply
Remote
USA
Easy Apply
152K-179K Annually
Senior level
152K-179K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will enhance CI/CD frameworks, automate cloud infrastructure, manage Kubernetes and AWS services, and ensure operational excellence.
Top Skills: AnsibleAWSBashChefCi/CdDockerGitKubernetesPuppetPythonRubySaltTerraform
7 Days Ago
Remote or Hybrid
2 Locations
160K-180K Annually
Expert/Leader
160K-180K Annually
Expert/Leader
Artificial Intelligence • Other • Security • Software • Analytics • Big Data Analytics
The Lead Site Reliability Engineer will oversee the reliability and scalability of the infrastructure, lead a team in operational execution, ensure best practices in SRE, and mentor senior engineers.
Top Skills: Ci/CdDockerGitopsGoKubernetesLinuxPythonTerraform

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account