Cubist

Site Reliability Engineer (SRE), Platform Engineering Team

Reposted 18 Days Ago
Remote
Hiring Remotely in USA
Mid level

Overview:

Most of the world’s digital infrastructure has decades of reliability engineering behind it. Web3, by contrast, is newer and still catching up to the standards of modern production systems. At Cubist, we’re laying the groundwork the industry needs: secure, high-assurance infrastructure for managing cryptographic keys, which are the backbone of authentication and access control in blockchain-based systems.

Our flagship key management platform, CubeSigner, powers high-stakes financial systems across crypto and fintech. Major institutions and protocols use it to control billions in digital assets, where even a single signing error or second of downtime can have dire financial consequences.

As a Site Reliability Engineer (SRE) at Cubist, you’ll work on the foundation that keeps this platform running: scaling deployments, tightening observability, and building the automation and guardrails that keep critical infrastructure safe and performant.

About the Cubist Product:

At its core, CubeSigner is serious infrastructure: a signing platform that blends hardware security modules, enclaves, and cloud-native services to deliver performance, security, and auditability. It enforces programmable policies for how keys can be used and provides verifiable guarantees about every transaction. Because a key mediates everything a company does in Web3, CubeSigner is a critical part of every customer’s technology stack and serves as a collaboration point between many teams and functions.

Our Platform Team is responsible for maintaining SaaS, dedicated instance, and on-premise deployments across multiple geographic regions. CubeSigner is trusted by teams who treat reliability, latency, and security as non-negotiable (and who expect the same from their infrastructure partners).

About the Cubist Team:

Cubist’s founders include a former fintech executive and professors from UC San Diego and Carnegie Mellon University who have published over 80 papers on computer security. Members of the Cubist engineering team have designed and specified the cryptography underlying Ethereum and Avalanche, deployed fine-grained isolation in Firefox, discovered serious bugs in Google Chrome and Linux, and built the automated reasoning tools that companies like Amazon and Certora rely on.

We’ve raised over $15 million in funding led by Polychain, which gives us the runway to scale. More importantly, we are generating real revenue from serious teams that use CubeSigner as critical infrastructure. Our edge has always been the depth of our engineering; as we grow, that same focus on correctness, performance, and reliability continues to guide every architecture decision.

About the SRE role:

We're looking for a technically sharp and systems-minded SRE to drive improvements in observability, tracing, and logging across our engineering organization. You'll collaborate closely with engineers and leadership to proactively identify availability risks, embed observability into our CI/CD pipelines, and define sane guardrails for infrastructure and deployment. Your contributions will keep our systems resilient and performant, and will shape how we scale securely.

What we’re looking for:

  • Hands-on experience with cloud infrastructure, particularly AWS, and infrastructure-as-code using AWS CDK

  • Familiarity with observability and telemetry tools such as Sentry, OpenTelemetry, and the LGTM stack

  • Deep understanding of tracing, performance tuning, networking, and containerization

  • Experience building and maintaining reliable systems with a focus on platform scalability and resilience

  • Demonstrated ability to solve complex challenges

  • Proficiency in modern programming languages such as Rust, TypeScript, and Go, with strong familiarity with WebAssembly (Wasm) engines

  • Strong command of Git for version control and collaborative development workflows

  • Strong technical communication skills

What gives you an edge:

  • Familiarity with Kubernetes

  • Experience with observability stacks: Loki, Mimir, Tempo, Prometheus, Grafana

  • Experience with GCP

  • Knowledge of Terraform and Ansible

  • Exposure to Real User Monitoring (RUM) tools

Tasks and Responsibilities:

  • Improve tracing and logging across the engineering organization

  • Optimize operational workflows and tooling to drive efficiency and execution across the team

  • Communicate and collaborate effectively with engineers and leadership to identify improvements in performance and availability

  • Work with engineers to build observability into the CI/CD process

  • Create guardrails for CI/CD and infrastructure

  • Apply security best practices to ensure we meet the highest standards and relevant compliance requirements

Compensation & Perks:

  • Competitive salary and meaningful equity

  • 100% employer-paid medical, dental, vision and life insurance benefits

  • Company-sponsored 401(k), plus Health and Dependent Care FSAs

  • Flexible PTO so you can unplug, recharge, and come back inspired

  • Home office stipend to keep your setup sharp (we’re remote-first!)

  • Regular company retreats to connect and celebrate wins

Why Join Us?

As an SRE at Cubist, you’ll play a critical role in building, automating, and maintaining a high-assurance system. You’ll work across the stack, and with the product code itself, to drive impactful projects that solve real problems. Your expertise will shape how we scale, monitor, and secure our platform as we grow. This is a chance to bring your skills to a growing company at the forefront of security technology.

We’re a team of builders, problem-solvers, and big thinkers reimagining what security can be. Our culture values curiosity, precision, and teamwork, and we’re eager to work with others who share that mindset. If this sounds like your vibe, we’d love to talk.

Top Skills

Ansible
AWS
AWS CDK
GCP
Git
Go
Grafana
Kubernetes
LGTM
Loki
Mimir
OpenTelemetry
Prometheus
Rust
Sentry
Tempo
Terraform
TypeScript
WebAssembly
