Prove Logo

Prove

Senior Site Reliability Engineer

Posted 15 Days Ago
Remote
Hiring Remotely in United States
165K-180K
Senior level
Remote
Hiring Remotely in United States
165K-180K
Senior level
The Senior Site Reliability Engineer will design, implement, and manage scalable systems, enhance observability, and ensure high availability of services while collaborating with engineering teams.
The summary above was generated by AI
About Prove 

As the world moves to a mobile-first economy, businesses need to modernize how they acquire, engage with and enable consumers. Prove’s phone-centric identity tokenization and passive cryptographic authentication solutions reduce friction, enhance security and privacy across all digital channels, and accelerate revenues while reducing operating expenses and fraud losses. Over 1,000 enterprise customers use Prove’s platform to process 20 billion customer requests annually across industries, including banking, lending, healthcare, gaming, crypto, e-commerce, marketplaces, and payments. For the latest updates from Prove, follow us on LinkedIn.

Prove is driving the future of digital identity. We are looking for Provers who know how to make an impact. We’re talking self-starting professionals who thrive in a fast-paced environment, process information quickly, and make intelligent decisions. The work is challenging and requires not only smart but natural curiosity and tenacity. Teamwork is also important to us – we work together and play together.   

Prove has big plans, and we’re excited about the future. If this sounds like the place for you – come join our team! 

Title: Senior Site Reliability Engineer

Department: Platform Engineering 

Reports To: Manager, Site Reliability

FLSA Status: Exempt

Location: Seattle, WA


Position Overview

We are seeking an experienced Senior Site Reliability Engineer to join our Platform Engineering team. In this role, you will be instrumental in designing, implementing, maintaining and deploying highly available complex, scalable and reliable systems leveraging automation, effective monitoring and infrastructure-as code. Working closely with our application engineering teams to ensure our services meet the highest standards of reliability, performance, and security.


Key Responsibilities


Observability Leadership

  • Design and implement comprehensive observability solutions across our infrastructure and within applications
  • Lead the initiative to establish a companywide instrumentation standard based in Opentelemetry wide events.
  • Build advanced monitoring dashboards that provide real-time visibility into system health and performance
  •  Establish metrics, logging, and tracing systems that enable quick identification and resolution of issues
  •  Create alerting thresholds and automated responses based on service level objectives (SLOs)
  •  Drive a culture of observability throughout the engineering organization

Container Orchestration

  •   Lead Kubernetes cluster management, optimization, and scaling initiatives
  •   Design and implement infrastructure-as-code deployments for container-based applications
  •   Optimize container resource allocation and utilization
  •   Build automated deployment pipelines that ensure consistent, reliable releases
  •   Establish best practices for containerization and orchestration across teams

Infrastructure Management

  •  Design, build, and maintain scalable cloud infrastructure on AWS
  •  Implement infrastructure-as-code using tools such as Terraform
  •  Automate routine operational tasks to reduce toil and improve efficiency
  •  Ensure infrastructure security compliance and implement least-privilege access controls
  •  Optimize cloud resource utilization and costs

Incident Response

  •   Integrate observability-driven alerts with our Incident Management systems
  •   Lead incident response efforts during service disruptions
  •   Conduct thorough post-incident reviews and implement preventative measures
  •   Use observability data to perform root cause analysis and system improvements
  •   Document incidents, responses, and lessons learned to build organizational knowledge

Performance Optimization

  •   Identify and resolve performance bottlenecks across the technology stack
  •   Conduct capacity planning and scaling exercises to meet future demands
  •   Implement auto-scaling solutions based on performance metrics
  •   Optimize database performance and query efficiency
  •   Design and implement application stress testing methods and systems

Required Qualifications

  • 5+ years of experience in Site Reliability Engineering, DevOps, or similar roles. Software Engineering roles with a strong infrastructure and production engineering aspect also qualify.
  • Expert knowledge of observability platforms and practices (OpenTelemetry, Prometheus, Grafana, Jaeger, ELK stack / Splunk, etc)
  • Experience with Kubernetes and container orchestration
  • Strong experience with infrastructure-as-code tools (Terraform, CloudFormation, Pulumi)
  • Proficiency in at least one programming language ( Go, Python, Java)
  • Deep understanding of cloud platforms (AWS, GCP, or Azure)
  • Experience implementing and managing CI/CD pipelines
  • Knowledge of network architecture and security principles
  • Bachelor's degree in Computer Science, Engineering, or equivalent practical experience

Preferred Qualifications


  • Experience with distributed systems and microservice architectures
  • Knowledge of security best practices and compliance requirements
  • Experience with database administration (PostgreSQL, MySQL)
  • Familiarity with service mesh technologies (Istio, Linkerd)
  • Contributions to open-source projects
  • AWS/GCP certifications
  • Experience in the identity verification or financial technology industry
  • Hands-on experience with OpenTelemetry standards and technology
  • Application development experience

This position description should not be considered the final description of the position. The position description is not intended to be an all-inclusive list of duties and standards of the positions. It should be assumed that we would, to some extent, structure responsibilities in accordance with the successful candidate’s capabilities and changing business conditions. Incumbents will follow any other instructions, and perform any other related duties, as assigned by their supervisor.

The anticipated salary range for this role is $165,000 - $180,000 plus variable commission / company bonus. Offered salary will be determined by the applicant’s education, experience, knowledge, skills, geo-location and abilities, as well as internal equity and alignment with market data.

Benefits & Perks for FTE Provers:

  • Competitive salaries & Bonus Plan (for eligible roles) and Equity Plan
  • Modern Health for financial, mental, and physical wellness
  • 401(k) Retirement Plan & Match (US Offices) and Local Country Pension (International Offices)
  • Unlimited Vacation and Flexible hours
  • Comprehensive medical benefits for you and your family ❤️
  • Emotional & Physical Wellness – Access to wellness services (EAP & Prove Well-Being Reimbursement)
  • Bottomless snacks & beverages for certain office locations
  • Daily GrubHub stipend for lunch if coming into the office (US Offices)
  • A great place to work and connect with other talented Provers like yourself!

Don’t meet every single requirement? Studies have shown that women and people of color are less likely to apply to jobs unless they meet every single qualification. At Prove we are dedicated to building a diverse, inclusive and authentic workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyways. You may be just the right candidate for this or other roles.

Equal Opportunity Employment:
Prove is an equal opportunity employer committed to providing equal employment opportunity for all people regardless of race, color, religion, gender or sexual orientation, age, marital status, national origin, citizenship status, disability, veteran status or other personal characteristics 

Privacy & Data Protection:
When you are applying for a job at Prove, we collect and use your personal information in the job application process. To understand more about how Prove uses your personal information, please see our Recruitment Privacy Policy on our website.

Top Skills

AWS
Azure
CloudFormation
Elk Stack
GCP
Go
Grafana
Jaeger
Java
Kubernetes
Opentelemetry
Prometheus
Pulumi
Python
Splunk
Terraform

Similar Jobs

Yesterday
In-Office or Remote
San Francisco, CA, USA
172K-269K Annually
Senior level
172K-269K Annually
Senior level
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
As a Senior SRE Manager, you will lead a team of Site Reliability Engineers, promote cultural change in technical quality, and influence organizational strategy.
Top Skills: Agile Software Development MethodologiesLarge-Scale Distributed SystemsMicroservicesSoftware Engineering Principles
18 Days Ago
Easy Apply
Remote
US
Easy Apply
124K-266K Annually
Senior level
124K-266K Annually
Senior level
Cloud • Security • Software • Cybersecurity • Automation
As a Senior Site Reliability Engineer, you'll ensure reliability and scalability of user-facing services, automate workflows, and uphold compliance standards for public sector services.
Top Skills: AnsibleAWSElkGCPGitlabGoGrafanaKubernetesPrometheusRubyTerraform
17 Hours Ago
Remote
USA
130K-140K
Senior level
130K-140K
Senior level
Consumer Web • Digital Media • Software
The Senior Site Reliability Engineer will manage system incidents, enhance monitoring and database infrastructure, and collaborate on scalable systems to maintain reliability as usage scales.
Top Skills: AWSClickhouseKubernetesMySQLPostgresRedis

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account