Underdog Fantasy Logo

Underdog Fantasy

Senior Site Reliability Engineer

Posted 17 Days Ago
Remote
2 Locations
150K-180K
Senior level
Remote
2 Locations
150K-180K
Senior level
The Senior Site Reliability Engineer will manage incident responses, guide SLO setups, lead capacity planning, develop disaster recovery plans, and collaborate on architecture for high availability, while focusing on automation and tooling.
The summary above was generated by AI
We’re Underdog.

The fastest-growing sports gaming company – ever.

We build innovative games, products, and experiences for American sports fans.

We’re here to shake up the fastest growing industry with bold ideas, custom-built tech, and the drive to win.

Founded in 2020, our team has built four of today’s most widely played fantasy games and launched the Underdog Sportsbook – built entirely in-house with our own technology. That means we control our product, move fast, and create experiences you won’t find anywhere else. 

In just over two years, we’ve reached over a $1.2 billion valuation, with investors like BlackRock, Spark Capital, SV Angel, Mark Cuban, Kevin Durant, and Adam Schefter. And we’re just getting started.

At Underdog, we believe that sports are for everyone. Join us.

What you’ll do:

  • Own and maintain the incident response process, including defining procedures, tools, and best practices
  • Guide teams in establishing and monitoring Service Level Objectives (SLOs), including setting up alerts and reporting systems
  • Lead capacity planning initiatives, focusing on both short and long-term scalability while optimizing costs
  • Develop and implement disaster recovery plans, including regular testing and regulatory compliance
  • Collaborate with teams on architecture decisions to ensure high availability and scalability
  • Manage launch and event planning for high-traffic occasions, focusing on infrastructure preparation and capacity management (a.k.a. Launch Readiness)
  • Act as an internal expert and consultant for monitoring tools like Datadog and Pagerduty and infrastructure like AWS and Kubernetes
  • Emphasis on automation and tooling to scale our workload
  • Jump in and out of repos written in languages like Ruby, Python, Go, Typescript, Swift, Kotlin, and SQL to support efforts described above

Who you are:

  • 6+ years of experience in site reliability engineering, cloud infrastructure, and/or web application development
  • A strong written and verbal communicator
  • Collaborative by nature
  • Someone who enjoys using research, data, and experiments to make decisions; you believe “Hope is not a strategy.”
  • You enjoy working directly with customers (generally engineers or other people inside the company)
  • You think long-term about what is best for the business and its customers
  • You are excited to take ownership
  • You are very comfortable around an IDE, working with multiple languages, multiple web application frameworks, AWS services, Kubernetes, PostgreSQL
  • You can work independently to learn new languages/technologies as needed
  • You enjoy deploying changes to production quickly, multiple times a week if necessary

Even better if you have:

  • Experience with PostgreSQL SQL query optimization, tweaking autovacuum settings, table statistics, different index types, etc.
  • Experience with Redis/Valley Optimization
  • Experience with Datadog or similar products
  • Experience working as a web application developer, frontend or backend, especially in React and Ruby on Rails
  • Experience with AWS cost optimization
  • Read the Google SRE books or similar books, or have other forms of SRE training
  • Actively leveraging the capabilities of AI to augment abilities and gain knowledge about interested domains


Our target starting base salary range for this position is between $150,000 and $180,000, plus target equity. The starting base salary will depend on a number of factors including the candidate’s skills and experience, among other things.

What we can offer you:

  • Unlimited PTO (we're extremely flexible with the exception of the first few weeks before & into the NFL season)
  • 16 weeks of fully paid parental leave
  • A $500 home office allowance
  • A connected virtual first culture with a highly engaged distributed workforce
  • 5% 401k match, FSA, company paid health, dental, vision plan options for employees and dependents

#LI-REMOTE

This position may require sports betting licensure based on certain state regulations.

Underdog is an equal opportunity employer and doesn't discriminate on the basis of creed, race, sexual orientation, gender, age, disability status, or any other defining characteristic.

Top Skills

AWS
Datadog
Go
Kotlin
Kubernetes
Pagerduty
Postgres
Python
Redis
Ruby
SQL
Swift
Typescript

Similar Jobs

2 Hours Ago
Remote
United States
161K-180K Annually
Senior level
161K-180K Annually
Senior level
Consumer Web • Digital Media • Information Technology • News + Entertainment • Social Media
The Senior Site Reliability Engineer will enhance infrastructure resilience, optimize system performance, and improve both physical and cloud systems while collaborating with engineering teams.
Top Skills: AnsibleCC++DockerGoJavaKubernetesPythonTerraformUnix/Linux
6 Days Ago
Easy Apply
Remote
13 Locations
Easy Apply
160K-185K
Senior level
160K-185K
Senior level
Consumer Web • Enterprise Web • Mobile • Productivity • Software
The Senior Site Reliability Engineer will ensure service reliability and performance while optimizing DevOps practices for efficient deployment and automation in a fast-paced environment.
Top Skills: AWSAzureBashDatadogDockerGCPGitlab CiGoGrafanaJenkinsKubernetesMetabasePrometheusPythonTerraform
Yesterday
Remote
United States
118K-231K Annually
Senior level
118K-231K Annually
Senior level
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will design, implement, and enhance systems for infrastructure development, focusing on automation, reliability, and developer experience.
Top Skills: AWSAzureBazelCrossplaneGCPGithub ActionsKubernetesTerraform

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account