
Upstart

Director of Reliability

Posted 2 Days Ago
Remote
2 Locations
217K-301K Annually
Senior level
The Director of Reliability will lead the Site Reliability Engineering team to ensure platform reliability, performance, and scalability, implementing automation and observability practices while aligning SRE initiatives with business objectives.
The summary above was generated by AI

About Upstart

Upstart is the leading AI lending marketplace partnering with banks and credit unions to expand access to affordable credit. By leveraging Upstart's AI marketplace, Upstart-powered banks and credit unions can have higher approval rates and lower loss rates across races, ages, and genders, while simultaneously delivering the exceptional digital-first lending experience their customers demand. More than 80% of borrowers are approved instantly, with zero documentation to upload.

Upstart is a digital-first company, which means that most Upstarters live and work anywhere in the United States. However, we also have offices in San Mateo, California; Columbus, Ohio; and Austin, Texas.

Most Upstarters join us because they connect with our mission of enabling access to effortless credit based on true risk. If you are energized by the impact you can make at Upstart, we’d love to hear from you!

The Team

As the Director of Reliability, you’ll be the strategic force ensuring our platform is not only always-on but also relentlessly performant and scalable. You’ll lead the Site Reliability Engineering (SRE), Compute, Quality, Runtime, and Deployment teams to build resilient systems while championing automation, observability, and incident excellence. This role is at the heart of Upstart’s mission: making credit more accessible and fair by ensuring the tech behind it never misses a beat.

Your leadership will transform how we build, deploy, and maintain reliable systems, ensuring our technology is an enabler rather than a bottleneck. As the driving force behind resilient and scalable infrastructure, you will enable our company to deliver exceptional customer experiences, support innovation, and sustain long-term growth.

This isn’t just an operational role—it’s a strategic leadership position that will define the future of our platform’s reliability and performance.

How you’ll make an impact:

  • You will proactively prevent downtime and service disruptions by implementing robust monitoring, alerting, and automation strategies.
  • Your efforts in optimizing system performance will improve response times, reduce latency, and enhance overall customer satisfaction.
  • By championing automation and observability, you will reduce manual toil and free up engineering teams to focus on innovation.
  • Your leadership will help create self-healing systems, reducing the need for reactive firefighting and improving developer productivity.
  • You will lead the development of a world-class incident response process, ensuring quick resolution of outages and minimizing business impact.
  • You will empower teams with SRE best practices, breaking down silos between development, operations, and security teams.
  • By aligning SRE initiatives with business objectives, you will help balance reliability with speed of innovation, ensuring that engineering teams can ship features quickly without sacrificing stability.
  • Your contributions will directly support revenue growth by reducing service disruptions and ensuring a seamless user experience.


What we’re looking for: 

  • Minimum requirements:
    • 10+ years of experience in software engineering, DevOps, or Site Reliability Engineering, with at least 5 years in a leadership role.
    • Proven experience leading large-scale, mission-critical distributed systems with a focus on reliability, scalability, and security.
    • Expertise in cloud platforms such as AWS, Azure, or Google Cloud.
    • Strong background in observability tools like Prometheus, Grafana, Datadog, New Relic, or Splunk.
    • Experience with infrastructure as code (Terraform, CloudFormation) and containerization (Docker, Kubernetes).
    • Strong understanding of networking, security, and performance optimization.
    • Demonstrated success in building high-performing SRE teams and implementing best practices.
  • Preferred qualifications:
    • Experience building and leading teams that deliver big impact
    • Experience developing and maintaining large-scale distributed systems in AWS
    • Ability to influence and lead others without direct authority
    • Strong product and analytical mindset that allows you to think in terms of ROI, risk, and trade-offs
    • Experience working at companies that have gone through periods of rapid business or organizational growth while maintaining high standards


Position Location - This role is available in the following locations: Remote, San Mateo, Columbus, Austin 

Time Zone Requirements - This team operates on the East/West Coast time zones.

Travel Requirements - Because Upstart is a digital-first company, the majority of your work can be accomplished remotely. The majority of our employees can live and work anywhere in the U.S. but are still encouraged to spend high-quality time collaborating in person via regular onsites. The cadence of in-person sessions varies depending on the team and role; the Engineering team meets quarterly for 4-5 consecutive days at a time.


What you'll love: 

  • Competitive Compensation (base + bonus & equity)
  • Comprehensive medical, dental, and vision coverage with Health Savings Account contributions from Upstart 
  • 401(k) with 100% company match up to $4,500, immediate vesting, and after-tax savings
  • Employee Stock Purchase Plan (ESPP)
  • Life and disability insurance
  • Generous holiday, vacation, sick and safety leave  
  • Supportive parental, family care, and military leave programs
  • Annual wellness, technology & ergonomic reimbursement programs
  • Social activities including team events and onsites, all-company updates, employee resource groups (ERGs), and other interest groups such as book clubs, fitness, investing, and volunteering
  • Catered lunches + snacks & drinks when working in offices

 


At Upstart, your base pay is one part of your total compensation package. The anticipated base salary for this position is expected to be within the range below. Your actual base pay will depend on your geographic location: with our “digital first” philosophy, Upstart uses compensation regions that vary depending on location. Individual pay is also determined by job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

In addition, Upstart provides employees with target bonuses, equity compensation, and generous benefits packages (including medical, dental, vision, and 401k).

United States | Remote - Anticipated Base Salary Range

$217,400 - $300,900 USD

Upstart is a proud Equal Opportunity Employer. We are dedicated to ensuring that underrepresented classes receive better access to affordable credit, and are just as committed to embracing diversity and inclusion in our hiring practices. We celebrate all cultures, backgrounds, perspectives, and experiences, and know that we can only become better together. 

If you require reasonable accommodation in completing an application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please email [email protected]

https://www.upstart.com/candidate_privacy_policy

Top Skills

AWS
Azure
CloudFormation
Datadog
Docker
GCP
Grafana
Kubernetes
New Relic
Prometheus
Splunk
Terraform

