Upgrade, Inc. Logo

Upgrade, Inc.

Principal DevOps Engineer, Infrastructure Performance

Posted 5 Days Ago
Easy Apply
Remote or Hybrid
Hiring Remotely in United States
Senior level
Easy Apply
Remote or Hybrid
Hiring Remotely in United States
Senior level
Design and build a cloud-based observability platform, troubleshoot performance issues, improve monitoring tools, and scale infrastructure. Lead operational improvements in a collaborative environment.
The summary above was generated by AI

Upgrade helps customers move in the right direction with affordable and responsible financial products. Since 2017, we’ve helped over 7 million customers access over $40 billion in consumer credit. With a relentless focus on improving our customers' financial well-being, we build products that put more money in their pocket and support their journey toward a better financial future. We’re backed by some of the most prominent technology investors and were most recently valued at $6.3B.

We’re consistently recognized for our collaborative and inclusive culture. Most recently, we were named one of the World’s Top Fintech Companies by CNBC, Best Places to Work by Built In, Best Places to Work by the San Francisco Business Times, America’s Greatest Workplaces by Newsweek, Best Startup Employer by Forbes, and Healthiest Employers by Phoenix Business Journal. 

We’re looking for new team members who get excited about designing and delivering new and better products. Come join us and help build a better financial future for millions of people.


What You'll Do:
  • Build a resilient, secure, and efficient cloud based observability platform.
  • Monitor and troubleshoot platform issues, including finding solutions to reduce known issues.
  • Build and scale the observability infrastructure to meet rapidly increasing demand.
  • Develop and improve operational practices and procedures.
  • Sample projects:
    • Improve database monitoring: develop custom prometheus exporters in Go for use cases that go beyond what is possible with SQL exporter. Create Grafana dashboards and alerts for these new metrics.
    • MCP servers for observability: deploy MCP server to integrate our observability stack with our LLM tools.
What We Look For:
  • 8+ years of relevant production-level experience.
  • Experience with VictoriaMetrics.
  • Experience with Sumologic.
  • Experience with tracing tools (e.g. OpenTelemetry, Honeycomb, Tempo).
  • Experience with profiling tools (e.g. Pyroscope)
  • Knowledge of cloud monitoring, logging and cost management tools.
  • Programming/scripting knowledge (Go, Java, or Python) and understanding of JVM concepts.
  • In-depth knowledge of AWS services, hands-on experience in AWS provisioning using terraform.
  • Experience with containerized applications and Kubernetes / EKS. Creating and updating / maintaining Helm charts.
  • Understanding of microservices architecture and debugging/investigation techniques.
  • Strong understanding of systems, networking and troubleshooting techniques.
  • Experience in automated build pipeline, continuous integration and continuous deployment.
  • Ability to operate in an agile, entrepreneurial start-up environment.
  • Experience with running Linux in production.
Our Tech Stack:
  • Monitoring: VictoriaMetrics, Grafana, Prometheus, OpenTelemetry, Honeycomb, Sumologic.
  • Infrastructure as code: Terraform.
  • CD: GitOps, ArgoCD, ArgoRollouts.
  • CI: Tekton.
  • Scripting: Bash.
  • Programming: Golang (preferred).
  • AWS: EKS, Cloudwatch, S3, DynamodDB, RDS, SNS, SQS, Lambda.

What We Offer You: 

  • Competitive salary and stock option plan.
  • 100% paid coverage of medical, dental and vision insurance.
  • Flexible PTO.
  • Learning stipend for personal growth and development. 
  • Paid parental leave.
  • Health & wellness initiatives. 

#LI-Remote  #BI-Remote

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Upgrade does not accept unsolicited resumes from staffing agencies, search firms, or any third parties. Any resume submitted to any employee of Upgrade without a prior written agreement in place will be considered the property of Upgrade, and Upgrade will not be obligated to pay any referral or placement fee. Agencies must obtain advance written approval from Upgrade's Talent Acquisition department to submit resumes and only in conjunction with a valid, fully executed agreement. English is required for all positions, as it involves interacting with staff at Upgrade's offices worldwide.

Top Skills

Argocd
Argorollouts
AWS
Bash
Eks
Gitops
Go
Grafana
Honeycomb
Java
Kubernetes
Opentelemetry
Prometheus
Pyroscope
Python
Sumologic
Tekton
Tempo
Terraform
Victoriametrics

Similar Jobs at Upgrade, Inc.

2 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
Senior level
Senior level
Automotive • Fintech • Hardware • Payments • Travel • Financial Services
The Director of Financial Institution Sales will build a banking funding channel, manage partnerships, and present Upgrade's credit products to banks and credit unions in the Northeast region.
Top Skills: Salesforce CRM
2 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
Senior level
Senior level
Automotive • Fintech • Hardware • Payments • Travel • Financial Services
As a Senior QA Automation Engineer, you will develop test automation, analyze logs, troubleshoot production issues, and implement testing strategies in a fast-paced Agile environment.
Top Skills: ArgocdDockerGatlingGitJavaJenkinsKubernetesLinuxMavenOkhttpPlaywrightRest AssuredSeleniumSQLSumologicTestng
6 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
Mid level
Mid level
Automotive • Fintech • Hardware • Payments • Travel • Financial Services
As a Senior Model Risk Analyst, you will validate statistical/machine learning models, improve model performance, and research new tools while collaborating across teams.
Top Skills: PythonSQL

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account