Baseten Logo

Baseten

Senior Software Engineer - Infrastructure

Reposted 18 Days Ago
In-Office or Remote
3 Locations
200K-270K Annually
Senior level
In-Office or Remote
3 Locations
200K-270K Annually
Senior level
Architect and lead the development of ML inference platforms, optimizing infrastructure and Kubernetes deployments, while mentoring junior engineers.
The summary above was generated by AI

ABOUT BASETEN

Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. With our recent $150M Series D funding, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction, we’re scaling our team to meet accelerating customer demand.

THE ROLE

As a Senior Infrastructure Software Engineer at Baseten, you'll architect and lead development of our ML inference platform that powers production AI applications. You'll make key technical decisions for the infrastructure enabling developers to deploy, scale, and monitor ML models with high performance and reliability.

EXAMPLE INITIATIVES

You'll get to work on these types of projects as part of our Infrastructure team:

  • Multi-cloud capacity management

  • Inference on B200 GPUs

  • Multi-node inference

  • Fractional H100 GPUs for efficient model serving

RESPONSIBILITIES

  • Design and architect scalable infrastructure systems for our ML inference platform

  • Lead optimization of Kubernetes deployments for efficient, cost-effective model serving

  • Drive enhancements to our inference orchestration layer for complex model deployments

  • Define monitoring strategies for model performance, latency, and resource utilization

  • Develop advanced solutions for GPU capacity management and throughput optimization

  • Establish infrastructure automation standards to streamline ML deployment workflows

  • Partner with other engineers to translate complex inference requirements into technical solutions

  • Make critical architectural decisions balancing performance with system reliability

  • Lead technical discussions and mentor junior engineers on infrastructure best practices

  • Contribute to long-term technical strategy and infrastructure roadmap

REQUIREMENTS

  • Bachelor's degree or higher in Computer Science or related field

  • 5+ years experience building production infrastructure systems

  • Expert-level proficiency in Go, with Python experience a plus

  • Deep expertise with Kubernetes in production environments

  • Extensive experience with major cloud providers (AWS, GCP) and neo-cloud providers (Crusoe, DigitalOcean, Nebius) a plus.

  • Advanced understanding of distributed systems concepts and performance tuning

  • Proven experience designing observability systems

  • Track record of leading technical initiatives and mentoring engineers

  • Experience with ML/AI workloads and MLOps platforms highly valued

BENEFITS

  • Competitive compensation package.

  • This is a unique opportunity to be part of a rapidly growing startup in one of the most exciting engineering fields of our era.

  • An inclusive and supportive work culture that fosters learning and growth.

  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.


At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

Top Skills

AWS
Distributed Systems
GCP
Go
Kubernetes
Python

Similar Jobs

6 Hours Ago
Remote or Hybrid
2 Locations
Senior level
Senior level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Join the Mobile Infrastructure team to enhance tools and pipelines for mobile apps, ensuring high-quality user experiences and facilitating developer workflows.
Top Skills: ArtifactoryAWSAzureCi/CdDockerFirebaseGCPGithub ActionsGradleNexusPythonRubyXcode
2 Days Ago
Easy Apply
In-Office or Remote
2 Locations
Easy Apply
180K-210K Annually
Senior level
180K-210K Annually
Senior level
Healthtech • Software
As a Senior Software Engineer, you'll design and operate infrastructure services, enhance development capabilities, and improve system reliability while collaborating cross-functionally across teams.
Top Skills: Ci/CdCloud InfrastructureGoInfrastructure-As-CodeJavaScriptKubernetesPythonTerraform
14 Days Ago
Remote
USA
160K-200K Annually
Senior level
160K-200K Annually
Senior level
Artificial Intelligence • Cloud • Hardware • Machine Learning • Other • Software • Infrastructure as a Service (IaaS)
Design and implement automation tools and APIs for managing infrastructure, collaborate with engineering teams, and participate in architectural discussions.
Top Skills: ContainerizationDell HardwareHpc InfrastructureJuniper NetworksLinuxNetworkingOrchestrationPalo Alto FirewallsPythonSonic SwitchesVast Storage Systems

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account