Groq Logo

Groq

Senior Staff Software Engineer, High Performance Inference System

Reposted 23 Days Ago
In-Office or Remote
2 Locations
249K-336K
Senior level
In-Office or Remote
2 Locations
249K-336K
Senior level
The role involves designing and implementing low-latency, scalable distributed systems for Groq's real-time inference system, optimizing performance, and ensuring reliability across hardware.
The summary above was generated by AI

About Groq

Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed and scale they need. Headquartered in Silicon Valley, we are on a mission to make high performance AI compute more accessible and affordable. When real-time AI is within reach, anything is possible. Build fast.

Senior Staff Software Engineer – High Performance Inference System

Mission: 

Join the team that builds and operates Groq’s real-time, distributed inference system delivering large scale inference for LLMs and next-gen AI applications at ultra-low latency. Your work will optimize for heterogeneous hardware, dynamic global workloads, and extreme performance—all while running code at the edge of physics.

Responsibilities & opportunities in this role:

  • Distributed Systems Engineering: Design and implement scalable, low-latency runtime systems that coordinate thousands of GroqChips across a software-scheduled interconnect.
  • Low-Level Optimization: Develop deterministic, hardware-aware abstractions that prioritize execution speed, fault tolerance, and reliability.
  • Performance & Diagnostics: Build tools and infrastructure to support real-time system observability, diagnostics, and SLO improvements.
  • Future-Proofing: Evolve Groq’s system stack to support emerging silicon, topologies, and heterogeneous accelerators (e.g., FPGAs).
  • Cross-Functional Collaboration: Partner with teams across compiler, infra, cloud, hardware, and data centers to align architecture and drive shared progress.

Ideal candidates have/are:

  • Consistently ship high-impact, production-ready systems code.
  • Have deep knowledge of computer architecture, operating systems, algorithms, and hardware-software interfaces.
  • Are fluent in low-level systems languages such as C++ or Rust, and comfortable with hardware-aware programming.
  • Rigorously profile and optimize for latency, throughput, and resource efficiency—every cycle counts.
  • Believe in automation and CI/CD best practices—you don’t ship untested code.
  • Thrive across the stack—from kernel internals to hardware integration to cloud load balancers.
  • Communicate clearly, make pragmatic technical decisions, and write maintainable code for the long term.
  • Ensures code stays fast, scales well, and takes ownership of outcomes.
Nice to have:
  • Operating large-scale distributed systems for real-time, high-traffic services.
  • Deploying and optimizing ML or HPC workloads in production environments.
  • Hands-on experience with GPUs, FPGAs, or ASICs in performance-critical systems.
  • Familiarity with ML frameworks (e.g., PyTorch) or compiler tools (e.g., MLIR).
  • Experience delivering complex projects in fast-paced, high-impact environments.

Attributes of a Groqster:

  • Humility - Egos are checked at the door
  • Collaborative & Team Savvy - We make up the smartest person in the room, together
  • Growth & Giver Mindset - Learn it all versus know it all, we share knowledge generously
  • Curious & Innovative - Take a creative approach to projects, problems, and design
  • Passion, Grit, & Boldness - no limit thinking, fueling informed risk taking

If this sounds like you, we’d love to hear from you!

Compensation: At Groq, a competitive base salary is part of our comprehensive compensation package, which includes equity and benefits. For this role, the base salary range is $248,710 - $336,490, determined by your skills, qualifications, experience and internal benchmarks.

Location: Some roles may require being located near or on our primary sites, as indicated in the job description.  

At Groq: Our goal is to hire and promote an exceptional workforce as diverse as the global populations we serve. Groq is an equal opportunity employer committed to diversity, inclusion, and belonging in all aspects of our organization. We value and celebrate diversity in thought, beliefs, talent, expression, and backgrounds. We know that our individual differences make us better.


Groq is an Equal Opportunity Employer that is committed to inclusion and diversity. Qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, disability or protected veteran status.  We also take affirmative action to offer employment opportunities to minorities, women, individuals with disabilities, and protected veterans.

Groq is committed to working with qualified individuals with physical or mental disabilities. Applicants who would like to contact us regarding the accessibility of our website or who need special assistance or a reasonable accommodation for any part of the application or hiring process may contact us at:  [email protected].  This contact information is for accommodation requests only.  Evaluation of requests for reasonable accommodations will be determined on a case-by-case basis.

Top Skills

C++
Mlir
PyTorch
Rust

Similar Jobs

2 Hours Ago
Remote or Hybrid
8 Locations
136K-245K Annually
Senior level
136K-245K Annually
Senior level
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
The Marketing Strategy Lead drives marketing investment strategy and operational excellence, partnering with teams to optimize performance and measure impact.
Top Skills: Data Analytics ToolsFinancial Modeling
2 Hours Ago
Remote or Hybrid
8 Locations
103K-194K Annually
Senior level
103K-194K Annually
Senior level
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
Manage and oversee Block's Global Sanctions Program by assessing risks, conducting investigations, and ensuring compliance with sanctions regulations.
Top Skills: Compliance ToolsData Analysis
8 Hours Ago
Remote or Hybrid
2 Locations
Mid level
Mid level
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
Responsible for leading the roadmap and development of an API-driven distribution platform, collaborating with stakeholders, and enhancing product adoption.
Top Skills: APIsConfluenceJIRAPendoReadmeSwagger

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account