Groq

Senior Staff Software Engineer, High Performance Inference System

Reposted 23 Days Ago

In-Office or Remote

2 Locations

249K-336K

Senior level

In-Office or Remote

2 Locations

249K-336K

Senior level

The role involves designing and implementing low-latency, scalable distributed systems for Groq's real-time inference system, optimizing performance, and ensuring reliability across hardware.

The summary above was generated by AI

About Groq

Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed and scale they need. Headquartered in Silicon Valley, we are on a mission to make high performance AI compute more accessible and affordable. When real-time AI is within reach, anything is possible. Build fast.

Senior Staff Software Engineer – High Performance Inference System

Mission:

Join the team that builds and operates Groq’s real-time, distributed inference system delivering large scale inference for LLMs and next-gen AI applications at ultra-low latency. Your work will optimize for heterogeneous hardware, dynamic global workloads, and extreme performance—all while running code at the edge of physics.

Responsibilities & opportunities in this role:

Distributed Systems Engineering: Design and implement scalable, low-latency runtime systems that coordinate thousands of GroqChips across a software-scheduled interconnect.
Low-Level Optimization: Develop deterministic, hardware-aware abstractions that prioritize execution speed, fault tolerance, and reliability.
Performance & Diagnostics: Build tools and infrastructure to support real-time system observability, diagnostics, and SLO improvements.
Future-Proofing: Evolve Groq’s system stack to support emerging silicon, topologies, and heterogeneous accelerators (e.g., FPGAs).
Cross-Functional Collaboration: Partner with teams across compiler, infra, cloud, hardware, and data centers to align architecture and drive shared progress.

Ideal candidates have/are:

Consistently ship high-impact, production-ready systems code.
Have deep knowledge of computer architecture, operating systems, algorithms, and hardware-software interfaces.
Are fluent in low-level systems languages such as C++ or Rust, and comfortable with hardware-aware programming.
Rigorously profile and optimize for latency, throughput, and resource efficiency—every cycle counts.
Believe in automation and CI/CD best practices—you don’t ship untested code.
Thrive across the stack—from kernel internals to hardware integration to cloud load balancers.
Communicate clearly, make pragmatic technical decisions, and write maintainable code for the long term.
Ensures code stays fast, scales well, and takes ownership of outcomes.

Nice to have:

Operating large-scale distributed systems for real-time, high-traffic services.
Deploying and optimizing ML or HPC workloads in production environments.
Hands-on experience with GPUs, FPGAs, or ASICs in performance-critical systems.
Familiarity with ML frameworks (e.g., PyTorch) or compiler tools (e.g., MLIR).
Experience delivering complex projects in fast-paced, high-impact environments.

Attributes of a Groqster:

Humility - Egos are checked at the door
Collaborative & Team Savvy - We make up the smartest person in the room, together
Growth & Giver Mindset - Learn it all versus know it all, we share knowledge generously
Curious & Innovative - Take a creative approach to projects, problems, and design
Passion, Grit, & Boldness - no limit thinking, fueling informed risk taking

If this sounds like you, we’d love to hear from you!

Compensation: At Groq, a competitive base salary is part of our comprehensive compensation package, which includes equity and benefits. For this role, the base salary range is $248,710 - $336,490, determined by your skills, qualifications, experience and internal benchmarks.

Location: Some roles may require being located near or on our primary sites, as indicated in the job description.

At Groq: Our goal is to hire and promote an exceptional workforce as diverse as the global populations we serve. Groq is an equal opportunity employer committed to diversity, inclusion, and belonging in all aspects of our organization. We value and celebrate diversity in thought, beliefs, talent, expression, and backgrounds. We know that our individual differences make us better.

Groq is an Equal Opportunity Employer that is committed to inclusion and diversity. Qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, disability or protected veteran status. We also take affirmative action to offer employment opportunities to minorities, women, individuals with disabilities, and protected veterans.

Groq is committed to working with qualified individuals with physical or mental disabilities. Applicants who would like to contact us regarding the accessibility of our website or who need special assistance or a reasonable accommodation for any part of the application or hiring process may contact us at: [email protected]. This contact information is for accommodation requests only. Evaluation of requests for reasonable accommodations will be determined on a case-by-case basis.

Top Skills

C++

Mlir

PyTorch

Rust

Similar Jobs

Square

Marketing Strategy Lead

2 Hours Ago

Remote or Hybrid

136K-245K Annually

Senior level

136K-245K Annually

Senior level

eCommerce • Fintech • Hardware • Payments • Software • Financial Services

The Marketing Strategy Lead drives marketing investment strategy and operational excellence, partnering with teams to optimize performance and measure impact.

Top Skills: Data Analytics ToolsFinancial Modeling

Square

Program Manager

2 Hours Ago

Remote or Hybrid

103K-194K Annually

Senior level

103K-194K Annually

Senior level

eCommerce • Fintech • Hardware • Payments • Software • Financial Services

Manage and oversee Block's Global Sanctions Program by assessing risks, conducting investigations, and ensuring compliance with sanctions regulations.

Top Skills: Compliance ToolsData Analysis

Applied Systems

Technical Product Manager

8 Hours Ago

Remote or Hybrid

Mid level

Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics

Responsible for leading the roadmap and development of an API-driven distribution platform, collaborating with stakeholders, and enhancing product adoption.

Top Skills: APIsConfluenceJIRAPendoReadmeSwagger

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus