Graphcore

Staff AI Performance Engineer

Hybrid
Austin, TX
Mid level
The Staff AI Performance Engineer will optimize performance across ARM-based architectures and distributed systems, analyzing AI workloads and collaborating to enhance system efficiency.
About us

Graphcore is one of the world’s leading innovators in Artificial Intelligence compute.
It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.
As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.
Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.

Job Summary

Graphcore’s AI/ML training and inference infrastructure is rapidly scaling to meet the growing demands of AI workloads across mobile, edge, and datacenter environments. This role focuses on optimizing performance across ARM-based architectures and large-scale distributed systems, ensuring efficiency, scalability, and reliability across the full hardware-software stack.

The Team

The System Engineering Performance team architects and optimizes high-performance infrastructure for large-scale datacenter deployments. The team works across hardware, software, networking, and system architecture to deliver cutting-edge AI solutions and ensure optimal system performance at scale.

Responsibilities and Duties
  • Analyze ML models’ compute and memory requirements using roofline analysis and simulations
  • Collaborate across hardware and software teams to optimize large-scale AI workloads
  • Benchmark, monitor, and troubleshoot system performance across distributed systems
  • Optimize communication stacks including MPI, NCCL, UCX, RDMA, and networking fabrics
  • Profile and optimize AI workloads, focusing on performance bottlenecks
  • Develop high-quality, ARM-compatible code and documentation

Candidate Profile

Essential:

  • BS/MS in Computer Science, Electrical Engineering, or related field
  • Experience with distributed systems and communication libraries (MPI, NCCL, UCX, libfabric)
  • Strong programming skills in C++ and Python
  • Experience profiling and optimizing HPC or AI/ML workloads
  • Familiarity with ML benchmarks such as MLPerf

Desirable:

  • Experience with GPUs or accelerated computing architectures
  • Knowledge of HPC networking and interconnect technologies (InfiniBand, RoCE)
  • Familiarity with ML frameworks such as PyTorch or TensorFlow
  • Understanding of ARM architectures and toolchains
  • Strong debugging, profiling, and performance optimization skills
