Seeking an experienced GPU Architect/Designer to define next-generation GPU microarchitecture, optimize performance, and deliver high-efficiency parallel computing solutions.
GPU Architecture / Design (Shader / SIMT)
About the Company
Our client is a well-funded, venture-backed semiconductor startup developing next-generation GPU technology. The company is in a growth stage with significant capital backing and is building a world-class engineering team to design high-performance, scalable GPU architectures from the ground up. This is a rare opportunity to join at a foundational stage and directly shape the direction of cutting-edge silicon.
Job Summary
We are seeking an experienced GPU Architect/Designer with strong expertise in shader core architecture and SIMT (Single Instruction, Multiple Threads) execution models. This role involves defining next-generation GPU microarchitecture, optimizing throughput and efficiency, and driving scalable, high-performance parallel compute solutions.
The ideal candidate will have deep knowledge of GPU shader pipelines, thread scheduling, memory hierarchy, and parallel execution models, with experience translating architectural concepts into high-quality RTL implementations.
Key Responsibilities
Architecture & Microarchitecture
- Define and evolve GPU shader core architecture, including SIMT execution units and pipeline design.
- Design warp/wavefront scheduling, thread dispatch, and execution models.
- Architect SIMT execution pipelines, including ALU pipelines, vector units, and control flow units.
- Define thread divergence handling, reconvergence strategies, and branch control mechanisms.
- Develop scalable shader architectures supporting high thread-level parallelism.
- Collaborate on ISA definitions related to shader and compute workloads.
- Analyze shader workloads and identify performance bottlenecks.
- Optimize GPU execution efficiency across diverse workloads including compute shaders, AI/ML kernels, and high-performance parallel workloads.
- Drive performance-per-watt and area efficiency improvements.
Memory & Interconnect
- Define GPU memory subsystem interactions including register files, shared/local memory, L1/L2 cache hierarchy, and memory coalescing mechanisms.
- Optimize memory access scheduling and bandwidth utilization.
- Collaborate on interconnect and memory fabric architecture.
RTL & Design
- Translate architectural specifications into microarchitecture definitions.
- Implement shader pipeline logic in SystemVerilog.
Verification & Validation
- Define architectural test plans and validation strategies.
- Develop directed tests, constrained-random tests, and performance validation frameworks.
- Analyze simulation and silicon results to drive design improvements.
Required Qualifications
Education: Bachelor's, Master's, or PhD in Computer Engineering, Electrical Engineering, or Computer Science.
10+ years of experience in GPU, CPU, or parallel processor architecture.
Strong experience with:
- SIMT / SIMD architectures
- Shader core design
- Thread scheduling
- Pipeline microarchitecture
- Memory hierarchy design
Proficiency in:
- SystemVerilog or Verilog
- Microarchitecture specification development
- Performance modeling tools
- RTL-level debugging
Deep understanding of:
- Parallel computing models
- GPU execution models
- Pipeline hazard handling
- Synchronization primitives
Compensation: The base pay range for this role is $175,000 – $250,000 USD per year, plus meaningful equity.



