Fluidstack

Head of Networking

Posted 2 Days Ago

Be an Early Applicant

In-Office or Remote

3 Locations

Expert/Leader

In-Office or Remote

3 Locations

Expert/Leader

Lead architecture, design, and operations of networking services for AI infrastructure. Build and mentor a networking team, focusing on automation, performance, and reliability.

The summary above was generated by AI

About Fluidstack

We build and operate high-performance GPU clusters so the most ambitious teams can move fast, stay focused, and scale without friction. Our clusters power top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more.

Our team is highly motivated, and focused on providing a world class supercomputing experience. We put our customers first in everything we do, working hard to not just win the sale, but to win repeated business and customer referrals.

We hold ourselves and each other to high standards. We expect you to care deeply about the work you do, the products you build, and the experience our customers have in every interaction with us.

You must work hard, take ownership from inception to delivery, and approach every problem with an open mind and a positive attitude. We value effectiveness, competence, and a growth mindset.

About the Role

As Head of Networking, you will lead the architecture, design, and operations of our network services that power our AI infrastructure platform. In this role, you will architect networks that move packets for frontier AI models while ensuring maximum reliability and performance through extensive automation. You will build a team that spans

You will build and lead a world-class networking team ranging from junior network engineers eager to learn high-performance computing, to senior architects who have scaled networks at hyperscalers, to specialized engineers with deep expertise in RDMA/InfiniBand for AI workloads. Your team will span network operations, architecture, automation engineering, and performance optimization roles. You'll be responsible for hiring, mentoring, and developing this team while establishing a culture of technical excellence and continuous learning

Focus

Build networks that scale beyond hundreds of thousands of GPUs.
Collaborate with compute, storage, security, and data center teams to deliver integrated infrastructure solutions
Build and lead a team of network engineers and architects focused on performance, reliability, and automation.
Automate everything. Manual processes kill velocity. Build systems that configure themselves, heal themselves, and optimize themselves. Drive automation initiatives across service deployment, provisioning, and lifecycle management
Design scalable network architectures supporting clusters from 2,000 to 200,000 GPUs
Optimize traffic patterns for AI/ML training workloads and high-performance computing
Lead the design and implementation of scalable, high-performance network architectures supporting GPU clusters and AI workloads
Establish comprehensive monitoring, alerting, and incident response procedures. Create remediation systems that detect and resolve issues before customer impact
Lead root cause analysis and implement preventive measures for network incidents
Ensure network reliability, security, and performance meet the demanding requirements of AI supercomputing workloads
Ensure compliance with data sovereignty and regulatory requirements

About You

10+ years of experience designing and operating large-scale network infrastructure
5+ years in leadership roles at cloud providers, hyperscalers, or technology companies
Deep expertise in software-defined networking, routing protocols, and distributed network design
Proven track record scaling networks for high-throughput, low-latency workloads
Experience with AI/ML infrastructure and GPU cluster networking (RoCE / InfiniBand)
Deep understanding of internet routing, switching, peering, and distributed network design.
Expert knowledge of routing protocols (BGP, EVPN), TCP/IP, and network services (DHCP, DNS)
Proven track record of designing and operating large-scale, high-performance networks in cloud or datacenter environments
Strong knowledge of automation frameworks (e.g., Ansible, Terraform) and infrastructure-as-code principles
Experience offloading services into smart NICs and working with hardware acceleration technologies
Excellent communication skills with ability to influence technical strategy across organizations
Monitoring stacks (Prometheus, Grafana) and observability best practices

Nice to haves

Contributions to open-source networking projects
Experience with network source of truth platforms (NetBox, Nautobot, ..) and integrating them with automation workflows
Familiarity with Kubernetes networking, overlay networks, and container networking solutions

Benefits

Competitive total compensation package (cash + equity).
Retirement or pension plan, in line with local norms.
Health, dental, and vision insurance.
Generous PTO policy, in line with local norms.

Fluidstack is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Fluidstack will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Top Skills

Ai/Ml Infrastructure

Ansible

Bgp

Dhcp

Distributed Network Design

Dns

Evpn

Gpu Clusters

Grafana

Infiniband

Kubernetes

Prometheus

Roce

Routing Protocols

Software-Defined Networking

Tcp/Ip

Terraform

Similar Jobs

monday.com

Account Executive

59 Minutes Ago

Remote or Hybrid

New York, NY, USA

99K-150K Annually

Mid level

99K-150K Annually

Mid level

Productivity • Sales • Software

As a CRM Account Executive, you will drive CRM sales, manage the full sales cycle, and develop strategies to enhance the product within a growing sales environment.

Top Skills: CRM

CDW

Business Development Manager

59 Minutes Ago

Remote or Hybrid

115K-140K Annually

Senior level

115K-140K Annually

Senior level

Artificial Intelligence • eCommerce • Information Technology • Internet of Things • Automation

Responsible for generating new business opportunities for AI Factory solutions by engaging with clients and account teams, articulating value propositions, and leading sales motions.

Top Skills: Ai InfrastructureAi/Ml ToolsCpuDataGpuKubernetesMlopsSlurmStorage

Inspira Financial

Senior Sales Executive

An Hour Ago

In-Office or Remote

Chicago, IL, USA

100K-225K Annually

Senior level

100K-225K Annually

Senior level

Fintech

The Sr. Sales Executive is responsible for business development in small group markets and must build relationships with brokers and sponsors, manage sales processes, and meet sales objectives.

Top Skills: Salesforce CRM

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus