We build and operate high-performance GPU clusters so the most ambitious teams can move fast, stay focused, and scale without friction. Our cluster power top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more.
Our team is highly motivated, and focused on providing a world class supercomputing experience. We put our customers first in everything we do, working hard to not just win the sale, but to win repeated business and customer referrals.
We hold ourselves and each other to high standards. We expect you to care deeply about the work you do, the products you build, and the experience our customers have in every interaction with us.
You must work hard, take ownership from inception to delivery, and approach every problem with an open mind and a positive attitude. We value effectiveness, competence, and a growth mindset.
About the RoleWe are seeking a Network Automation Engineer to join our Networking team. You will be responsible for designing, implementing, and maintaining automated systems that provision, configure, and manage our network infrastructure at scale. This role is critical to ensuring our GPU supercomputers deliver world-class performance and reliability to our AI customers.
You will build systems and automations frameworks thriving for operational excellence and meeting the demands of our customer’s AI training and inference workloads.
FocusDesign and implement network automation workflows to manage thousands of network devices using infrastructure-as-code principles and configuration management tools
Build self-service automation platforms for network operations including provisioning (e.g. ZTP), deployments, monitoring, remediation, and software push systems
Develop automated monitoring and remediation systems that detect and resolve network issues before they impact customers
Collaborate with the infrastructure team to identify routine tasks and implement automation-first solutions, contributing written reports on improvement opportunities
Design and implement a network device lifecycle controller to automate remediation workflows
Ability to decompose complex network architectures into modular components and build automation from the ground up
7+ years of experience in network engineering and automation.
Experience with network hardware (e.g., Mellanox, Arista, etc.) and Network OSes (Cumulus, SoNIC, etc.); software-defined networking experience is a plus.
Demonstrated knowledge of TCP, IPv4/6, Routing Protocols (one or more of BGP, MPLS, or similar), and related network services (e.g. DHCP and DNS)
Ability to understand how to configure network overlays to accommodate a multi-tenant environment (such as via BGP+EVPN)
Experience with encapsulation protocols (EVPN/VXLAN, Geneve).
Experience with Kubernetes CNIs (Calico and/or Cilium).
Strong programming skills in Python or similar languages with experience building production automation systems
Additional experience in developing automation tools for network operations in a DevOps environment
Experience with network lab provisioning tooling like Containerlab.
Strong knowledge of automation frameworks (e.g., Ansible, Terraform).
Excellent communication and collaboration abilities.
Experience with configuration management tools like Salt, Puppet and IPAM solutions like NetBox
Experience with GPU cluster networking and high-performance computing environments
Familiarity with Kubernetes networking, container orchestration, and cloud-native technologies
Knowledge of monitoring stacks (Prometheus, Grafana) and observability best practices
Contributions to open-source network automation projects
Competitive total compensation package (cash + equity).
Retirement or pension plan, in line with local norms.
Health, dental, and vision insurance.
Generous PTO policy, in line with local norms.
Fluidstack is remote first, but has offices in key hubs. For all other locations, we provide access to WeWork.
Top Skills
Similar Jobs
What you need to know about the Charlotte Tech Scene
Key Facts About Charlotte Tech
- Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
- Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
- Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
- Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
- Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus