Shield AI

Cloud SRE/DevOps Software Engineer (SD/DC/Remote) (R3138)

Posted 10 Days Ago
Remote
Hiring Remotely in United States
Senior level
Optimize cloud deployments, manage internal Hivemind instances, enable scalability, and support customer deployments as a Cloud SRE/DevOps Engineer.
The summary above was generated by AI

Founded in 2015, Shield AI is a venture-backed defense technology company with the mission of protecting service members and civilians with intelligent, autonomous systems. Its products include Hivemind Enterprise—EdgeOS, Pilot, Commander, and Forge—as well as V-BAT and Sentient Vision Systems (wide-area motion imaging software). With offices in San Diego, Dallas, Washington, D.C., Abu Dhabi (UAE), Kyiv (Ukraine), and Melbourne (Australia), Shield AI’s technology actively supports U.S. and allied operations worldwide.  For more information, visit www.shield.ai. Follow Shield AI on LinkedIn, X and Instagram.    


As a Cloud SRE/DevOps Engineer on the Forge team, you will be responsible for optimizing Forge’s cloud deployments and owning the processes that enable customers to deploy their own Forge instances. You will manage Shield AI’s internal Hivemind instances, working closely with the software operations and engineering teams to ensure Forge can scale for simulation, testing, and bursts of use. You will also enable seamless upgrades, canary deployments, and system robustness. Additionally, you’ll serve as the primary point of contact for the customer engagement team, providing expert guidance on deploying Forge in customer environments. 

What You'll Do:

  • Optimize cloud deployments of Forge to ensure scalability, reliability, and cost efficiency. 
  • Design and document processes for external customers to deploy Forge instances using the SDK in on-premises or hybrid environments. 
  • Manage and maintain internal Hivemind instances, ensuring their ability to handle large-scale simulation and testing workloads. 
  • Collaborate with the software operations team to enhance Forge’s ability to scale dynamically, accommodate bursts of use, and support continuous upgrades with minimal disruption. 
  • Develop tools and processes for canary deployments, ensuring smooth rollouts of new features and updates. 
  • Serve as the primary technical consultant for the customer engagement team, providing expertise on deploying and managing Forge in external environments. 
  • Create and maintain detailed, user-friendly documentation and tutorials for deployment processes, catering to both internal teams and external customers. 
  • Monitor, troubleshoot, and resolve issues related to Forge deployments, ensuring high availability and performance. 

Required Qualifications:

  • Typically requires a minimum of 5 - 15 years of related experience with a Bachelor’s degree
  • 2 - 10+ years of experience in DevOps, Site Reliability Engineering, or cloud infrastructure roles. 
  • Expertise in cloud platforms such as AWS, Azure, or GCP, including deploying and managing scalable, distributed systems. 
  • Strong experience with Kubernetes and containerization. 
  • Experience creating Helm charts. 
  • Solid understanding of infrastructure-as-code tools like Terraform, CloudFormation, or similar. 
  • Proficiency in scripting and programming languages such as Python, Golang, or Bash. 
  • Demonstrated experience optimizing CI/CD pipelines, implementing canary deployments, or using tools like ArgoCD and FluxCD.
  • Familiarity with networking concepts and protocols, as well as system monitoring tools (e.g., Prometheus, Grafana). 
  • Experience deploying and configuring databases such as Postgres.
  • Excellent technical writing skills, with a proven ability to create clear, comprehensive documentation and tutorials. 
  • BS/MS in Computer Science, Engineering, or equivalent practical experience. 
  • Ability to work cross-functionally and communicate effectively with engineering, operations, and customer-facing teams. 

Preferred Qualifications:

  • Experience with secure software deployments in regulated industries such as aerospace, defense, or finance. 
  • Systems software development experience using programming languages like C++, Rust or Golang. 
  • Experience building software development kits or productized tools for deploying cloud systems. 
  • Knowledge of hybrid and on-premises deployment strategies and challenges. 
  • Hands-on experience with database performance optimization and scaling strategies. 
  • Familiarity with configuration management tools like Ansible, Chef, or Puppet. 
  • Experience building robust monitoring and alerting systems for mission-critical applications. 
  • Background in managing high-throughput simulation or testing environments. 
  • Experience optimizing databases


Full-time regular employee offer package:

Pay within range listed + Bonus + Benefits + Equity


Temporary employee offer package:

Pay within range listed above + temporary benefits package (applicable after 60 days of employment)


Salary compensation is influenced by a wide array of factors including but not limited to skill set, level of experience, licenses and certifications, and specific work location. All offers are contingent on a cleared background and possible reference check. Military fellows and part-time employees are not eligible for benefits. Please speak to your talent acquisition representative for more information.



Shield AI is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender identity or Veteran status. If you have a disability or special need that requires accommodation, please let us know. 

Top Skills

ArgoCD
AWS
Azure
Bash
CloudFormation
FluxCD
GCP
Go
Grafana
Helm
Kubernetes
Postgres
Prometheus
Python
Terraform
