unstructured.io Logo

unstructured.io

Site Reliability Engineer - Public Sector

Posted 2 Days Ago
Remote
Hiring Remotely in USA
Senior level
Remote
Hiring Remotely in USA
Senior level
The Public Sector Site Reliability Engineer will manage cloud infrastructure, ensure compliance with regulations, implement observability tools, lead incident response, and enhance automation in federal environments.
The summary above was generated by AI
At Unstructured, we’re building the backbone of generative AI—helping federal agencies transform PDFs, HTML, Word docs, images, and more into secure, high-performance data pipelines that scale. Our tools are trusted by nearly half of the Fortune 500 and downloaded more than 38 million times in the open-source community.

We’re expanding our federal/public sector practice, and we’re hiring a Public Sector Site Reliability Engineer (SRE) to help design, scale, and secure the systems that power the next generation of AI-driven workloads for government.

What You’ll Own & Drive

🔐 Mission-Grade Reliability & Security
Design, build, and manage secure, highly available, and scalable cloud infrastructure for federal environments.
Ensure compliance with FedRAMP, FISMA, and other relevant security and regulatory frameworks.
Develop IaC with Terraform, Pulumi, or similar for repeatable, compliant deployments.
Build and maintain automated CI/CD pipelines that move fast without sacrificing security or stability.

📊 Full Observability in Sensitive Environments
Implement/maintain monitoring, logging, and alerting (Prometheus, Grafana, Datadog, Elastic).
Enable real-time visibility and rapid response for mission-critical workloads.
Partner with engineering and program teams for high-assurance rollouts.
Lead capacity planning, deployment strategies, and resilient architecture design for federal networks.

🔥 Incident Response & Continuous Improvement
Lead incident response and root-cause analysis with a blameless, systems-thinking approach.
Drive postmortems and reliability improvements.
Enhance developer experience with secure automation and streamlined workflows.
Help teams iterate quickly while maintaining compliance and operational excellence.

What You Bring
5–9 years managing software deployed to US government or Department of Defense (DOD) networks
Active SECRET clearance required; TS/SCI strongly preferred
Expertise with AWS GovCloud and/or Azure Government.
Deep experience with Kubernetes, Docker, and container orchestration at scale.
Strong Linux systems and networking fundamentals.
Scripting/automation: Python, Bash, or Go. IaC: Terraform, Pulumi, Ansible (or similar).
Strong grasp of monitoring, logging, and observability best practices.
Travel required up to 20%

Bonus Points
ML infrastructure or real-time data pipelines experience.
Serverless or event-driven architectures.
Contributions to open-source DevOps/SRE projects.
Hands-on work with US government security/compliance in cloud-native settings.
Unstructured values service and encourages veterans of the US military and civilian agencies to apply to this role.

Why You’ll Love It Here
Mission Impact: Power critical AI workloads in the public sector.
Big Technical Challenges: High-assurance problems at the edge of AI, data, and cloud.
Elite Team: Sharp, low-ego engineers who value execution and learning.
Innovation + Security: Build cutting-edge systems with rigorous reliability for federal use cases.

Top Skills

Ansible
Aws Govcloud
Azure Government
Bash
Datadog
Docker
Elastic
Go
Grafana
Kubernetes
Prometheus
Pulumi
Python
Terraform

Similar Jobs

14 Days Ago
Remote or Hybrid
New York, NY, USA
175K-200K Annually
Senior level
175K-200K Annually
Senior level
AdTech • Big Data • Digital Media • Software
The Principal Site Reliability Engineer will lead technical initiatives, enhance operational reliability, and mentor teams while focusing on automation and infrastructure improvements.
Top Skills: AnsibleArgo CdAws EcrCi/CdGitGithub ActionsJenkinsKubernetesNexusPuppetTerraform
9 Days Ago
Remote
United States
Senior level
Senior level
Fintech • Software
The Senior Site Reliability Engineer will enhance infrastructure reliability and performance, focusing on Cloudflare integration, IaC, CI/CD, and operational procedures improvement.
Top Skills: AksAnsibleAWSAzureBashBitbucketCloudflareDockerEcsEksGithub ActionsJenkinsMetabaseMongoDBMssqlMySQLPostgresPowershellPythonShellSvnTerraform
9 Days Ago
Remote
United States
Senior level
Senior level
Fintech • Software
The Site Reliability Engineer will manage infrastructure reliability, support deployments, optimize operational procedures, and lead Cloudflare integration.
Top Skills: AksAnsibleAWSAzureBashCloudflareDockerEcsEksGithub ActionsJenkinsMetabaseMongoDBMssqlMySQLPostgresPowershellPythonShellTerraform

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account