Kentik is the network intelligence platform for modern infrastructure teams. Unlike traditional monitoring and observability tools, we demystify complex network operations, enabling organizations to deliver applications and innovation at scale. Built by network experts to make critical insight accessible to every engineer, Kentik is the real-time source of truth that understands every network in context, from data center to cloud to the internet. This single platform unifies and correlates cloud, device, flow, and synthetic data to turn telemetry into action. Market leaders like Akamai, Booking.com, Dropbox, and Zoom rely on Kentik to run, manage, and optimize their networks.
What we do
Our platform ingests trillions of records and serves hundreds of thousands of queries for our users each day. You will gain experience building a production-quality, high-performance server-and-client SaaS application that handles uniquely high volumes of data.
We have built a team of world-class engineers, network experts, and technology thought leaders in a remote-friendly culture from day one. While prior experience in a remote environment is not required, we highly value strong collaboration and communication skills, as well as a high level of independence and autonomy.
*This is a remote role. However, due to the location of the teams we are hiring for, working during US time zone hours is a requirement for this position.
What you'll do
Kentik is looking for a Senior-level Site Reliability Engineer (Cloud) to join our Product Engineering team. This person will help build and maintain our Synthetics and Cloud product lines. These products have multiple applications deployed in various cloud providers all over the world, and we manage these cloud applications using observability tooling, automated build processes, and adherence to configuration-as-code best practices.
We’re looking for an experienced engineer who will work with engineering teams across the company to help grow our hardware and software infrastructure. We operate a well-organized, well-instrumented platform, and offer enormous opportunities for employee growth.
- Ensure our real-time, scalable infrastructure is set up for growth and working efficiently. Our infrastructure runs on our own hardware across multiple locations, as well as on all major cloud vendors
- Work on tools and processes to better monitor our platform and ensure its stability through our rapid growth
- Deep-dive into diverse topics, from firewalls and IP routing, to database replication strategies or automating build processes
- Collaborate with engineering and infrastructure teams on finding solutions from an operational perspective
- Assist with expanding our cloud deployments across the major cloud providers
- Contribute code, code reviews and tools or patches to all kinds of existing code
- Write design documents or collaborate on colleagues’ docs to introduce new features or changes into our infrastructure
- Provide valuable feedback on team goals, projects, and processes. We believe in continuously improving our team
Requirements:
- 5+ years of experience in cloud-based Systems Administration, IT, and/or SRE-related projects
- Strong experience with public cloud, container and orchestration technologies including AWS, GCP, Azure, Kubernetes, and Docker
- Solid programming and automation skills (Bash, Python, Go) including experience working with configuration management (infrastructure as code) platforms such as Terraform, Ansible, and Puppet
- Experience working with *nix system command line (e.g. ssh, grep, awk)
- Detailed understanding of major internet protocols (TCP/IP, DNS, HTTP, TLS)
- Network administration experience: concepts such as routing, firewalls (iptables), and peering should sound familiar
- A passion for documenting code, processes, and infrastructure in runbooks and wikis
- Experience with metrics and monitoring solutions such as Grafana, Prometheus, Telegraf, and OpenTelemetry
- Experience creating and managing tickets with third-party vendors and owning cloud vendor partner relationships
Nice to haves:
- Familiarity with Kubernetes orchestration and automation tools such as Helm, Kustomize, ArgoCD, and Flux
- Experience optimizing CI/CD pipelines built on tools such as GitHub Actions, Earthly, and Jenkins
- Exposure to PagerDuty integrations
- Knowledge of SRE, DevOps and GitOps practices and principles
What we offer
Kentik is a fully remote company that operates globally. We seek professionals who will help us thrive as an organization, and in turn we aim to broaden and enhance your career. We’re very thorough in the interview process to understand your skills and how they will relate to your successful growth here at Kentik. Our compensation philosophy encompasses a fair program for all in order to attract, engage, and retain talented individuals who will drive our business and wow our customers.
The compensation range for this position is: $159,000 - $215,000. This range reflects the low and high end of the U.S. compensation range Kentik reasonably and generally expects to pay the hired candidate in this role. The actual compensation offered may be lower or higher than the stated range depending on various factors, including but not limited to:
- Experience with the skill sets required for success
- Demonstrated competencies and potential
- A geographic market-based approach
In addition to a great career opportunity, Kentik offers stellar benefits for our employees, which include:
- 100% of premiums are paid by the company for health, vision, and dental coverage for you and your dependents
- Additionally, an annual Health Reimbursement Account (HRA) of $3,000 for an individual or $4,500 for a family
- Paid family & medical leave
- Open PTO, a quarterly Wellness Day, and a minimum of 10 paid holidays
- 401(k) retirement account
- Home office reimbursement
- Stock options
Note: Benefits are as listed for all US full-time employees. For compensation, international applicants will be treated equitably in relation to the laws applicable within the countries in which we operate.
The true meaning of Kentik is visibility. We’re committed to making sure everyone feels empowered to use their voice, has a sense of belonging, and is represented at Kentik.
We don’t look for individuals who fit the culture, but those who will continue to add to the culture.
We encourage everyone to apply, especially those individuals who are underrepresented in the industry: people of color, LGBTQI+ community, women, individuals with disabilities (both seen and unseen), veterans, and people of any age or family status.
Kentik is committed to creating an inclusive interview process. If you require a reasonable accommodation during the application or interview process, please reach out to [email protected].
Come as you are!
You will be working at a fast-growing, well-funded startup alongside industry thought leaders and network aficionados as we build the future of observability and set the high bar for how network operations and digital businesses should run. With a competitive salary and amazing benefits on top of the meaningful and challenging projects you’ll take on, we’re sure you’ll enjoy joining the Kentik team.