Plume Design, Inc Logo

Plume Design, Inc

Manager, Site Reliability Engineering

Posted 12 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in United States
Senior level
Remote
Hiring Remotely in United States
Senior level
Lead the Site Reliability Engineering team by mentoring engineers, setting strategic directions, improving system reliability, and driving SRE principles across the organization.
The summary above was generated by AI

Life at Plume

At Plume, we believe that technology isn't about moving faster, it's about making life’s moments better. Which is why we’ve built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learning to create the future of connected spaces—and human experiences—at massive scale.

We now deliver services to over 60 million locations globally and have managed over 3 billion devices on our platform. We’re expanding rapidly, pioneering a new category, and we achieved our Series F funding in just four years. Our customers include many of the world's largest Internet Service Providers (ISPs) who look to Plume to help them evolve their smart home offerings while gleaning insights from their own data. 

With a bias for action and a love for being trailblazers, the team at Plume embodies a combination of relentless curiosity and imaginative innovation. We challenge ourselves to think in ways that other companies don't, work to do what should be done (rather than what can), and if we can’t do it exceptionally well, we don’t do it. It’s how we've assembled a team of world-class builders, thinkers, and doers. And it’s how we’re reinventing what’s possible every day.

Manager, Site Reliability Engineering (SRE)

Life at Plume

At Plume, we believe that technology isn't about moving faster, it's about making life’s moments better. Which is why we’ve built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learning to create the future of connected spaces—and human experiences—at massive scale.

We now deliver services to over 60 million locations globally and have managed over 3 billion devices on our platform. We’re expanding rapidly, pioneering a new category, and we achieved our Series F funding in just four years. Our customers include many of the world's largest Internet Service Providers (ISPs) who look to Plume to help them evolve their smart home offerings while gleaning insights from their own data. 

With a bias for action and a love for being trailblazers, the team at Plume embodies a combination of relentless curiosity and imaginative innovation. We challenge ourselves to think in ways that other companies don't, work to do what should be done (rather than what can), and if we can’t do it exceptionally well, we don’t do it. It’s how we've assembled a team of world-class builders, thinkers, and doers. And it’s how we’re reinventing what’s possible every day.

We are seeking a highly experienced and visionary Director of Site Reliability Engineering (SRE) to lead our growing global SRE organization. This critical role requires a strong people leader who can manage managers, set the strategic direction for the SRE function, and ensure the reliability, scalability, and performance of our systems.

What You’ll Do:

  • People Leadership and Management:
    • Lead, mentor, and develop a team of site reliability engineers.
    • Foster a culture of continuous learning, collaboration, and operational excellence within the team.
    • Conduct regular 1:1s, provide constructive feedback, and support career development for all team members.
    • Mediate conflicts and facilitate effective communication within the team and with other departments.
  • Organizational Strategy and Direction:
    • Work collaboratively with your peer managers across SRE to deliver on strategic planning, OKR attainment, and ensuring alignment with overall business objectives.
    • Establish and enforce best practices for incident response, problem management, change management, and disaster recovery.
    • Drive the adoption of SRE principles and methodologies across engineering teams.
    • Stay abreast of industry trends and emerging technologies to continuously improve our SRE capabilities.
  • Hiring and Team Growth:
    • Lead the recruitment efforts for SRE roles, including defining job requirements, interviewing candidates, and making hiring decisions.
    • Develop and implement onboarding programs to ensure new hires are successfully integrated into the team.
    • Identify skill gaps and implement training programs to enhance the capabilities of the SRE team.
  • Enabling and Empowering the Team:
    • Provide the necessary tools, resources, and support to enable SRE teams to effectively monitor, troubleshoot, and optimize system performance.
    • Empower teams to take ownership of system reliability and drive continuous improvement initiatives.
    • Remove roadblocks and facilitate cross-functional collaboration to ensure the success of SRE projects.
  • Delivery and Results:
    • Ensure the successful delivery of SRE initiatives, projects, and goals, meeting defined SLAs and KPIs.
    • Drive efforts to reduce operational toil, improve system availability, and enhance overall system stability.
    • Report on key SRE metrics and progress to senior leadership.

What You’ll Bring:

  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 10+ years of experience in Site Reliability Engineering or a similar role.
  • 3+ years of experience managing and leading engineering teams, including managing managers, in a global environment.
  • Proven track record of building and scaling high-performing infrastructure and platforms.
  • Deep understanding of SRE principles, methodologies, and best practices.
  • Experience with large-scale distributed systems and cloud platforms.
  • Strong communication, interpersonal, and leadership skills.
  • Ability to think strategically and execute tactically.
  • Experience with budgeting and resource allocation.

About Plume

As the creator of the only open, hardware-independent, cloud-controlled experience platform for ISPs and their subscribers, Plume partners with over 400 ISP customers, including some of the world’s largest such as Comcast, Charter, Liberty Global, and J:COM. 

Using OpenSync, the most widely supported open-source, silicon-to-cloud framework for smart spaces, Plume’s software-defined network allows ISPs to decouple their service offerings from hardware and rapidly curate and deliver new services over a multi-vendor, open-platform architecture.  

Plume is an equal opportunity workplace that maintains a continuing policy of nondiscrimination in all employment practices and decisions, ensuring equal employment opportunities for all qualified individuals without regard to race, color, creed, religion, sex, national origin, age, physical or mental disability, sexual orientation, gender identity, marital status, pregnancy, childbirth or related individual conditions, medical conditions (as defined by state law), military or veteran status, or any other characteristic protected by federal, state or local law.

Top Skills

AI
Cloud Platforms
Machine Learning
SaaS
Wifi

Similar Jobs

12 Days Ago
Remote
United States
192K-261K Annually
Senior level
192K-261K Annually
Senior level
eCommerce • Software
Lead and develop a team of SRE engineers, overseeing platform tooling, infrastructure, deployment automation, and operational excellence for product teams.
Top Skills: ArgocdAWSCloudFormationDatadogElk StackFluxGCPGithub ActionsGitlab CiGoGrafanaKubernetesNew RelicPrometheusPulumiPythonTerraform
12 Days Ago
Remote
United States
175K-195K Annually
Senior level
175K-195K Annually
Senior level
Marketing Tech • Mobile • Software
Lead the Site Reliability Engineering team, ensuring platform reliability, supporting developers, and implementing scalable architecture while mentoring and managing team members.
Top Skills: AICloud InfrastructureEmberGoReactSaaS
12 Days Ago
Remote
United States
Senior level
Senior level
Big Data • Internet of Things • Machine Learning
Lead and manage the Site Reliability Engineering team, define strategic vision, drive SRE best practices, and ensure system performance and reliability.
Top Skills: AICloud PlatformsMachine LearningSaaS

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account