This role involves maintaining system reliability, managing support calls, debugging issues in Unix/Linux systems, and ensuring platform visibility and uptime.
Are you looking to Optimize your life? Start your exciting path to a rewarding career today!
We are Optimum, a leader in the fast-paced world of connectivity, and we're on the hunt for enthusiastic professionals to join our team! We understand that connectivity isn't just a luxury anymore - it's a necessity that empowers lives, fuels businesses, and drives innovation. A career at Optimum means you'll be enabling progress and enhancing lives by providing reliable, high-speed connectivity solutions that keep the world connected. We owe our success to our amazing product, commitment to our people and the connections we make in every community.
If you are resourceful, collaborative, team-oriented and passionate about delivering consistent excellence, Optimum is the Company for you!
We are Optimum!
Job Summary
As a Site Reliability Engineer I, you are the frontline engine of our hybrid platform. This role is focused on service continuity and active incident response. You will work shifts to provide support coverage, perform real-time debugging, and keep our GCP and On-Premises Unix/Linux systems running at all times.
The Mission: Real-Time Reliability
Your mission is to maintain 100% platform visibility. You will be the primary responder to our observability stack, moving beyond simple monitoring to active debugging and remediation. You will handle the "heavy lift" of shift-based support calls and system health checks, ensuring that technical debt is addressed and service disruptions are mitigated before they impact the business.
Responsibilities
• Shift-Based Support & Triage: Act as the primary technical point of contact during your shift. Manage the support queue, answer urgent infrastructure calls, and provide initial triage for all system anomalies.• Active Debugging: Investigate and resolve service issues across the stack. This includes debugging Kubernetes pod failures, resolving Kafka consumer lag, and troubleshooting Unix/Linux system errors using logs (Loki) and traces (Tempo).• Hybrid Platform Maintenance: Execute routine standardization tasks and health audits for Unix (Solaris/AIX) and Linux (RHEL/Ubuntu) environments to prevent environment drift.• Infrastructure Stewardship (DC Support): Perform on-site "Smart Hands" support in our Bethpage data center, including hardware reboots, component swaps, and verifying physical power/network redundancy.• Unified Observability: Maintain the "single pane of glass" (Prometheus/Grafana). Create and tune alerts to ensure the engineering team is notified of critical issues while minimizing "alert fatigue."• Escalation & Post-Mortems: Follow strict escalation paths to SRE2/SRE3 leads, Assist in complex outage mitigation. Contribute detailed timelines and log data to Blameless Post-Mortems.
Qualifications
• Bachelor's degree in Telecommunications, Computer Engineering, or related technical field.• 0-2years of experience in mobile network operations or systems engineering roles.• OS Internals: Foundational command-line proficiency in Linux (RHEL/Ubuntu) and Unix (Solaris/AIX). Ability to troubleshoot CPU/Memory/Disk bottlenecks.• Debugging Skills: Familiarity with log analysis tools (Loki) and the ability to correlate metrics (Prometheus) to find root causes.• Cloud & Containers: Basic understanding of GCP (Compute Engine, GKE) and Kubernetes (restarting pods, viewing logs, checking ingress).• Kafka Awareness: Basic understanding of Kafka topics and the ability to monitor consumer group health.• Automation Exposure: Ability to run and verify Ansible playbooks and Terraform plans.• Communication: Excellent verbal communication for handling support calls and providing clear updates during high-pressure incidents.
At Optimum, we're fueled by our four core pillars: Taking Ownership, Upholding Transparency, Creating Community, and Demonstrating Expertise. Our commitment to empowering employees to take responsibility and embrace proactive problem-solving underpins Taking Ownership. Upholding Transparency is at the core of our culture, with open and honest communication fostering trust among our dedicated team and loyal customers. Creating Community is more than a goal; it's our daily commitment to fostering an environment of collaboration, innovation, and positivity. Demonstrating expertise is a promise we uphold through continuous learning and engagement with our customers to consistently deliver top-quality products and services. These pillars not only shape our culture but define Optimum as a place of excellence, trustworthiness, and thriving community, and we invite you to be a part of our journey.
If you have the drive to succeed and are ready to embark on a thrilling career, seize this opportunity today, and join our winning team, so together, we'll shape the future of connectivity.
All job descriptions and required skills, qualifications and responsibilities for a particular position are subject to modification by the Company from time to time, in the Company's discretion based on business necessity.
We are an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, national origin, religion, age, disability, sex, sexual orientation, gender identity or protected veteran status, or any other basis protected by applicable federal, state, or local law. The Company provides reasonable accommodations upon request in accordance with applicable requirements.
Optimum collects personal information about its applicants for employment that may include personal identifiers, professional or employment related information, photos, education information and/or protected classifications under federal and state law. This information is collected for employment purposes, including identification, work authorization, FCRA-compliant background screening, human resource administration and compliance with federal, state, and local law.
Applicants for employment with the Company will never be asked to provide money (even if reimbursable) as part of the job application or hiring process. Please review our Fraud FAQ for further details.
Pay is competitive and based on a number of job-related factors, including skills and experience. The starting pay rate/range at time of hire for this position in New York is $66,830.00 - $95,472.00 / year. For other locations, please inquire with your recruiter. The rates/ranges provided herein are the anticipated pay at the time of hire, and do not reflect future job opportunity.
Nearest Major Market: Long Island
Nearest Secondary Market: New York CIty
We are Optimum, a leader in the fast-paced world of connectivity, and we're on the hunt for enthusiastic professionals to join our team! We understand that connectivity isn't just a luxury anymore - it's a necessity that empowers lives, fuels businesses, and drives innovation. A career at Optimum means you'll be enabling progress and enhancing lives by providing reliable, high-speed connectivity solutions that keep the world connected. We owe our success to our amazing product, commitment to our people and the connections we make in every community.
If you are resourceful, collaborative, team-oriented and passionate about delivering consistent excellence, Optimum is the Company for you!
We are Optimum!
Job Summary
As a Site Reliability Engineer I, you are the frontline engine of our hybrid platform. This role is focused on service continuity and active incident response. You will work shifts to provide support coverage, perform real-time debugging, and keep our GCP and On-Premises Unix/Linux systems running at all times.
The Mission: Real-Time Reliability
Your mission is to maintain 100% platform visibility. You will be the primary responder to our observability stack, moving beyond simple monitoring to active debugging and remediation. You will handle the "heavy lift" of shift-based support calls and system health checks, ensuring that technical debt is addressed and service disruptions are mitigated before they impact the business.
Responsibilities
• Shift-Based Support & Triage: Act as the primary technical point of contact during your shift. Manage the support queue, answer urgent infrastructure calls, and provide initial triage for all system anomalies.• Active Debugging: Investigate and resolve service issues across the stack. This includes debugging Kubernetes pod failures, resolving Kafka consumer lag, and troubleshooting Unix/Linux system errors using logs (Loki) and traces (Tempo).• Hybrid Platform Maintenance: Execute routine standardization tasks and health audits for Unix (Solaris/AIX) and Linux (RHEL/Ubuntu) environments to prevent environment drift.• Infrastructure Stewardship (DC Support): Perform on-site "Smart Hands" support in our Bethpage data center, including hardware reboots, component swaps, and verifying physical power/network redundancy.• Unified Observability: Maintain the "single pane of glass" (Prometheus/Grafana). Create and tune alerts to ensure the engineering team is notified of critical issues while minimizing "alert fatigue."• Escalation & Post-Mortems: Follow strict escalation paths to SRE2/SRE3 leads, Assist in complex outage mitigation. Contribute detailed timelines and log data to Blameless Post-Mortems.
Qualifications
• Bachelor's degree in Telecommunications, Computer Engineering, or related technical field.• 0-2years of experience in mobile network operations or systems engineering roles.• OS Internals: Foundational command-line proficiency in Linux (RHEL/Ubuntu) and Unix (Solaris/AIX). Ability to troubleshoot CPU/Memory/Disk bottlenecks.• Debugging Skills: Familiarity with log analysis tools (Loki) and the ability to correlate metrics (Prometheus) to find root causes.• Cloud & Containers: Basic understanding of GCP (Compute Engine, GKE) and Kubernetes (restarting pods, viewing logs, checking ingress).• Kafka Awareness: Basic understanding of Kafka topics and the ability to monitor consumer group health.• Automation Exposure: Ability to run and verify Ansible playbooks and Terraform plans.• Communication: Excellent verbal communication for handling support calls and providing clear updates during high-pressure incidents.
At Optimum, we're fueled by our four core pillars: Taking Ownership, Upholding Transparency, Creating Community, and Demonstrating Expertise. Our commitment to empowering employees to take responsibility and embrace proactive problem-solving underpins Taking Ownership. Upholding Transparency is at the core of our culture, with open and honest communication fostering trust among our dedicated team and loyal customers. Creating Community is more than a goal; it's our daily commitment to fostering an environment of collaboration, innovation, and positivity. Demonstrating expertise is a promise we uphold through continuous learning and engagement with our customers to consistently deliver top-quality products and services. These pillars not only shape our culture but define Optimum as a place of excellence, trustworthiness, and thriving community, and we invite you to be a part of our journey.
If you have the drive to succeed and are ready to embark on a thrilling career, seize this opportunity today, and join our winning team, so together, we'll shape the future of connectivity.
All job descriptions and required skills, qualifications and responsibilities for a particular position are subject to modification by the Company from time to time, in the Company's discretion based on business necessity.
We are an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, national origin, religion, age, disability, sex, sexual orientation, gender identity or protected veteran status, or any other basis protected by applicable federal, state, or local law. The Company provides reasonable accommodations upon request in accordance with applicable requirements.
Optimum collects personal information about its applicants for employment that may include personal identifiers, professional or employment related information, photos, education information and/or protected classifications under federal and state law. This information is collected for employment purposes, including identification, work authorization, FCRA-compliant background screening, human resource administration and compliance with federal, state, and local law.
Applicants for employment with the Company will never be asked to provide money (even if reimbursable) as part of the job application or hiring process. Please review our Fraud FAQ for further details.
Pay is competitive and based on a number of job-related factors, including skills and experience. The starting pay rate/range at time of hire for this position in New York is $66,830.00 - $95,472.00 / year. For other locations, please inquire with your recruiter. The rates/ranges provided herein are the anticipated pay at the time of hire, and do not reflect future job opportunity.
Nearest Major Market: Long Island
Nearest Secondary Market: New York CIty
Top Skills
Ansible
GCP
Grafana
Kafka
Kubernetes
Linux
Prometheus
Terraform
Unix
Similar Jobs at Optimum
AdTech • Digital Media • Internet of Things • Marketing Tech • Mobile • Retail • Software
The Event Representative drives market engagement, builds community relationships, and delivers sales by connecting with customers at community events.
AdTech • Digital Media • Internet of Things • Marketing Tech • Mobile • Retail • Software
The role involves leading AI system development, collaborating on ML model integration, and mentoring engineers while ensuring software reliability and performance.
Top Skills:
GitGoogle CesGCPJavaScriptNode.jsPython
AdTech • Digital Media • Internet of Things • Marketing Tech • Mobile • Retail • Software
The Sales Event Coordinator plans and executes marketing events, manages logistics, evaluates event success, and provides operational support in New York.
Top Skills:
ExcelMicrosoft PowerpointMicrosoft Sharepoint
What you need to know about the Charlotte Tech Scene
Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.
Key Facts About Charlotte Tech
- Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
- Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
- Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
- Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
- Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

