Cisco ThousandEyes

Observability DevOps Site Reliability Engineer (SRE)

Posted 2 Days Ago
Remote or Hybrid
Hiring Remotely in Oeiras, Lisboa
Mid level
The role focuses on enhancing observability and reliability for CiscoIT workloads, involving monitoring solutions, AI/ML integration, and cloud infrastructure support.
The summary above was generated by AI
Meet the Team
At Cisco, we are a global leader in networking and IT, driving innovation and redefining how people connect, communicate, and collaborate. Our mission is to shape the future of the internet by creating unprecedented value and opportunity for our customers, employees, investors, and ecosystem partners. We are committed to fostering a diverse, collaborative environment where everyone can thrive and contribute to our collective success.
Your Impact
We are seeking a highly skilled and experienced DevOps Site Reliability Engineer to join our team, focusing on the development and support of Observability capabilities for workloads across CiscoIT Datacenter and Cloud environments.
  • You will reshape how we manage alerts, metrics, and logs by introducing deep learning and GenAI to enhance reliability services.
  • You will take ownership & responsibility for reliability, scalability, automation, and other issues related to uptime and availability of our monitoring solutions.

Minimum Qualifications
The ideal candidate will have a strong background in relevant Observability technologies & AI/ML with a proven track record of delivering innovative solutions that enhance system monitoring, performance, and reliability.
  • Bachelor's degree in Computer Science, Computer Engineering, or a related field, or 3+ years of relevant experience.
  • Understanding of IT lifecycle processes including architecture, design, implementation, and operations
  • Understanding of security including OS hardening, firewalls, iptables, and working with Infosec
  • Understanding of network basics like routers and switches
  • Experience with software development tools like GitHub and Jenkins
  • Python, Shell, Go, or similar programming experience.
  • Experience with the software development lifecycle including design, development, testing, packaging, deployment, upgrade, and support.
  • Open source development experience.
  • Familiarity with Agile software development.
  • Leadership in building and maintaining SRE technologies.
  • Experience with public cloud like AWS, GCP, or Azure.
  • Experience with QA and testing of your own code and the entire platform.

Preferred Qualifications
  • Experience with tool suites like Splunk Cloud, Splunk Observability Cloud, Elastic, Prometheus/Thanos & Grafana.
  • Experience with ThousandEyes, Zabbix, AppD, or similar tools is a plus.
  • Experience with JavaScript, in either Node.js or React.
  • Experience with implementing AI/ML- and LLM-based agentic Observability use cases.
  • Experience with infrastructure or application performance monitoring solutions, and testing experience in a diverse and complex infrastructure.
  • Experience with on-premises cloud technologies using VMware or OpenStack.
  • Experience with container technologies like OpenShift, Kubernetes, and Docker.
  • Experience with building and maintaining Red Hat or CentOS Linux.
  • Experience with configuration automation using Ansible.

Behavioral Competencies
  • Working with geographically distributed teams
  • Self-motivated and willing to help where help is needed
  • Able to build relationships, with cultural sensitivity, goal alignment, and learning agility

Why Cisco?
At Cisco, we're revolutionizing how data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.
Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you'll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.
We are Cisco, and our power starts with you.

Top Skills

Ansible
AppD
AWS
Azure
CentOS
Docker
Elastic
GCP
Git
Go
Grafana
JavaScript
Jenkins
Kubernetes
OpenStack
Prometheus
Python
Red Hat
Shell
Splunk Cloud
Splunk Observability Cloud
Thanos
ThousandEyes
VMware
Zabbix
