DevOps/SRE Engineering Leader - Job Description
Role Summary:
We are looking for a highly accomplished DevOps/SRE Engineering Leader to oversee multi-cloud
infrastructure, manage critical Kubernetes environments, lead a high-performing engineering team, and
ensure enterprise-grade reliability, automation, and security across global cloud platforms. You will be
responsible for strategic DevOps and SRE initiatives, platform stability, security compliance, and FinOps
optimization in a dynamic, fast-paced technology environment.
Key Responsibilities:
- Lead DevOps and SRE strategy and execution across hybrid and multi-cloud platforms (AWS, Azure,
Oracle, GCP).
- Manage critical SaaS infrastructure with 99.99% uptime requirements across global regions.
- Design and optimize CI/CD pipelines using Jenkins, GitLab CI, Azure DevOps, Helm, and GitOps.
- Implement Infrastructure as Code using Terraform, Ansible, and Python.
- Oversee container orchestration with Kubernetes (AKS/EKS), Docker, and Rancher.
- Define and enforce cloud security, compliance (SOC 2, PCI DSS, ISO 27001), and governance standards.
- Drive cloud cost optimization through FinOps best practices and tooling (e.g., Cast AI, Azure Cost
Management).
- Lead observability, monitoring, and incident response using tools such as Prometheus, Grafana, Datadog,
ELK Stack, and Azure Monitor.
- Manage stakeholder communication, project delivery, and resource planning aligned with business OKRs.
- Mentor and scale distributed engineering teams fostering a culture of technical excellence and
accountability.
- Deliver high-scale platform modernization, cloud migration, and automation initiatives.
Required Skills and Technologies:
Cloud Platforms: AWS, Azure, Oracle Cloud, GCP
CI/CD & DevOps: Jenkins, GitLab, Azure DevOps, Helm, GitOps
IaC Tools: Terraform, Ansible, Puppet, Shell scripting, Python
Containers & Orchestration: Docker, Kubernetes (AKS/EKS), Rancher, OpenShift
Monitoring & Observability: Grafana, Prometheus, ELK, Datadog, New Relic
Security & Compliance: SOC 2, PCI DSS, ISO 27001, IAM, PIM
Network & Infrastructure: Cisco, FortiGate, VMware, KVM, HPE, Veritas NetBackup
Project & Team Management: Agile, Scrum, Jira, Confluence, ITSM, ITIL
Qualifications:
Bachelor's or Master's degree in Computer Science, IT, Telecommunications, or a related field
Preferred Certifications:
- Certified Kubernetes Administrator (CKA)
- AWS/Azure Certified Solutions Architect
- (ISC)² Certified in Cybersecurity (CC)
- ITIL, Cisco CCNP or Specialist-level certifications
Preferred Experience:
- 10+ years of experience in SRE/DevOps with at least 5 years in technical leadership roles
- Proven success in large-scale cloud architecture and production operations
- Experience working with global teams and international client stakeholders
- Hands-on expertise in cloud migration, application modernization, and platform security
Soft Skills:
- Strategic thinking & leadership
- Strong communication & stakeholder management
- Analytical problem-solving
- Resilience and adaptability in dynamic environments
- Commitment to continuous learning and innovation