Manager, Dev Ops

Stagwell • Full-time • Bengaluru, IN • 3d ago

GALE helps brands solve complex challenges through our integrated consultancy and agency offering. Headquartered in New York with offices in Toronto, Singapore and Bangalore, our teams are connected by a set of core values that inform everything we do, from how we hire to how we work together: values like Everyone Matters, No Silos, and Masters of Our Craft.

If you're driven by a passion to build something great, a desire to innovate, and a commitment to achieve excellence in your craft, GALE is a great place for you.

About the Role:

We are seeking an experienced and strategic Manager, DevOps to lead our DevOps and Platform Engineering function from our Bangalore office. In this role, you will drive cloud-native infrastructure strategy, SRE practices, and large-scale automation initiatives across the organization. You will manage a team of full-time and contract engineers, partner with engineering, product, and security leadership, and own the reliability, scalability, and operational excellence of our platforms. If you are a hands-on leader with deep cloud expertise, a passion for building high-performing teams, and the ability to translate complex technical trade-offs into clear business outcomes, we want to hear from you.

Key Responsibilities:

Team & People Leadership:

Lead, manage, and grow a team of DevOps engineers (FTEs and contractors), overseeing day-to-day delivery, performance reviews, and career development
Establish clear ownership, accountability, and a high-performance culture within the DevOps function
Drive training and up-skilling initiatives across key areas such as Kubernetes, Terraform, and GCP to keep the team current and effective
Mentor senior engineers and support their growth into technical leadership roles

Cloud & Infrastructure Strategy:

Own and evolve the organization’s cloud infrastructure strategy across AWS and GCP, ensuring platforms are scalable, secure, and cost-effective
Oversee and architect large-scale migrations and infrastructure modernization programs, including cloud platform transitions and GitHub Enterprise adoption
Set strategic priorities and roadmaps for reliability, automation, observability, and infrastructure improvements aligned with business objectives
Collaborate with engineering and product leadership to define infrastructure requirements for new platforms and product initiatives

SRE & Operational Reliability:

Establish and lead a dedicated SRE function within the DevOps team, driving ownership of uptime, incidents, and on-call practices
Oversee the full incident management lifecycle, including on-call processes, RCA sign-off, corrective actions, and preventive measures to improve MTTR
Define and enforce SLOs, SLIs, and error budgets to maintain high service availability
Standardize DevOps workflows and tooling across planning, alerting, and incident management platforms

CI/CD & Automation:

Define and govern CI/CD standards and pipeline architecture across the organization, ensuring reliable and consistent deployments
Champion the use of AI-assisted development tools and automation to reduce toil and accelerate delivery velocity
Oversee container orchestration strategy using Kubernetes (EKS, OpenShift) and ensure best practices for containerized workloads
Drive Infrastructure as Code (IaC) adoption using Terraform and Ansible to maintain consistent, auditable environments

Observability, Security & Compliance:

Own the organization’s observability strategy, driving adoption of monitoring, logging, and alerting solutions across all platforms
Lead technology audit and compliance programs aligned to ISO certification standards
Partner with security teams to embed DevSecOps practices into pipelines and infrastructure provisioning
Work closely with leadership to communicate risks, trade-offs, and timelines in a clear, actionable manner

Requirements:

10+ years of hands-on experience in DevOps, platform engineering, or site reliability engineering, with at least 2 years in a people management role
Proven experience managing cross-functional teams including full-time engineers and contractors
Deep expertise in AWS (20+ services) and hands-on experience with GCP; familiarity with Azure or other cloud platforms is advantageous
Strong proficiency in container orchestration using Kubernetes (AWS EKS, GKE) and Docker
Hands-on expertise with Infrastructure as Code tools, particularly Terraform and Ansible
Demonstrated experience designing and managing CI/CD pipelines using tools such as Jenkins, ArgoCD, GitHub Actions, or GitLab CI
Experience establishing and running SRE functions, including on-call frameworks, incident management, and RCA processes
Proficiency in observability tooling including Grafana Stack (Grafana, Loki, Mimir), ELK/OpenSearch, and AWS CloudWatch
Strong scripting and automation skills in Python and Shell
Experience leading or contributing to technology audits and compliance initiatives (e.g., ISO certifications)
Excellent communication skills with the ability to explain technical concepts and risks to non-technical stakeholders and senior leadership
Experience with project and service management tooling such as JIRA, PagerDuty or equivalent platforms

Why Join Us:

Opportunity to shape and lead the DevOps and Platform Engineering practice within a fast-growing, global organization
Strategic role with direct influence on cloud architecture, tooling decisions, and engineering culture
Collaborative environment that values operational excellence, innovation, and continuous improvement
Continuous learning and development opportunities with investment in certifications and skills growth
Competitive compensation package and benefits

Apply