GALE helps brands solve complex challenges through our integrated consultancy and agency offering. Headquartered in New York with offices in Toronto, Singapore and Bangalore, our teams are connected by a set of core values that inform everything we do, from how we hire to how we work together: values like Everyone Matters, No Silos, and Masters of Our Craft.
If you're driven by a passion to build something great, a desire to innovate, and a commitment to achieve excellence in your craft, GALE is a great place for you.
About the Role:
We are seeking an experienced and strategic
Manager, DevOps to lead our DevOps and Platform Engineering function from our Bangalore office. In this role, you will drive cloud-native infrastructure strategy, SRE practices, and large-scale automation initiatives across the organization. You will manage a team of full-time and contract engineers, partner with engineering, product, and security leadership, and own the reliability, scalability, and operational excellence of our platforms. If you are a hands-on leader with deep cloud expertise, a passion for building high-performing teams, and the ability to translate complex technical trade-offs into clear business outcomes, we want to hear from you.
Key Responsibilities:
Team & People Leadership:
- Lead, manage, and grow a team of DevOps engineers (FTEs and contractors), overseeing day-to-day delivery, performance reviews, and career development
- Establish clear ownership, accountability, and a high-performance culture within the DevOps function
- Drive training and up-skilling initiatives across key areas such as Kubernetes, Terraform, and GCP to keep the team current and effective
- Mentor senior engineers and support their growth into technical leadership roles
Cloud & Infrastructure Strategy:
- Own and evolve the organization’s cloud infrastructure strategy across AWS and GCP, ensuring platforms are scalable, secure, and cost-effective
- Oversee and architect large-scale migrations and infrastructure modernization programs, including cloud platform transitions and GitHub Enterprise adoption
- Set strategic priorities and roadmaps for reliability, automation, observability, and infrastructure improvements aligned with business objectives
- Collaborate with engineering and product leadership to define infrastructure requirements for new platforms and product initiatives
SRE & Operational Reliability:
- Establish and lead a dedicated SRE function within the DevOps team, driving ownership of uptime, incidents, and on-call practices
- Oversee the full incident management lifecycle, including on-call processes, RCA sign-off, corrective actions, and preventive measures to improve MTTR
- Define and enforce SLOs, SLIs, and error budgets to maintain high service availability
- Standardize DevOps workflows and tooling across planning, alerting, and incident management platforms
CI/CD & Automation:
- Define and govern CI/CD standards and pipeline architecture across the organization, ensuring reliable and consistent deployments
- Champion the use of AI-assisted development tools and automation to reduce toil and accelerate delivery velocity
- Oversee container orchestration strategy using Kubernetes (EKS, OpenShift) and ensure best practices for containerized workloads
- Drive Infrastructure as Code (IaC) adoption using Terraform and Ansible to maintain consistent, auditable environments
Observability, Security & Compliance:
- Own the organization’s observability strategy, driving adoption of monitoring, logging, and alerting solutions across all platforms
- Lead technology audit and compliance programs aligned to ISO certification standards
- Partner with security teams to embed DevSecOps practices into pipelines and infrastructure provisioning
- Work closely with leadership to communicate risks, trade-offs, and timelines in a clear, actionable manner
Requirements:
- 10+ years of hands-on experience in DevOps, platform engineering, or site reliability engineering, with at least 2 years in a people management role
- Proven experience managing cross-functional teams including full-time engineers and contractors
- Deep expertise in AWS (20+ services) and hands-on experience with GCP; familiarity with Azure or other cloud platforms is advantageous
- Strong proficiency in container orchestration using Kubernetes (AWS EKS, GKE) and Docker
- Hands-on expertise with Infrastructure as Code tools, particularly Terraform and Ansible
- Demonstrated experience designing and managing CI/CD pipelines using tools such as Jenkins, ArgoCD, GitHub Actions, or GitLab CI
- Experience establishing and running SRE functions, including on-call frameworks, incident management, and RCA processes
- Proficiency in observability tooling including Grafana Stack (Grafana, Loki, Mimir), ELK/OpenSearch, and AWS CloudWatch
- Strong scripting and automation skills in Python and Shell
- Experience leading or contributing to technology audits and compliance initiatives (e.g., ISO certifications)
- Excellent communication skills with the ability to explain technical concepts and risks to non-technical stakeholders and senior leadership
- Experience with project and service management tooling such as JIRA, PagerDuty or equivalent platforms
Why Join Us:
- Opportunity to shape and lead the DevOps and Platform Engineering practice within a fast-growing, global organization
- Strategic role with direct influence on cloud architecture, tooling decisions, and engineering culture
- Collaborative environment that values operational excellence, innovation, and continuous improvement
- Continuous learning and development opportunities with investment in certifications and skills growth
- Competitive compensation package and benefits