Key Responsibilities
- Own end-to-end release management across DEV, SIT, UAT, CERT, and PROD
environments.
- Define and govern release calendars, versioning strategies, go/no-go criteria, rollback
plans, and post-release validation.
- Design, implement, and maintain CI/CD pipelines with embedded security, quality, and
compliance controls.
- Drive infrastructure automation using Infrastructure as Code (IaC) for cloud and hybrid
environments.
- Implement SRE practices including SLIs, SLOs, error budgets, incident response, and
blameless postmortems.
- Lead observability strategy using ELK stack for centralized logging, monitoring, alerting,
and root cause analysis.
- Apply AI/AIOps capabilities for anomaly detection, alert noise reduction, predictive
monitoring, and automated RCA.
- Ensure audit readiness, compliance, and traceability across the SDLC.
- Act as technical authority and mentor for DevOps, SRE, and platform engineers.
Required Skills & Experience
- 8–12+ years of experience in DevOps, SRE, Platform Engineering, or Release Engineering
roles.- Strong hands-on experience with CI/CD tools (GitLab CI/CD, Jenkins, Azure DevOps).
- Deep expertise in Docker, Kubernetes, Helm, and zero-downtime deployment strategies.
- Proven experience with Infrastructure as Code (Terraform, Ansible, ARM, or equivalent).
- Strong understanding of SRE principles (SLOs, SLIs, error budgets, incident management).
- Hands-on expertise with ELK / Elastic Stack for observability.
- Experience applying AI/AIOps techniques in IT operations.
- Strong scripting skills (Python, Shell).
- Experience in regulated environments, preferably Wholesale or Corporate Banking.
Preferred Skills
- Experience with Payments, Cash Management, H2H integrations, or API platforms.
- Exposure to DevSecOps tools such as SAST, DAST, SCA, secret scanning.
- Experience with cloud platforms (AWS, Azure) and hybrid architectures.
- Familiarity with DORA metrics and engineering productivity measurement.
Key Competencies
- Strong ownership mindset and accountability for production reliability.
- Excellent stakeholder communication and cross-team orchestration.
- Data-driven decision-making using engineering and reliability metrics.
- Ability to operate under pressure during releases and incidents.
- Mentorship and technical leadership capabilities.