We are looking for an experienced Senior Manager to lead the Oracle Banking Cloud Services AMS/SRE teams.
Job description
The role requires a deep understanding of cloud technologies and of PaaS and SaaS platforms (OCI, AWS, Azure, or GCP), along with the ability to manage critical banking applications in a fast-paced, 24/7 environment.
Requirements:
- Strong technical knowledge of PaaS and SaaS platforms (OCI, AWS, Azure, or GCP).
- Strong understanding of microservices, containers, and serverless technologies.
- Hands-on experience with Kubernetes, Docker, and container orchestration platforms.
- Deep knowledge of CI/CD pipelines, version control systems (e.g., Git), and build automation tools (e.g., Jenkins, GitLab CI/CD).
- Experience in automating deployment, scaling, and monitoring processes.
- Experience with logging and monitoring tools (e.g., OCI Monitoring, Prometheus, Grafana, ELK Stack, Splunk).
- Proficiency in managing production environments, troubleshooting, and optimizing applications.
- Ability to handle patch management, incident resolution, and root cause analysis.
- Strong understanding of SRE principles (RTO/RPO/SLA).
- Experience in designing and implementing reliability strategies, including failover and disaster recovery.
- Good understanding of compliance and regulatory frameworks such as SOC, GDPR, and DORA.
- Good understanding of programming languages such as Python and Ruby, and expertise in shell scripting.
- Expertise in banking applications and SaaS-based service delivery.
- Experience in managing teams in shift-based models and ensuring round-the-clock availability.
- Strong communication skills to coordinate with customers, product, engineering, and leadership teams.
- Ability to mentor and upskill team members in DevOps and SRE best practices.
- Ability to present technical concepts to non-technical stakeholders effectively.
- Familiarity with incident management frameworks and escalation processes.
Career Level - M3
Responsibilities
- Oversee the operational support for banking SaaS solutions, ensuring high availability and reliability.
- Lead and manage a team of AMS/SRE engineers working in shifts to provide round-the-clock support and issue resolution.
- Oversee production deployments, incident troubleshooting, and performance analysis.
- Implement and optimize AMS processes to meet service-level agreements (SLAs) and exceed client expectations.
- Leverage cloud expertise to troubleshoot and optimize SaaS applications and infrastructure.
- Collaborate with cross-functional teams to drive continuous improvement and operational excellence.
- Ensure compliance with security, regulatory, and performance standards.
- Automate operational tasks to minimize toil.
- Develop and deploy monitoring and observability tools.