We are seeking an experienced Team Lead - DevOps Engineer to join our dynamic team. This role is ideal for someone passionate about automation, cloud infrastructure, and driving operational excellence while leading a high-performing DevOps team. You will be responsible for architecting, maintaining, and optimizing our cloud-based gaming platform, ensuring high availability, security, and performance. As a Team Lead, you will mentor team members, collaborate with cross-functional teams, and implement best practices to enhance system reliability and scalability.
Responsibilities
- Develop scripts and frameworks for infrastructure automation and management.
- Implement Infrastructure as Code (IaC) using tools such as Terraform, ARM templates, and CloudFormation.
- Design, build, and manage multi-cloud environments (AWS, GCP, Azure) with a focus on AWS services such as EKS, ECS, API Gateway, Lambda, VPC, IAM, WAF, S3, RDS (MySQL + Aurora), DynamoDB, and EC2.
- Ensure high availability and scalability of cloud infrastructure for gaming applications.
- Implement and optimize CI/CD pipelines using tools like Git, Bitbucket, and Jenkins for seamless code deployments.
- Work with software engineers to design fault-tolerant and scalable architectures.
- Develop and refine alerting systems using Grafana, Prometheus, and other APM tools to proactively detect and resolve issues.
- Conduct root cause analysis of incidents and implement preventive measures.
- Participate in on-call rotations for critical platform incidents.
- Implement security best practices for DevSecOps and ensure compliance with industry standards.
- Work on disaster recovery and business continuity strategies.
- Maintain comprehensive documentation of system architecture, automation scripts, and operational processes.
- Collaborate with Engineering and Product teams to enhance system reliability and performance.
Requirements
- Bachelor's degree in Computer Science, Engineering, or a related technical field (Master's degree is a plus).
- 6+ years of experience in DevOps, with a strong focus on AWS Cloud infrastructure.
- Hands-on experience with containerization and orchestration (Docker, Kubernetes).
- Advanced knowledge of Python or Shell scripting for automation.
- Experience with monitoring tools (Grafana, Prometheus) and APM tools.
- Familiarity with Infrastructure as Code (IaC) and configuration management tools such as Terraform, Ansible, Chef, and Puppet.
- Experience in leading technical teams and driving DevOps best practices.
- Experience in incident management, debugging, and performance optimization.
Certifications (Preferred, Not Mandatory)
- AWS Certified Solutions Architect.
- Certified Kubernetes Administrator (CKA).
- HashiCorp Certified: Terraform Associate.
This job was posted by Ananya Srivastava from Baazi Games.