As a DevOps Team Lead, you will play a crucial role in leading our Infrastructure team. You will be responsible for overseeing a team of 4-6 engineers and driving the end-to-end lifecycle of Infrastructure operations, focusing on prototyping, security, provisioning, maintenance, and de-provisioning. Your primary mission will be to innovate within AWS's platform, optimize operational efficiency, and minimize costs while ensuring robust and scalable infrastructure solutions.
Responsibilities
- Lead, mentor, and manage a team of 4-6 engineers, fostering a collaborative and high-performance work environment.
- Collaborate with product managers, designers, and other stakeholders to define project scope, goals, and deliverables.
- Oversee, manage, and mentor your squad, ensuring they're equipped with top-notch skills and expertise.
- Oversee the development, security, and maintenance of CleverTap's robust cloud infrastructure.
- Monitor key performance indicators (KPIs) and ensure the team hits all targets.
- Lead incident remediation efforts to minimize downtime and keep systems running smoothly.
- Ensure CleverTap's cloud infrastructure remains secure and compliant with industry standards.
- Lead proof of concept initiatives and oversee seamless rollout and deployment of new solutions.
- Own the lifecycle of features, enhancements, bug fixes, security updates, support tasks, and multiple applications.
- Serve up regular updates to stakeholders on the latest infrastructure status, project milestones, and team performance highs (and lows).
- Help recruit and onboard new team members whenever the need arises.
Requirements
- Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
- 5-8 years of hands-on experience in managing infrastructure at the internet scale.
- Proven experience in leading and managing a team of 4-6 Engineers.
- Extensive experience with AWS specifically with networking, and security.
- Strong background in designing, implementing, and maintaining scalable, secure, and reliable infrastructure.
- Expertise in CI/CD pipelines (e. g. Bamboo, GitHub Actions).
- Proficiency in monitoring and logging tools such as Prometheus, Grafana, and Splunk.
- Solid understanding of security best practices and compliance requirements.
- Skilled in scripting languages such as Python, and Bash.
- Excellent communication and collaboration skills, with a knack for working cross-functionally.
- Strong analytical and troubleshooting skills to swiftly resolve issues.
- Passionate about staying up-to-date with industry trends and emerging technologies.
- Experience working with large-scale infrastructure and handling internet-scale applications.
- Deep knowledge of cloud services and architecture, particularly in large, complex environments.
- Proven ability to optimize performance and scalability in high-demand systems.
- Skilled in ensuring high availability and minimal downtime for mission-critical systems.
- Expertise in IaC tools like Terraform, CloudFormation, or similar, for managing scalable environments.
This job was posted by Poppin Mathias from CleverTap.