About the Role
As DevOps Engineer - IV, you will design systems capable of serving as the brains of complex distributed products. You will also closely mentor junior engineers on the team and contribute to team building. Overall, you will be a strong technologist at Meesho who cares about code modularity, scalability, and reusability.
What you will do
- Develop reusable Infrastructure code and testing frameworks for Infrastructure.
- Develop tools and frameworks to allow Meesho engineers to provision and manage Infrastructure access controls.
- Design and develop solutions for cloud security, secrets-management and key rotations.
- Design a centralized logging and metrics platform that can handle Meesho’s scale.
- Take on new Infrastructure requirements and develop infrastructure code.
- Work with service teams to help them onboard to the container platform.
- Scale the Meesho platform to handle millions of requests concurrently.
- Drive solutions to reduce MTTR and MTTD, enabling High Availability and Disaster Recovery.
What you will need
- Bachelor's / Master's degree in Computer Science.
- 8-12 years of in-depth, hands-on professional experience in the DevOps / Systems Engineering domain.
- Strong proficiency in Systems, Linux, Open Source, Infrastructure Engineering, and DevOps fundamentals.
- Hands-on experience with data processing frameworks (e.g., Spark, Databricks).
- Familiarity with Big Data technologies.
- Experience with DataOps concepts and tools (e.g., Airflow, Zeppelin).
- Exceptional design and architectural skills.
- Experience in building large-scale distributed systems.
- Experience in Scalable Systems - transactional systems (B2C).
- Expertise in designing and implementing solutions on public cloud (AWS) for security, resilience, etc.
- Expertise in security hardening of cloud infrastructure, applications, and web servers against known and unknown vulnerabilities.
- Understanding of compliance and security.
- Ability to assess business needs and requirements to ensure appropriate approaches.
- Expertise in capacity planning and design, cost and effort estimation, and cost optimization.
- Ability to deliver the best operations tooling and practices, including CI/CD.
- In-depth understanding of SDLC.
- Ability to write infrastructure as code for public or private clouds.
- Ability to implement modern cloud Integration architecture.
- Knowledge of configuration and infrastructure management (Terraform) or CI tools (any).
- Knowledge of a programming language: Python or Go (proficiency in either one).
- Expertise in container services like Docker, Kubernetes, GKE/EKS, etc.
- Ability to architect and implement end-to-end monitoring of solutions in the cloud.
- Ability to design for failover, high availability, MTTR, MTTD, RTO, RPO and so on.
- Ability to define and report on business and process metrics.
- Ability to balance governance, ownership and freedom against reliability.
- Ability to develop and motivate individual contributors on the team.