CodeChavo is a global digital transformation and software services company. One of our client an AI/ML company, operates a robust infrastructure heavily reliant on computing and data processing. Managing multiple AWS accounts, diverse environments, and Kubernetes deployments demands a comprehensive approach to automation that extends beyond the CI/CD pipeline. The complexities of scalability, reliability, security, and cost optimization persist, presenting continuous challenges. If the prospect of addressing these intricacies in a dynamic environment excites you, this position is an ideal fit
What will you do
- Help in the definition of best practices in production monitoring and alerting, and be able to own the application of the same
- Assist and troubleshoot in the setup and maintenance of various environments (Production, testing, etc)
- Automate, optimize, and drive the efficiency of effort, code, and process
- Be able to assist with product stability and closely collaborate with other tech teams to suggest improvements for the same
- Assist in the implementation of security best practices, especially in public cloud infrastructure and in audit/compliance requirements.
- Own integration of existing systems using appropriate Kubernetes/ Docker / Terraform scripts to automate and improve the efficiency of the deployment
- Develop CI/CD pipelines for various services
- Coordinate and monitor releases of the same
Who you are
- Bachelor's degree in Computer Science, Information Technology, or related field with 2+ years of industry experience as an SRE/DevOps Engineer.
- Expertise in scripting and programming skills (e.g., Python, Shell, Go).
- Good problem-solving and hands-on with the programming language or scripting for infra-automation
- CI/CD experience with Jenkins and cloud deployment technologies like Code Deploy (AWS), and/or GitLab.
- Understanding of enterprise software development and infrastructure processes and lifecycle; ability to adjust and apply this knowledge in a dynamic environment using Agile or similar methodologies.
- Hands-on experience with Infrastructure as Code, using Terraform, CloudFormation, or other tools.
- Hands-on experience with microservices and distributed applications, such as orchestration and containers, Kubernetes, and/or serverless technology.
- Understanding of network concepts and tools
- Understanding of different kinds of infrastructure components, such as database, pub-sub services, and cache etc.
- Stay up-to-date with industry trends and emerging technologies to drive best practices in innovation for security, compliance, and data protection.