ShepHertz is seeking a highly skilled and experienced DevOps Lead to join our team. In this pivotal role, you will be responsible for leading our DevOps initiatives, streamlining our deployment processes, and ensuring the scalability and reliability of our cloud infrastructure. The ideal candidate will have a strong background in software engineering, with expert knowledge of Azure, cloud infrastructure management, and automation tools.
Key Responsibilities
Infrastructure Automation And Management
- Develop and maintain CI/CD pipelines. Automate our software delivery processes, including integration, testing, deployment, and infrastructure provisioning, using tools such as Jenkins, Docker Swarm, GitLab CI, and Azure DevOps.
- Manage cloud environments. Provision, configure, and optimize Azure cloud environments to ensure maximum efficiency and minimal downtime.
- Implement and maintain Kubernetes clusters to support our containerization strategy, including monitoring, scaling, and troubleshooting.
Monitoring And Performance Optimization
- Implement monitoring solutions. Use tools like Prometheus and Grafana for comprehensive monitoring, logging, and alerting to identify and resolve issues proactively.
- Performance optimization. Monitor system performance, identify bottlenecks, and implement strategies to resolve them, ensuring high availability and a seamless user experience.
- Engage in capacity planning and cost optimization activities to ensure efficient use of cloud resources.
Security And Compliance
- Ensure security best practices. Implement and maintain security measures to protect data and infrastructure from threats, in compliance with industry standards.
- Automate compliance processes. Develop automation scripts in Python and Bash to improve efficiency in deployment, scaling, and monitoring while adhering to regulatory requirements.
- Continuously assess and upgrade systems to address security vulnerabilities and meet compliance requirements.
Key Skills & Qualifications
Must Have Skills
- Azure expertise. Deep knowledge of Azure services and management, including Azure AD, App Services, and Azure SQL.
- CI/CD and automation tools proficiency. Hands-on experience with Jenkins, Docker Swarm, GitLab CI, and similar tools for automating deployment and infrastructure management.
- Container orchestration with Kubernetes. Skilled in deploying, managing, and scaling applications with Kubernetes.
- Scripting skills in Python and Bash. Ability to write and maintain scripts to automate tasks and processes.
Good To Have Skills
- Monitoring tools expertise. Familiarity with Prometheus, Grafana, or similar tools for systems monitoring and alerting.
- Experience with Docker Swarm. Adds value by enabling more efficient container management and deployment strategies.
KPIs for Success in This Role
- Delivery efficiency. Reducing deployment time and the frequency of deployment failures.
- System uptime and reliability. Maintaining high availability and recovering quickly from outages.
- Security compliance. No breaches or compliance issues, ensuring system integrity and data protection.
Why Join Us?
- Cutting-edge technology stack. Work with the latest technologies in cloud computing and DevOps practices, staying ahead in the industry.
- Growth and learning opportunities. Continuous professional development in a supportive environment that encourages innovation and creativity.
- Impactful work. Contribute significantly to improving the reliability and efficiency of our services, directly impacting customer satisfaction and business success.