Job Description: DevOps Lead/Architect
Position Overview: We are seeking a seasoned DevOps Lead/Architect with 10 to 12 years of experience to join our team. The ideal candidate will have a strong background in DevOps and Cloud technologies, with a proven ability to design, implement, and manage advanced cloud-based infrastructures. This role requires expertise in automation, containerization, CI/CD pipelines, and cloud platforms, along with excellent troubleshooting skills and a focus on reliability, security, and cost optimization.
Key Responsibilities:
- Cloud Infrastructure Management: Manage infrastructure hands-on across AWS, Azure, and GCP, including compute, networking, IAM, security, and integration services.
- Containerization: Define and implement container-based environment topologies using Docker, Kubernetes, and hosted solutions, adhering to well-architected framework principles.
- Service Mesh Technologies: Design and implement advanced service mesh solutions using tools like Istio, Linkerd, and Kuma.
- Infrastructure Automation: Leverage tools like Chef, Puppet, Ansible, Terraform, ARM, and CloudFormation to automate and manage infrastructure.
- CI/CD Pipelines: Design pipelines as code and manage CI/CD tools such as Azure DevOps, Jenkins, TeamCity, GitLab, and GitHub.
- Monitoring & Security: Use tools such as CloudWatch, CloudTrail, ELK/EFK, Splunk, and Prometheus for system monitoring and logging, and apply OWASP guidance together with SAST and DAST to ensure security.
- Performance Optimization: Implement strategies to meet reliability, performance, and business continuity requirements while optimizing costs.
- Advanced Troubleshooting: Apply strong debugging and problem-solving skills to recurring operational issues, OS administration, and network performance problems.
- High Availability (HA) & Disaster Recovery (DR): Design and implement HA/DR strategies aligned with SLAs, RTO, and RPO.
- Capacity Planning: Estimate infrastructure and resource requirements for optimal performance and scalability.
- Scripting & Automation: Develop complex scripts in Python, Bash, Ant, or Shell to automate infrastructure and operational tasks.
Skills & Qualifications:
- Educational Background: Bachelor’s degree in Engineering/Technology or Master’s degree in Engineering/Computer Applications.
- Cloud Platforms: Expertise in AWS, Azure, and GCP with production-level knowledge.
- Version Control: Proficiency in Git, GitLab, and GitHub.
- Container Infrastructure: Hands-on experience with Docker, Kubernetes, and service mesh technologies.
- Build Tools: Proficiency in tools like Ant, Maven, Make, and Gradle.
- Artifact Repositories: Experience with Nexus and JFrog Artifactory.
- Infrastructure Automation Tools: Expertise in Chef, Puppet, Ansible, Terraform, ARM, and CloudFormation.
- Scripting Languages: Strong skills in Python, Ant, Bash, and Shell scripting.
- Database Knowledge: Familiarity with MySQL, Oracle, and NoSQL databases such as Cassandra and MongoDB.
- Networking: Solid understanding of load balancing, CDN options, and routing strategies for failover and availability.
- Certifications: Cloud Architect or Professional certifications in AWS, Azure, or GCP are highly desirable.
Set Yourself Apart With:
- Ability to estimate setup times for infrastructure, build, and release activities.
- Advanced Linux OS expertise.
- Experience building knowledge bases, developing team skills, and optimizing toolsets within the practice.
- Experience with CDN management and root cause analysis.