Job Description
We are seeking an experienced Technical Lead DevOps to join our engineering team at Wexa AI. In this role, you will lead our DevOps initiatives, architect and maintain our cloud infrastructure, and drive automation across our development and deployment pipelines. You will work closely with development teams to ensure seamless CI/CD processes, optimize system performance, and maintain high availability of our AI-powered recruitment platform. This is a leadership position where you'll mentor team members, establish best practices, and shape our infrastructure strategy.
Department : Engineering
Job Type - Full-time
Work Type- On-site
Locations - Hyderabad, India
Seniority- Lead
Required Qualifications
- Bachelor's or Master's degree in Computer Science, Engineering, or related field
- 8+ years of experience in DevOps, Site Reliability Engineering, or Infrastructure roles
- 5+ years of hands-on experience with cloud platforms (AWS, Azure, or GCP)
- Proven experience leading DevOps teams or initiatives in a technical leadership capacity
- Strong expertise in containerization (Docker) and orchestration (Kubernetes, EKS, AKS)
- Deep understanding of CI/CD tools such as Jenkins, GitLab CI, GitHub Actions, or CircleCI
- Experience with infrastructure as code tools (Terraform, Ansible, CloudFormation)
- Strong scripting skills in Python, Bash, or similar languages
Key Responsibilities
- Lead the design, implementation, and maintenance of scalable cloud infrastructure on AWS/Azure/GCP
- Architect and optimize CI/CD pipelines for automated testing, deployment, and monitoring
- Manage containerization strategies using Docker and orchestration with Kubernetes
- Implement infrastructure as code using Terraform, CloudFormation, or similar tools
- Establish monitoring, logging, and alerting systems to ensure high availability and performance
- Lead incident response and root cause analysis for production issues
- Mentor and guide junior DevOps engineers and collaborate with development teams
- Drive security best practices including vulnerability management, access control, and compliance
Must Have Skills
- AWS/Azure/GCP
- Kubernetes
- Terraform
- CI/CD Pipelines
- Linux / Unix Administration
- Git / Version Control
- IAM & Security Groups
- Helm
- ELK Stack / Logging Infrastructure
- Secrets Management (Vault / AWS Secrets Manager)
- Observability / APM Tools (Datadog / New Relic / CloudWatch)
- Python / Bash Scripting
- Multi-cloud Architecture
- Database Administration
- Service Mesh (Istio/Linkerd)
- Cost Optimization / FinOps
- Advanced Networking & Security
- GitOps Strategy & Governance
- Security & Compliance Frameworks
Good to Have
- MLOps
- Gen AI Infrastructure
- Team Leadership Practices
- Software Architecture & System Design