We are seeking a skilled DevOps Engineer with strong hands-on experience in Azure, Terraform, and Kubernetes to help design, build, and operate secure, scalable, and highly available cloud platforms. This role plays a critical part in enabling engineering teams through Infrastructure as Code, CI/CD automation, container orchestration, and operational excellence for mission-critical SaaS products serving global clients.
Key Responsibilities:
Cloud & Infrastructure (Azure)
- Design, deploy, and manage cloud infrastructure on Microsoft Azure following best practices for scalability, availability, and security.
- Work with core Azure services including Azure Kubernetes Service (AKS), Virtual Networks, Load Balancers, Storage Accounts, Azure Monitor, Application Insights, and IAM.
- Implement cost optimisation, capacity planning, and performance tuning strategies.
Infrastructure as Code (Terraform)
- Build and maintain Terraform modules to provision and manage Azure infrastructure in a repeatable, version-controlled manner.
- Enforce IaC standards, environment parity (dev / QA / prod), and secure configuration practices.
- Collaborate with security and architecture teams to embed compliance and governance into IaC pipelines.
Kubernetes & Container Platform
- Operate and support AKS-based Kubernetes clusters for production and non-production workloads.
- Manage container deployments, scaling, rolling upgrades, and resiliency patterns.
- Troubleshoot cluster, node, pod, and networking issues to ensure high availability and reliability.
CI/CD & Automation
- Design, implement, and maintain CI/CD pipelines for infrastructure and application deployment.
- Automate build, test, release, and rollback processes to improve deployment speed and reliability.
- Integrate security scans, quality gates, and infrastructure validation into pipelines.
Reliability, Monitoring & Operations
- Implement monitoring, alerting, and logging using Azure Monitor, Application Insights, and related tools.
- Participate in incident response, root cause analysis (RCA), and post-incident improvements.
- Support on-call rotations and drive continuous improvements to system reliability and operational maturity.
Cross-Functional Collaboration
- Work closely with Software Engineering, QA, Security, and Architecture teams.
- Support agile delivery through DevOps best practices and shared ownership of production systems.
Required Skills & Experience:
- 4–5 years of hands-on experience as a DevOps Engineer, Platform Engineer, or SRE.
- Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience).
- Strong experience with Microsoft Azure in production environments.
- Proven hands-on expertise with Terraform for Infrastructure as Code.
- Solid experience running workloads on Kubernetes (AKS).
- Strong understanding of Linux systems, networking, and cloud security fundamentals.
- Experience troubleshooting production issues in distributed systems.
- Proficiency in scripting for automation (PowerShell, Bash, or Python).
- Exposure to disaster recovery (DR) planning, backup strategies, and high-availability architectures.