About the RoleWe are looking for an experienced DevOps Engineer to join our engineering team. This role involves setting up, managing, and scaling development, staging, and production environments both on AWS cloud and on-premise (open source stack). You will be responsible for CI/CD pipelines, infrastructure automation, monitoring, container orchestration, and model deployment workflows for our enterprise applications and AI platform.
Key Responsibilities- Infrastructure Setup & ManagementDesign and implement cloud-native architectures on AWS and be able to manage on-premise open source environments when required.
- Automate infrastructure provisioning using tools like Terraform or CloudFormation.
- Maintain scalable environments for dev, staging, and production.
- CI/CD & Release ManagementBuild and maintain CI/CD pipelines for backend, frontend, and AI workloads.
- Enable automated testing, security scanning, and artifact deployments.
- Manage configuration and secret management across environments.
- Containerization & OrchestrationManage Docker-based containerization and Kubernetes clusters (EKS, self-managed K8s).
- Implement service mesh, auto-scaling, and rolling updates.
- Monitoring, Security, and ReliabilityImplement observability (logging, metrics, tracing) using open source or cloud tools.
- Ensure security best practices across infrastructure, pipelines, and deployed services.
- Troubleshoot incidents, manage disaster recovery, and support high availability.
- Model DevOps / MLOpsSet up pipelines for AI/ML model deployment and monitoring (LLMOps).
- Support data pipelines, vector databases, and model hosting for AI applications.
Required Skills and Qualifications- Cloud & InfraStrong expertise in AWS services: EC2, ECS/EKS, S3, IAM, RDS, Lambda, API Gateway, etc.
- Ability to set up and manage on-premise or hybrid environments using open source tools.
- DevOps & AutomationHands-on experience with Terraform / CloudFormation.
- Strong skills in CI/CD tools such as GitHub Actions, Jenkins, GitLab CI/CD, or ArgoCD.
- Containerization & OrchestrationExpertise with Docker and Kubernetes (EKS or self-hosted).
- Familiarity with Helm charts, service mesh (Istio/Linkerd).
- Monitoring / Observability ToolsExperience with Prometheus, Grafana, ELK/EFK stack, CloudWatch.
- Knowledge of distributed tracing tools like Jaeger or OpenTelemetry.
- Security & ComplianceUnderstanding of cloud security best practices.
- Familiarity with tools like Vault, AWS Secrets Manager.
- Model DevOps / MLOps Tools (Preferred)Experience with MLflow, Kubeflow, BentoML, Weights & Biases (W&B).
- Exposure to vector databases (pgvector, Pinecone) and AI pipeline automation.
Preferred Qualifications- Knowledge of cost optimization for cloud and hybrid infrastructures.
- Exposure to infrastructure as code (IaC) best practices and GitOps workflows.
- Familiarity with serverless and event-driven architectures.
Education- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
What We Offer- Opportunity to work on modern cloud-native systems and AI-powered platforms.
- Exposure to hybrid environments (AWS and open source on-prem).
- Competitive salary, benefits, and growth-oriented culture.