Position: DevOps / SRE Engineer
Client: Acuver (Serving Notice Period candidates preferred)
Location: Bangalore
Experience: 5–8 Years
Employment Type: Full Time
Work Location: Bangalore preferred
Budget Details (Based on Experience)
5 Years: Up to 18 LPA
6 Years: Up to 20 LPA
7 Years: Up to 24 LPA
8 Years: Up to 26 LPA
Role Overview
We are looking for a mid-senior DevOps / SRE Engineer to build and scale a cloud-native, event-driven platform supporting high-throughput logistics and fulfillment systems.
The role involves building infrastructure, CI/CD pipelines, and observability tooling, and ensuring system reliability and scalability.
Required Skills
5–8 years of experience in DevOps / SRE
Strong experience with AWS (preferred) or GCP
Hands-on experience in:
Kubernetes
Docker
Terraform
CI/CD tools (GitLab, Jenkins)
Experience with:
Kafka / Event-driven systems
Prometheus / Grafana / ELK
Microservices architecture
Strong knowledge of:
Networking
Load balancing
High availability systems
Good to Have
Service Mesh (Istio / Linkerd)
PostgreSQL / AWS RDS
Snowflake exposure
Logistics / Supply Chain domain
DevSecOps practices
Key Responsibilities
CI/CD & Release Engineering
Design and implement CI/CD pipelines (GitLab CI, Jenkins)
Automate build, test, and deployment workflows
Implement blue-green / canary deployments
Establish rollback mechanisms and enforce deployment governance
Cloud Infrastructure & IaC
Manage infrastructure on AWS (Primary) / GCP
Use Terraform for Infrastructure as Code
Build reusable modules
Optimize cost, performance, and scalability
Containerization & Orchestration
Deploy applications using Docker & Kubernetes
Manage workloads using Helm
Implement auto-scaling and high availability
Reliability Engineering (SRE)
Define SLIs, SLOs, and SLAs
Drive uptime and performance improvements
Handle incident response and root cause analysis (RCA)
Build fault-tolerant systems
Observability & Monitoring
Metrics: Prometheus
Logs: ELK / Kibana
Tracing: OpenTelemetry / Jaeger
Build actionable alerting systems
Event-Driven Systems
Work with Kafka, Pub/Sub, and SQS/SNS
Support distributed microservices architecture
Improve throughput and resilience
Database Support
Work with PostgreSQL (AWS RDS)
Performance tuning
Backup and recovery
HA and failover setup