Senior DevOps Engineer (Platform / DevOps / SRE / AWS / Infrastructure)
Location: Pune, India (On-site)
Type: Full-Time
Experience: 5 - 9 years
About Dispatch Network
Dispatch Network is building the most efficient last-mile network in India from the ground up, using technology and AI-powered optimization to drive efficiency and earnings for delivery partners. We operate across food delivery, quick commerce, grocery, ecommerce, and pharma, serving both B2B clients and B2C consumers.
Our systems power real-time movement of thousands of riders across cities — connecting orders, people, and places with near-zero latency.
As we move from pilots to national scale, platform reliability, automation, and observability become first-class problems. This role exists to own that layer.
Role Overview
This is a pure Senior DevOps / SRE role focused on platform reliability and infrastructure engineering.
You will:
- Build and operate the systems that keep Dispatch running reliably at scale
- Own infrastructure-as-code, deployment pipelines, and service observability
- Drive automation, resilience, and operational excellence across environments
Split:
- 90% Platform / DevOps / SRE
- 10% Infra tooling / automation scripting (Go, with Bash where needed)
What You’ll Do
Platform, DevOps & SRE (Primary Focus)
Infrastructure & Automation
- Design and manage AWS infrastructure using Terraform (ECS, RDS, Redis, Kafka, networking, IAM).
- Own service deployment patterns on ECS / Fargate.
- Build safe, repeatable environments (dev, staging, prod).
- Manage VPC architecture, service discovery, secrets, and access controls.
Reliability & Operations
- Define and implement SLIs, SLOs, and error budgets.
- Build alerting and incident response playbooks.
- Improve system resilience against:
  - Service crashes
  - Network failures
  - Dependency latency
  - Traffic spikes
- Lead incident response and postmortems.
- Reduce MTTR through automation and tooling.
Observability
- Implement structured logging, metrics, and distributed tracing.
- Instrument services and infrastructure for performance and reliability visibility.
- Own dashboards and alerts for critical systems.
CI/CD & Release Engineering
- Build and maintain CI/CD pipelines.
- Improve deployment safety (rollbacks, canaries, blue-green where needed).
- Standardize build and release workflows.
- Enable high deployment velocity with operational safety.
Platform Tooling & Automation
- Build internal tools for infra lifecycle management, cost monitoring, and scaling.
- Automate provisioning, scaling, and recovery workflows.
- Write Go / scripting utilities where infra meets runtime systems.
Infrastructure Collaboration
- Work with backend and platform teams to:
  - Ensure services are production-ready and observable.
  - Improve deployment patterns and runtime configurations.
  - Reduce operational risk in service design.
- Provide reliability and scalability input during architecture reviews.
What We’re Looking For
Core Requirements
- 5 - 9 years of experience in DevOps, SRE, or platform engineering roles.
- Strong hands-on experience with AWS (ECS/Fargate, RDS, networking, IAM).
- Solid experience with Terraform in production systems.
- Strong understanding of Linux, containers, and networking basics.
- Proficiency in Go for automation, tooling, or infra services.
- Experience running Redis, Kafka, and Postgres in production systems.
- Strong debugging and incident-handling mindset.
SRE Mindset
- Comfort owning systems end-to-end.
- Ability to reason about failure modes, not just happy paths.
- Bias toward automation over manual operations.
Nice to Have
- Experience defining SLOs and alerting strategies.
- Experience with high-throughput or real-time systems.
- Exposure to geo-distributed systems or event-driven architectures.
- Experience building internal developer platforms or golden paths.
Why This Role Exists
- Dispatch is past the “just write services” phase.
- Reliability, scale, and operational excellence are now critical.
- This role ensures platform stability and backend velocity as the network scales nationally.
Growth Path
- Platform Lead / Senior SRE
- Staff-level ownership of reliability and infra architecture
- Transition into Infrastructure Architect or SRE Lead roles