Voters AI (https://votersai.com/), a silicon valley, USA based start-up is building a patent-pending, multi-tenant intelligence platform on AWS. We are hiring a DevOps / platform engineer to own our deployment pipeline, cloud infrastructure, observability, and reliability.
What you will do
- Build and operate CI/CD that deploys server-less services to AWS through GitHub Actions and OIDC (no stored credentials), with safe one-at-a-time deploys, verification, and fast rollback.
- Operate and harden a production AWS estate: Lambda, API Gateway, SQS with dead-letter queues, DynamoDB, RDS PostgreSQL, VPC networking, IAM, S3, CloudFront, Route 53, Cognito, Secrets Manager, SES.
- Build out observability — CloudWatch metrics, alarms, dashboards, structured logging, and alerting — and an operations and monitoring surface for the team.
- Engineer for resilience and scale: load and chaos testing, Step Functions self-healing and auto-remediation, capacity and concurrency tuning, and queue-based admission control for high-burst workloads.
- Own least-privilege IAM, secret rotation, and infrastructure security.
- Carry on-call; drive incident response and mean-time-to-resolve.
Required experience
- Minimum 5 years of professional experience is required, in DevOps, SRE, or platform engineering, with deep hands-on AWS: Lambda, API Gateway, SQS, DynamoDB, IAM, VPC, CloudWatch, EventBridge, Step Functions, S3, CloudFront, Cognito, Secrets Manager.
- Infrastructure-as-code: Terraform, CloudFormation, SAM, or CDK.
- GitHub Actions CI/CD, including OIDC federation to cloud roles.
- Strong in BOTH NoSQL and relational databases: DynamoDB (single-table design, capacity and throughput modeling) AND PostgreSQL (RDS operations, tuning, backups).
- REST API operations behind API Gateway: authorizers, throttling, CORS, staged
deployments.
- Observability and incident response: metrics, alarms, dashboards, alerting, SLOs, and MTTR discipline.
- Confident Python and Bash scripting; strong Linux
Preferred
- Capacity planning and queuing theory; chaos and load testing. High-volume messaging: SMS 10DLC and email deliverability.
- Multi-tenant SaaS, and operating under regulated-communications compliance such as TCPA.
- ClickHouse or another columnar / OLAP database in production.
Other Skills:
- Fluent in English
- Extremely focused work habits.
- You will be working on patent pending technology, strong Math background preferred.
- You will have the lifetime opportunity to learn cutting edge technology from Silicon Valley experts.
Location:
You should be able to commute to Aundh Area of Pune.