GeekyAnts India Pvt Ltd
Services
Employees: 251 - 500
Reviews: 4.5
Location: Bengaluru, Karnataka
About Company
GeekyAnts is a design and development studio that specializes in building solutions for web and mobile that drive innovation and transform industries and lives. They hold expertise in state-of-the-art technologies like React, React Native, Flutter, Angular, Vue, NodeJS, Python, Svelte and more.
GeekyAnts has worked with more than 500 clients across the globe, delivering tailored solutions to a wide array of industries, including Healthcare, Finance, Education, Banking, Gaming, Manufacturing, and Real Estate. They are trusted tech partners of some of the world's top corporate giants and have helped small to mid-sized companies realize their vision and transform digitally. They have also been a registered service supplier for Google LLC since 2017.
They provide services including Web & Mobile Development, UI/UX Design, Business Analysis, Product Management, DevOps, QA, API Development, and Delivery & Support.
In addition, GeekyAnts is the team behind React Native's most famous UI library, NativeBase (15,000+ GitHub stars), as well as BuilderX, Vue Native, Flutter Starter, and apibeats, and has numerous other open-source contributions to its name. GeekyAnts has offices in India (Bangalore) and the UK (London).
DevOps / SRE Engineer (Mumbai)
Posted 21 hours ago
Salary: Not Disclosed
Experience: 5 years
Location: Bengaluru, Karnataka
Job Description
We are seeking an experienced DevOps / Site Reliability Engineer (L5) to own and scale the production operations of a large-scale, AI-first platform. In this role, you will be responsible for reliability, performance, observability, and cost efficiency across cloud-native workloads running on GCP and Kubernetes. You will work closely with platform, data, and AI teams to ensure resilient, secure, and highly available systems in production.
Key Responsibilities
- Own day-2 production operations for a large-scale AI-driven platform running on Google Cloud Platform (GCP).
- Run, scale, and harden GKE-based Kubernetes workloads integrated with GCP managed services (data, messaging, AI, networking, and security).
- Define, implement, and operate SLIs, SLOs, and error budgets across platform and AI services.
- Build and manage end-to-end observability using New Relic (APM, infrastructure monitoring, logging, alerts, and dashboards).
- Design, improve, and maintain CI/CD pipelines and Terraform-driven infrastructure automation.
- Operate and integrate Azure AI Foundry for LLM deployments and model lifecycle management.
- Lead incident response, conduct postmortems, and drive long-term reliability and resilience improvements.
- Optimize cost, performance, and autoscaling for AI- and data-intensive workloads.
- Collaborate with engineering and leadership teams to drive best practices in reliability, security, and operations.
Key Skills
- 6+ years of hands-on experience in DevOps, SRE, or Platform Engineering roles.
- Strong, production-grade expertise in Google Cloud Platform (GCP), especially GKE and core managed services.
- Proven experience running Kubernetes at scale in live, mission-critical environments.
- Deep hands-on expertise with New Relic in complex, distributed systems.
- Solid experience operating AI/ML or LLM-powered platforms in production.
- Strong background in Terraform, infrastructure as code, and CI/CD pipelines.
- Good understanding of cloud networking, security, and reliability engineering principles.
- Ability to own and operate production systems end-to-end with minimal supervision.
Good-to-Have Skills
- Experience with multi-cloud environments (GCP + Azure).
- Familiarity with FinOps practices for cloud cost optimization.
- Exposure to service mesh, advanced autoscaling strategies, and capacity planning.
- Experience with data-intensive or real-time systems.
- Knowledge of security best practices, compliance, and IAM in cloud environments.
- Prior experience mentoring junior engineers or leading operational initiatives.
Educational Qualifications
- Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field.
- Master’s degree in a relevant discipline is a plus, but not mandatory.
Rounds Description
[One-to-One In-person Interview] You will speak directly with the Head of Department at GeekyAnts for a technical assessment and review.