Short Description
Platform Engineer with strong expertise in AWS, Kubernetes, Cloud-Native Microservices, CI/CD, Infrastructure Automation, and Observability to support and scale enterprise ecommerce platforms.
Description
What’s this role about?
We are looking for a highly skilled Platform Engineer to join our Platform Engineering team supporting large-scale ecommerce and enterprise microservices platforms running on Kubernetes and AWS cloud infrastructure.
The role focuses on building, automating, securing, and managing cloud-native platform services while ensuring high availability, scalability, observability, and operational excellence across production and non-production environments. The engineer will work closely with Development, QA, Security, and Infrastructure teams to enable faster software delivery, improve developer experience, and maintain reliable platform operations.
This role requires hands-on experience with Kubernetes, AWS services, CI/CD automation, Infrastructure as Code, observability tooling, security integrations, and production support for mission-critical ecommerce applications.
Here’s how you’ll contribute:
- Build and maintain scalable Kubernetes-based cloud platforms for enterprise ecommerce applications
- Design and manage CI/CD pipelines for microservices deployments
- Automate infrastructure provisioning and platform operations using Terraform and scripting
- Implement monitoring, logging, tracing, and alerting solutions for cloud-native applications
- Support production environments with strong troubleshooting and incident management practices
- Collaborate with development and QA teams to improve deployment reliability and developer productivity
- Implement security best practices, authentication, and authorization integrations across platform services
- Support platform modernization, cloud migration, and disaster recovery initiatives
- Enable operational excellence through automation, observability, and proactive monitoring
You’ll do this by:
- Managing Kubernetes clusters and cloud infrastructure on AWS
- Developing Helm charts, Jenkins pipelines, and shared CI/CD libraries
- Automating infrastructure deployments using Terraform and scripting tools
- Configuring observability solutions using Splunk, OpenTelemetry, Prometheus, Grafana, Sumo Logic, etc.
- Supporting microservices deployments and release management activities
- Implementing ingress, networking, CDN, WAF, and API security solutions
- Troubleshooting production issues across application, infrastructure, and networking layers
- Managing logging, tracing, metrics, and synthetic monitoring solutions
- Collaborating with cross-functional teams during feature rollouts and incident resolution
- Improving system reliability, scalability, and operational efficiency through automation and platform engineering best practices
Core Skills:
- Kubernetes Administration and Cloud-Native Architecture
- AWS Cloud Services (EC2, EKS, VPC, IAM, Route53, S3, Load Balancers, CloudWatch, etc.)
- CI/CD Pipeline Development using Jenkins, Groovy, Helm
- Infrastructure Automation using Terraform
- Docker and Containerization Technologies
- Monitoring and Observability (Splunk, OpenTelemetry, Prometheus, Grafana, Sumo Logic)
- Linux System Administration and Troubleshooting
- Security integrations using OKTA, Keycloak, OAuth/OIDC
- Programming/Scripting knowledge in Python or Java
- Production Support, Incident Management, and Root Cause Analysis
- Strong troubleshooting skills across distributed microservices environments
- Excellent communication and collaboration skills
Desired Skills:
- Service Mesh implementations using Istio or Linkerd
- Experience with ecommerce and high-traffic customer-facing applications
- Knowledge of Akamai, Cloudflare, WAF, CDN, and bot protection solutions
- Experience with synthetic monitoring and RUM solutions
- Understanding of distributed tracing and APM tools
- Exposure to GitOps workflows and ArgoCD
- Experience with OpenSearch, Elasticsearch, Redis, Kafka, or RabbitMQ
- Familiarity with disaster recovery and high-availability architecture patterns
- Understanding of REST APIs and microservices communication patterns
- Experience with performance testing and platform scalability optimization
- Knowledge of AI-driven observability or operational automation is a plus