Platform Engineer – Infrastructure
A Platform Engineer – Infrastructure builds and operates the foundational platform that product engineering teams rely on, including developer tools, automation systems, cloud infrastructure, and reliability frameworks.
The role focuses on developer productivity, reliability, scalability, security, and cost efficiency, treating infrastructure as a software product.
Key Responsibilities
1. Developer Platform & Tooling
- Build internal developer tools, self-service platforms, and automation systems.
- Improve developer productivity through infrastructure abstractions, secrets management, debugging, and access tooling.
2. Reliability & Platform Engineering
- Design highly available, scalable systems across multi-AZ/multi-region environments.
- Drive observability (logs, metrics, tracing), SLOs/SLIs, automated failover, and reliability best practices.
- Build automation for scaling, resilience, and incident reduction.
3. Infrastructure as Code & Kubernetes
- Build reusable Terraform/Pulumi/OpenTofu modules and platform abstractions.
- Manage Kubernetes infrastructure, GitOps workflows (ArgoCD/Flux), and multi-cluster environments.
- Develop operators/controllers and automation for infrastructure lifecycle management.
4. Internal Platform Services
- Build APIs, control planes, and orchestration systems for deployments, resource provisioning, access, and environment management.
5. Security, Governance & Cost Optimization
- Implement IAM automation, secrets management, policy-as-code (OPA/Gatekeeper/Kyverno), and compliance guardrails.
- Optimize cloud costs through autoscaling, infrastructure efficiency, and FinOps initiatives.
Requirement:
Experience
- 3–6 years of experience in Platform Engineering, SRE, DevOps, or Software Engineering
.Technical Skill
- Strong programming skills in Go, Python, or Rust for APIs, tooling, and automation
- Hands-on expertise with AWS/GCP/Azure, networking, IAM, and scalable cloud architectures
- Advanced experience in Terraform, Pulumi, or OpenTofu
- Strong understanding of Kubernetes, GitOps (ArgoCD/Flux), and container orchestration
- Experience with observability tools such as Prometheus, Grafana, OpenTelemetry, Datadog, or Jaeger
- Familiarity with OPA, Gatekeeper, Kyverno, Vault, and infrastructure security best practices
.Good to have
- Experience with KEDA, Karpenter, FinOps, and cloud cost optimization
.