DevOps Engineer

Scale.jobs • Full-time • Remote (New York, NY, US) • 3d ago

About The Role

The role owns the reliability, scalability, and automation of core cloud infrastructure, ensuring that high-throughput production systems remain highly available and performant around the clock.

This position collaborates closely with software engineering teams to design self-healing infrastructure, streamline continuous delivery pipelines, and build robust observability platforms that enable rapid, safe deployments.

Key Responsibilities

Design, provision, and maintain multi-region cloud infrastructure on AWS using Terraform for Infrastructure as Code (IaC)
Manage and optimize Kubernetes clusters (EKS) to ensure high availability, efficient resource utilization, and secure container orchestration
Build and maintain robust CI/CD pipelines using GitHub Actions or GitLab CI to automate application testing, packaging, and deployment
Implement comprehensive observability across the stack using Prometheus, Grafana, and ELK/Datadog for proactive monitoring and alerting
Participate in a blameless on-call rotation, conducting root-cause analysis (RCA) and implementing preventative measures for production incidents
Partner with security teams to enforce IAM policies, network security groups, vulnerability scanning, and compliance standards

What We Are Looking For

3-6 years of experience in a DevOps, SRE, or Systems Engineering role supporting high-traffic production environments
Deep expertise with AWS and containerization technologies, specifically Docker and production-grade Kubernetes
Strong proficiency in Infrastructure as Code, specifically writing clean, modular Terraform configurations
Hands-on scripting experience in Python, Go, or Bash for automation and tooling development
Solid understanding of networking concepts (VPC, DNS, load balancing, CDN) and Unix/Linux system administration
Bonus: Experience with service meshes (Istio, Linkerd), GitOps workflows (ArgoCD, Flux), or database administration at scale