NOTE: Candidates requiring sponsorship now or in the future (including CPT/OPT) cannot be considered for this job
Candidates will be required to work on site 4 days per week in Salt Lake City
AWS Site Reliability Engineer
Our client is seeking a skilled and forward-thinking Site Reliability Engineer (SRE) to join their growing infrastructure team. You will be instrumental in designing, building, and maintaining scalable, high-availability systems that support mission-critical services. This role requires deep technical knowledge across modern DevOps tools and practices, with a strong focus on automation, observability, and platform resilience.
Key Responsibilities
- Architect, deploy, and manage cloud-native infrastructure using AWS, Docker, Kubernetes, and Terraform.
- Design and implement CI/CD pipelines using GitHub Actions to ensure smooth, reliable delivery of software.
- Collaborate with developers to improve application performance, availability, and scalability through observability and incident response best practices.
- Integrate cloud-hosted services with traditional on-premise enterprise solutions (e.g., Microsoft, Oracle, SAP, IBM).
- Implement, monitor, and enforce compliance standards related to security, privacy, and regulatory frameworks (e.g., HIPAA, PCI, SOX).
- Work with event streaming tools like Kafka and serverless platforms such as AWS Lambda to drive modern architecture initiatives.
- Maintain clean, structured infrastructure-as-code using JSON and YAML formats.
- Support Agile and iterative development workflows, participating in sprints, planning, and retrospectives.
Qualifications
- Proven experience deploying and maintaining scalable systems in cloud environments, particularly AWS.
- Hands-on expertise with containers and orchestration tools (Docker, Kubernetes).
- Strong experience with infrastructure automation using Terraform.
- Proficient in configuring CI/CD workflows and automation using GitHub Actions.
- Familiar with event-driven architecture and streaming platforms like Kafka.
- Experience setting up and utilizing monitoring tools like Dynatrace
- Deep understanding of JSON, YAML, and scripting best practices.
- Background in systems integration across hybrid environments (cloud/on-premise).
- Demonstrated understanding of compliance requirements and governance frameworks.
- Familiarity with Agile software development methodologies and DevOps culture.
Preferred
- Certifications in AWS, and/or Kubernetes.
- Strong communication skills and a collaborative mindset.