Location: 100% Remote (residents of the USA, Canada, or LATAM only)
Experience: 5+ years in DevOps, Site Reliability Engineering, or similar roles.
Skills:
- Proficiency with cloud platforms, particularly AWS and GCP.
- Advanced English level (C1/C2).
- Experience with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes).
- Expertise in CI/CD tools and practices, especially GitHub Actions.
- Familiarity with monitoring and logging tools such as Datadog and Sentry.
- Strong knowledge of infrastructure-as-code principles and tools (e.g., Terraform, CloudFormation).
- Experience with Ruby on Rails applications in production environments.
- Solid understanding of network security principles and best practices.
- Excellent problem-solving skills and ability to troubleshoot complex systems.
- Strong communication skills and ability to collaborate effectively with cross-functional teams.
- Previous experience at a startup (Seed, Series A, or Series B) where you contributed to scaling the company. Candidates currently at a large company are welcome if they previously worked at a startup.
Bonus Points For:
- Experience in HIPAA-compliant environments.
- Familiarity with React/Redux/ImmutableJS ecosystems.
- Knowledge of data pipeline tools like Segment.
- Experience with Aptible or similar HIPAA-compliant hosting platforms.
- Relevant certifications (e.g., AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer).
Responsibilities:
- Foster a reliability-focused culture, emphasizing monitoring, alerting, and scaling practices within the engineering team.
- Participate in architecture discussions from a site reliability engineering perspective.
- Manage and optimize CI/CD pipelines and deployment processes.
- Enhance developer experience through tooling and process improvements.
- Design, implement, and maintain scalable and secure infrastructure using cloud platforms (AWS, GCP).
- Collaborate with software engineers to ensure efficient system operations and resolve production issues.
- Implement and maintain monitoring and alerting systems using tools like Datadog and Sentry.
- Continuously improve system reliability, performance, and security.
- Ensure compliance with HIPAA regulations and best practices.
If you have the skills and experience we're looking for and want to join a dynamic team, we would love to hear from you. Apply now and help us build and maintain robust, scalable systems.