We are looking for a DevOps Engineer to support application development, infrastructure, and security from the start by automating workflows to keep the DevOps workflow from slowing down. The ideal individual should have the mindset of DevOps with built-in security, not security that functions as a perimeter around apps and data. The candidate will work closely with development teams throughout the application lifecycle to achieve application availability, scalability, and operational effectiveness in the most secure way aligned with automation.
Responsibilities
- Bridging the gaps b/w core infra, security, and development team.
- Owning the end-to-end Availability, Performance, and Capacity of applications and their infrastructure and creating/maintaining the respective observability with DataDog/New Relic/ECS.
- Providing 24X7 infra, and app support, building processes, and documenting tribal knowledge around the same time.
- Managing application deployment, and AWS ECS platforms - automate and improve development and release processes.
- Creating, managing, and maintaining data stores, and data platform infra using IaC.
- Owning and onboarding new applications with the production readiness review process.
- Managing the SLO/Error Budgets/Alerts and performing root cause analysis for production errors.
- Working with the Dev team to have an in-depth understanding of the application architecture and its bottlenecks.
- Identifying observability gaps in application, and infrastructure and working with stakeholders to fix them.
- Managing outages by doing detailed RCA with developers and identifying ways to avoid that situation.
- Automate toil and repetitive work.
Requirements
- 4 to 6 Years of experience in managing large-scale microservices and infrastructure with excellent troubleshooting skills.
- Experience in troubleshooting, managing, and deploying containerized environments using Docker/containers, ECS is a must.
- Must be very hands-on in managing and troubleshooting the AWS environment.
- Extensive experience with Linux administration and a good understanding of the various Linux kernel subsystems (memory, storage, network, etc).
- Good experience in DNS, TCP/IP, UDP, GRPC, Routing and Load Balancing.
- Expertise in GitOps, Infrastructure as a Code tool such as Terraform, etc., and Configuration Management Tools such as Chef, Puppet, Saltstack, and Ansible.
- Experience working with Cloud Infrastructure solutions like AWS.
- Experience in building the CI/CD pipelines.
- Experience with multiple data stores is a plus (Redis, Elasticsearch).
- Must be good in any of the DevOps scripting languages - python, Ruby, or Go.
This job was posted by Sumana B from Stable Money.