About The Role
As a Senior DevOps Engineer, you will draw on extensive software and systems engineering experience to lead the design and implementation of large-scale, fault-tolerant systems. You will ensure the reliability, availability, and scalability of internal and external services, and drive continuous improvement through automation and infrastructure optimization.
What You Will Do
- Designing and developing advanced automation solutions to eliminate manual processes and enhance system reliability
- Managing complex distributed systems that dynamically adapt to various deployment models and evolving customer demands
- Collaborating closely with cross-functional teams to integrate new technologies and improve system architecture
- Leading efforts in system debugging, optimization, and proactive monitoring to maintain high performance and availability
What We Need
- Bachelor's degree in Computer Science, a related technical field, or equivalent practical experience
- Minimum of 5 years of experience in DevOps or a related field
- Extensive experience programming in multiple languages, such as Python, Java, and Go
- Strong knowledge of deploying and debugging applications on Kubernetes
- Deep knowledge of Unix/Linux internals, networking concepts (routing, DNS, SDN), and cloud platforms (AWS, GCP, Azure)
- Proven track record in designing and deploying distributed systems at scale
- Strong expertise in infrastructure-as-code tools (e.g., Terraform, Ansible) and container orchestration (e.g., Kubernetes)
- Ability to diagnose complex system issues and implement effective solutions
Preferred Qualifications
- Experience leading projects in system or network automation
- Proficiency in CI/CD pipelines (e.g., GitHub Actions, FluxCD) and observability tools (e.g., Prometheus, Loki)
- Demonstrated ability to mentor and coach junior engineers
- Innovative mindset with a proactive approach to shaping future technical strategies
Technologies You'll Work With
- Kubernetes: Custom Controllers, Advanced Deployment Strategies
- Cloud Platforms: AWS, GCP, Azure
- Infrastructure-as-Code: Terraform, Ansible
- Observability: Prometheus, Grafana, ELK Stack
- CI/CD: GitHub Actions, Jenkins
- Scripting: Python, Go, Bash