Senior DevOps Engineer - Azure
Do you thrive on automating and optimizing cloud operations? Are you passionate about Azure and building rock-solid CI/CD pipelines? If so, we want to hear from you!
We're seeking a seasoned DevOps Engineer with at least 6 years of experience to join our growing team. You'll play a key role in designing, implementing, and maintaining our cloud infrastructure on Microsoft Azure.
JOB RESPONSIBILITIES:
- Run the production environment by monitoring availability and taking a holistic view of system health
- Build software and systems to manage platform infrastructure and applications
- Improve reliability, quality, and time-to-market of our suite of software solutions
- Run our infrastructure with Ansible, Terraform, GitLab CI/CD, and Kubernetes
- Measure and optimize system performance to push our capabilities forward, anticipate customer needs, and drive continual improvement
- Partner with development teams to improve services through rigorous testing and release procedures
- Participate in system design consulting, platform management, and capacity planning
- Create sustainable systems and services through automation and uplift
- Balance feature development speed and reliability with well-defined service-level objectives
JOB REQUIREMENTS:
- 3 – 5 years of experience as a Site Reliability Engineer
- Ability to automate with various scripting languages (Python, shell scripting, etc.)
- Experience managing systems using infrastructure as code tools (IAM, ARM, Terraform, Chef, …)
- Solid understanding of Cloud Computing and DevOps concepts, including CI/CD pipelines
- Hands-on experience with container orchestration, preferably with Kubernetes
- Hands-on experience with one or more observability tools (Prometheus, Grafana, ELK/OpenSearch, OpenTelemetry, Datadog, etc.)
- Experience with instrumentation, plus systems skills in building and operating monitoring, logging, and alerting services for distributed systems at scale
- Proven experience in maintaining scalability and resilience in complex environments.
- Bachelor’s degree (or equivalent) in computer science or related discipline
- A proactive approach to identifying problems, performance bottlenecks, and areas for improvement
- Strong experience in maintaining Azure cloud infrastructure.
- Experience with infrastructure automation and container orchestration tools (Docker, Kubernetes, Terraform, Helm, etc.).
- Good understanding of concepts related to computer architecture, data structures, and programming practices.
- Knowledge of any one or more of Python, Shell, Go, or Java.
- Strong debugging/troubleshooting skills.
- Deep working knowledge of Linux servers and networking.
- Experience with monitoring/logging solutions like Datadog, ELK, and Prometheus.
- Familiarity with continuous integration and deployment tools like GitLab CI, Argo Workflows, and Argo CD.