DevOps Engineer

Peninsular Research Operation • Full-time • Chennai, IN • 2w ago

Senior DevOps Engineer - Azure

Do you thrive in automating and optimizing cloud operations? Are you passionate about Azure and building rock-solid CI/CD pipelines? If so, we want to hear from you!

We're seeking a seasoned DevOps Engineer with at least 6 years of experience to join our growing team. You'll play a key role in designing, implementing, and maintaining our cloud infrastructure on Microsoft Azure.

JOB RESPONSIBILITY:

Run the production environment by monitoring availability and taking a holistic view of system health
Build software and systems to manage platform infrastructure and applications
Improve reliability, quality, and time-to-market of our suite of software solutions
Running our infrastructure with Ansible, Terraform, GitLab CI/CD, and Kubernetes
Measure and optimize system performance, to push our capabilities forward, get ahead of customer needs, and innovate for continual improvement
Partner with development teams to improve services through rigorous testing and release procedures
Participate in system design consulting, platform management, and capacity planning
Create sustainable systems and services through automation and uplift
Balance feature development speed and reliability with well-defined service-level objectives

JOB REQUIREMENTS:

The candidate should have 3 – 5 years of experience as a Site Reliability Engineer
Ability to automate with various scripting languages (Python, Shell scripting, etc…)
Experience managing systems using infrastructure as code tools (IAM, ARM, Terraform, Chef, …)
Solid understanding of Cloud Computing and DevOps concepts, including CI/CD pipelines
Hands-on experience with container orchestration, preferably with Kubernetes
Hands-on experience with one or more observability tools (Prometheus, Grafana, ELK/OpenSearch, Open Telemetry, Datadog, etc.)
Experienced in Instrumentation with systems skills in building and operating, monitoring, logging, and alerting services of distributed systems at scale
Proven experience in maintaining scalability and resilience in complex environments.
Bachelor’s degree (or equivalent) in computer science or related discipline
A proactive approach to identifying problems, performance bottlenecks, and areas for improvement
Strong experience in maintaining AWS cloud infrastructure.
Experience with infrastructure automation and container orchestration tools (Docker, Kubernetes, Terraform, Helm, etc.).
Good understanding of concepts related to computer architecture, data structures, and programming practices.
Knowledge of any one or more of Python, Shell, Go, or Java.
Strong debugging/troubleshooting skills.
Deep working knowledge of Linux servers and networking.
Experience with monitoring/logging solutions like DataDog, ELK, and Prometheus.
Familiarity with continuous integration and deployment tools like Gitlab CI/Argo workflow/Argo CD