About the job
Job Description:
We are seeking a Senior DevOps Engineer to join our team and lead the development and
maintenance of a highly reliable, scalable, secure, and cost-efficient cloud infrastructure. The
ideal candidate will have deep experience managing AWS environments, responding to critical
alerts, and maintaining zero-downtime systems. You will play a key role in ensuring compliance
with industry standards and optimizing infrastructure for cost-effectiveness while supporting
multi-client operations in a B2B setup.
Key Responsibilities:● Design, implement, and manage scalable AWS infrastructure across multiple regions,
ensuring compliance with GDPR, HIPAA, SOC 2, and ISO standards.
● Lead the efforts to monitor and respond to alerts to ensure system reliability,
availability, and zero-downtime.
● Perform and manage VPC Peering, API Gateway, and Kafka configurations across
multiple regions.
● Automate deployment, monitoring, and scaling of infrastructure using best practices.
● Manage and optimize CI/CD pipelines using GitHub and related automation tools.
● Troubleshoot and resolve complex infrastructure, networking, and application
performance issues.
● Implement and optimize system monitoring and alerting tools such as Prometheus,
Grafana, and CloudWatch.
● Ensure high availability and disaster recovery strategies are in place to minimize
downtime.
● Apply security best practices to ensure a secure infrastructure.
● Optimize cloud infrastructure costs without compromising on performance and
availability.
● Oversee multi-client infrastructure management in a B2B environment, including
both cloud and on-premise setups.
● Work closely with development teams to support seamless code deployment and
integration.
● Provide mentorship to junior engineers and foster best practices across the team.
● Flexibility to travel as needed for on-premise infrastructure setups or client needs.
Required Skills & Qualifications:● 5+ years of experience in a DevOps or SRE role, with a focus on cloud infrastructuremanagement.● Extensive experience with AWS (EC2, S3, VPC, Elastic Beanstalk, RDS, API Gateway,
Lambda, etc.).
● Strong experience with VPC Peering and Kafka in multi-region environments.
● Expertise in GitHub, CI/CD pipeline setup, and automation.
● Proficiency with Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation).
● Proven ability to manage multi-client infrastructure and on-premise setups.
● Deep knowledge of containerization technologies like Docker and Kubernetes.
● Strong networking concepts, security best practices, and performance tuning skills.
● Expertise in managing cloud infrastructure in compliance with GDPR, HIPAA, SOC 2,
and ISO.
● Strong scripting skills (e.g., Bash, Python) for automation.
● Experience in cost optimization of cloud infrastructure.
● Flexibility to travel for client requirements and on-premise setup management.
Nice to Have:● Experience with Azure and Google Cloud.
● Experience with monitoring and alerting tools like Prometheus, Grafana, or
CloudWatch.
● Knowledge of microservices architecture and API management.
● Database management experience (SQL/NoSQL) within AWS.
● Experience with incident response and postmortem analysis.