Responsibilities
We are seeking a highly skilled and experienced Senior DevOps Engineer to join our dynamic team. As a Senior DevOps Engineer, you will be responsible for designing, implementing, and maintaining scalable infrastructure, CI/CD pipelines, and automation processes. You will work closely with development teams, system administrators, and other engineers to improve the overall development lifecycle and optimize system reliability, performance, and security.
Key Responsibilities:
Design, implement, and maintain cloud infrastructure (AWS, Azure, GCP) and automation tools to streamline deployment processes.
Collaborate with development teams to build and improve Continuous Integration/Continuous Delivery (CI/CD) pipelines.
Automate system configurations, deployment, monitoring, and scaling using tools such as Ansible, Chef, Puppet, or Terraform.
Maintain and improve monitoring and alerting systems to ensure high availability and quick issue resolution.
Troubleshoot complex issues across infrastructure, networking, and application layers.
Ensure security best practices are followed in both infrastructure and code development, including implementing automated security scans and patches.
Perform system updates, patches, and backups to maintain the integrity and reliability of infrastructure.
Optimize resource utilization and cost efficiency across cloud platforms.
Lead efforts in the design, development, and implementation of disaster recovery strategies.
Mentor junior engineers and help improve team skills and productivity through code reviews, knowledge sharing, and training.
Stay updated with emerging technologies, best practices, and industry trends related to DevOps, cloud infrastructure, and CI/CD.
Work from the client's office in Bangalore 3 days per week.
Participate in on-call support shifts once every three months.
Skills
Overall experience in the role of at least 10 years.
More than 6 years of hands-on experience with the following tools and technologies: Azure Cloud, Ansible, Terraform scripting
At least 4 years of hands-on experience with deploying, managing, and upgrading Kubernetes
Cloud Infrastructure:
In-depth experience with at least one cloud provider (AWS, Azure, GCP).
Familiarity with cloud services like EC2, S3, Lambda, RDS, Kubernetes, VPC, IAM, etc.
Expertise in cloud networking, security, and cost optimization strategies.
CI/CD & Automation:
Strong experience with CI/CD tools such as Jenkins, GitLab CI, CircleCI, or Travis CI.
Ability to implement and maintain end-to-end automation for build, test, and deployment pipelines.
Experience with infrastructure-as-code tools (e.g., Terraform, AWS CloudFormation, or Azure Resource Manager).
Familiarity with containerization (Docker) and container orchestration (Kubernetes, OpenShift).
Version Control and Scripting:
Advanced proficiency in Git and Git workflows (e.g., branching, pull requests, rebasing).
Experience with scripting languages like Python, Bash, Ruby, or PowerShell.
Ability to write modular, reusable scripts to automate tasks.
Monitoring & Logging:
Experience with monitoring tools like Prometheus, Grafana, Nagios, Datadog, or New Relic.
Knowledge of log aggregation tools (e.g., ELK stack, Splunk, Fluentd).
Strong understanding of setting up alerting and proactive issue detection.
Containerization and Orchestration:
Expertise in Docker containerization and managing containerized applications.
Experience with Kubernetes or other container orchestration platforms (e.g., OpenShift, ECS, EKS).
Familiarity with Helm charts for Kubernetes deployments.
Infrastructure Management & Configuration:
Proficient in tools like Ansible, Chef, Puppet, SaltStack, or similar configuration management tools.
Experience with managing servers, clusters, and networking configurations.
Security:
Knowledge of security best practices, vulnerability management, and compliance (e.g., SOC2, HIPAA, PCI-DSS).
Experience with IAM, encryption, key management, and security auditing.
Ability to implement secure deployment practices (e.g., secrets management with Vault or AWS Secrets Manager).
Networking & Performance:
Strong understanding of networking fundamentals (DNS, Load Balancing, HTTP/S, VPNs, proxies).
Experience with performance tuning, network optimization, and troubleshooting.
Database Management & Backup:
Experience with managing and automating database backups and restores.
Familiarity with databases like MySQL, PostgreSQL, MongoDB, and cloud-native databases (e.g., RDS, Cosmos DB).
Collaboration & Communication:
Strong interpersonal and communication skills.
Experience working in Agile/Scrum teams, and ability to collaborate with cross-functional teams.
Ability to document processes, procedures, and architecture designs clearly.
Soft Skills:
Strong problem-solving and troubleshooting abilities.
Self-motivated and able to work independently with minimal supervision.
Excellent time management skills and ability to prioritize tasks effectively.