DevOps Engineer
Location: Kochi, Kerala, India (Office-based)
Job Summary
We are looking for an experienced and detail-oriented DevOps Engineer who excels in managing Kubernetes and Docker-based environments, with significant experience in cloud platforms like AWS and Azure. This role will be critical in managing and optimizing our infrastructure, automating CI/CD pipelines, and ensuring seamless and efficient deployments. The candidate should be an expert in Linux system administration, proficient in scripting and automation (Python and shell scripting), and have a strong understanding of cloud infrastructure, container management, and modern DevOps practices.
Key Responsibilities
1. Kubernetes & Docker Management
- Design, deploy, and manage Kubernetes clusters for production-grade applications in AWS and Azure environments.
- Manage containerization using Docker, creating scalable, secure, and efficient container environments.
- Monitor and optimize Kubernetes workloads, ensuring high availability, autoscaling, and fault tolerance.
- Work with Helm for Kubernetes configuration management and automate updates to Kubernetes-based systems.
2. CI/CD Pipeline Design and Implementation
- Design and implement fully automated CI/CD pipelines for applications using tools such as Jenkins, GitLab CI, CircleCI, or similar.
- Manage the complete pipeline process, from code commit to production deployment, including automated testing, building, and release processes.
- Implement and optimize CI/CD processes with a focus on continuous integration, automated testing, and continuous delivery to ensure the smooth and efficient deployment of applications.
- Integrate security (DevSecOps) practices into CI/CD pipelines to ensure secure builds, scans, and deployments.
3. Cloud Infrastructure Management
- Provision and manage cloud resources on AWS and Azure, including EC2 instances, S3, RDS, VPCs, load balancers, Azure VMs, and Azure Blob Storage.
- Utilize Infrastructure as Code (IaC) tools such as Terraform, AWS CloudFormation, and Azure Resource Manager (ARM) templates to automate infrastructure provisioning and management.
- Implement and maintain cloud-based networking and security solutions, ensuring the scalability and security of cloud environments.
- Set up virtual private cloud (VPC) and subnet configurations, and optimize them for cost, performance, and compliance.
4. Linux System Administration
- Manage and administer Linux servers for the production environment, ensuring uptime, performance, and security.
- Perform server hardening, regular updates, patch management, and secure shell (SSH) configurations.
- Troubleshoot and resolve server issues related to performance, disk space, memory usage, and network connectivity.
- Optimize Linux server environments for high performance in cloud-native and container-based setups.
5. Automation and Scripting
- Automate repetitive tasks and operational workflows using Python, Bash, or other scripting languages.
- Develop Python-based automation scripts for various cloud and DevOps operations (e.g., cloud resource management, log analysis, monitoring).
- Implement configuration management tools (Ansible, Puppet) to automate system and environment configurations.
6. Monitoring, Logging, and Alerting
- Set up and configure monitoring solutions (Prometheus, Grafana, CloudWatch, or similar) to proactively detect infrastructure and application issues.
- Implement logging systems such as the ELK Stack (Elasticsearch, Logstash, Kibana) for centralizing logs, monitoring application behavior, and troubleshooting errors.
- Create dashboards and alerts to monitor system health, CPU/memory usage, application performance, and container health.
7. Security and Compliance
- Ensure the infrastructure meets security compliance standards, implementing best practices such as role-based access control (RBAC) for Kubernetes, identity and access management (IAM) policies for cloud environments, and secure container management.
- Implement security practices within the CI/CD pipelines, ensuring security scans (SAST, DAST) are integrated into the development lifecycle.
- Manage backup strategies, disaster recovery plans, and security incident response protocols.
Qualifications
Experience: 2+ years in DevOps with container management and cloud platform expertise.
Education: Degree in computer science or a related field, or equivalent experience.
Skills
- Containerization (Kubernetes, Docker) - Expert
- CI/CD Pipeline Management - Expert
- Cloud Platforms (AWS, Azure) - Expert
- Infrastructure as Code (IaC) - Expert
- Linux System Administration - Expert
- Automation & Scripting (Python, Bash) - Expert
- Monitoring and Logging (Prometheus, ELK Stack) - Expert
- Security Practices (RBAC, IAM) - Expert
- Configuration Management (Ansible, Puppet) - Advanced
- Container Security (Clair, Aqua Security) - Advanced
- Service Meshes (Istio, Linkerd) - Intermediate
- Cloud Cost Optimization - Intermediate
PriceSenz is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, or disability.