Key Responsibilities
Linux & Server Administration
- Install, configure, and maintain Linux servers (Ubuntu, CentOS, RHEL).
- Perform OS hardening, patching, upgrades, and performance tuning.
- Troubleshoot system-level issues related to CPU, memory, disk, and processes.
DevOps & Automation
- Support and manage CI/CD pipelines (Jenkins, GitHub Actions, GitLab CI, etc.).
- Automate infrastructure and operational tasks using shell scripting (Bash) or Python.
- Assist with Infrastructure as Code (IaC) using Terraform or Ansible.
On-Ground Infrastructure & GPU Support
- Handle on-site server deployments, hardware installations, and upgrades.
- Perform GPU setup, driver installation, CUDA configuration, and troubleshooting.
- Coordinate rack setup, power, cooling, and cabling at client premises.
Containers & Virtualization
- Deploy and manage Docker containers.
- Support Kubernetes clusters (basic to intermediate level).
- Manage virtual machines using VMware / KVM.
Monitoring, Reliability & Security
- Set up and maintain monitoring and alerting (Prometheus, Grafana, ELK, or similar).
- Implement backup, disaster recovery, and log management practices.
- Manage access control, SSH security, firewall rules, and VPN connectivity.
Incident Management & Collaboration
- Act as the first-line responder for Gujarat-based infrastructure incidents.
- Work closely with the central DevOps/Infra team for escalations and best practices.
- Document deployments, SOPs, runbooks, and infrastructure changes.
Required Skills & Qualifications
Core Technical Skills
- Strong hands-on experience in Linux system administration.
- Practical experience with DevOps tools and CI/CD pipelines.
- Experience with Docker and exposure to Kubernetes.
- Solid understanding of networking fundamentals (TCP/IP, DNS, VPN, firewalls).
- Hands-on experience with GPU-based systems is highly preferred.
Cloud & Platform Knowledge
- Working knowledge of AWS / Azure / GCP (compute, storage, networking).
- Experience managing hybrid (on-prem + cloud) environments is a plus.
Soft Skills
- Willingness to travel across Gujarat for client-site support.
- Ability to work independently and take ownership of on-ground operations.
- Strong troubleshooting, communication, and coordination skills.