Job Description
Job Description Zycus is looking for an AI-focused Site Reliability Engineer (SRE) Intern who is excited about building, operating, and scaling reliable AI-driven systems. This role is ideal for candidates interested in the intersection of SRE, cloud infrastructure, and GenAI-driven platforms, with hands-on exposure to deploying and monitoring intelligent, Java-based enterprise applications powered by cloud-based LLMs. Internship Details: Duration: 6 months Stipend: INR 25,000 per month Full-Time Opportunity: High-performing interns will be considered for a full-time SRE role at Zycus Roles and Responsibilities: AI System Reliability: Ensure high availability, scalability, and performance of AI-driven, Java-based applications powered by LLM integrations. GenAI Platform Support: Assist in managing and optimizing applications leveraging cloud-based LLMs and AI services. Kubernetes & Microservices: Support containerized applications using Kubernetes, ensuring reliability of microservices architecture. Monitoring & Observability: Track system health, latency, and performance using tools like Prometheus, Grafana, or similar. Automation & AI-driven Ops: Leverage AI tools to automate repetitive SRE tasks, incident resolution, and operational workflows. Incident Management: Troubleshoot production issues, identify root causes, and implement preventive measures. Performance Optimization: Improve system efficiency, resource utilization, and application responsiveness. Cloud Infrastructure Support: Work with cloud environments (AWS/GCP/Azure) to maintain reliable and scalable systems. Collaboration: Partner with engineering teams to improve system resilience and deployment processes. Documentation & Continuous Learning: Maintain system documentation and explore emerging trends in SRE + GenAI operations. Eligibility & Skills: Experience: Final-year students or recent graduates (0–1 year experience) Must Have Skills: Programming & Scripting: Basic proficiency in Java (preferred) and/or Python, along with Shell scripting AI / GenAI Awareness: Basic understanding of Generative AI concepts and familiarity with LLM-based applications Containers & Kubernetes (Basics): Understanding of Docker and Kubernetes fundamentals Operating Systems: Basic knowledge of Linux/Unix systems Cloud Fundamentals: Awareness of AWS/GCP/Azure environments Good to Have Skills: Exposure to AI-assisted development tools (e.g., GitHub Copilot, ChatGPT) for automation and troubleshooting Basic understanding of microservices architecture Familiarity with CI/CD pipelines Knowledge of Infrastructure-as-Code (Terraform, Ansible) Experience with monitoring/logging tools Version control (Git) What You’ll Gain: Hands-on experience with GenAI-powered enterprise SaaS platforms Exposure to LLM-integrated applications in production environments Real-world experience in SRE + AI-driven operations Mentorship from experts in cloud, SRE, and AI engineering Opportunity to transition into a full-time role based on performance Walkin Drive Detail Walk In Drive Date: Friday April 17th, 2026 Time: 10:000 AM to 4:00 PM Venue: Zycus Infotech Pvt Ltd. Plot No GJ-07, SEEPZ++, SEEPZ, MIDC, Andheri East, Mumbai MH 400096. Note: 1. Candidates need to apply for the job online before the Walk-in (https://zycus.talismatic.com/jobs/JE5VTQZV2) 2. Carry your resume, 1 color passport size photograph and Aadhar Card copy along with the original 3. Our office is in a high-security zone, and you will need a gatepass therefore candidates are requested to email below documents in advance for gate pass to "facilities@zycus.com" and "seepz.consultant@zycus.com" Closet Entry: SEEPZ, Gate no -3 , Reception – Contact: 022 - 66407676 Documents for gate pass 1. Copy of Aadhar Card 2. Passport Size Photographs 3. Contact Number 4. email Address