Key Responsibilities
- Perform Tier-1 monitoring, troubleshooting, and incident response for production systems.
- Work in 24×7 shifts to ensure system uptime, reliability, and SLA adherence.
- Escalate issues to DevOps, QA, FinOps, and Security teams as needed.
- Execute predefined runbooks and document resolutions.
- Maintain clear communication during incidents and handovers.
Requirements
- 1–3 years of experience in IT operations, NOC, or support roles.
- Basic understanding of cloud infrastructure (AWS preferred).
- Strong troubleshooting and communication skills.
- Willingness to work in rotational shifts - Must
- Team player with a proactive attitude.
Preferred Qualifications
- Familiarity with monitoring tools (New Relic, Grafana, CloudWatch).
- Exposure to incident management processes and ITIL basics.
Skills: grafana,cloudwatch,newrelic,aws,troubleshooting,reliability,communication