Job Title: IT Operations Manager-AWS
Location: Bangalore
Job Type: Full-Time
Salary: 10-16 LPA
Job Summary
We are seeking an experienced AWS Operations Manager to lead our Network Operations Center (NOC) and Security Operations Center (SOC) teams. This role will focus on developing and implementing cloud operations strategies that align with business objectives, enhancing service reliability, and optimizing cloud resources.
Required Qualifications
- 7+ years of IT experience, including 3+ years specifically in cloud operations
(preferably AWS).
- 3+ years of experience in production operations for globally distributed cloud
infrastructure.
- Proven leadership experience in managing technical teams and projects.
- Hands-on experience with AWS services (VPC, EC2, EBS, RDS, ALB, ASG, IAM, S3,
etc.) and Linux.
- Strong conceptual knowledge of DevOps practices, CI/CD pipelines, and
Infrastructure as Code (Terraform, CloudFormation, etc.).
- Familiarity with monitoring and managing complex containerized production
environments.
- Familiarity with application and infrastructure monitoring and log management tools
(CloudWatch, Prometheus, ELK, New Relic, Datadog, Splunk, Grafana, etc.).
- Familiarity with automating routine operational tasks.
Preferred / Good-to-Have Qualifications
- AWS certifications (Solutions Architect, DevOps Engineer, etc.) are highly desirable.
- Experience in regulated industries and managing high-availability systems.
- Multi-cloud and hybrid cloud experience is a plus.
- ITIL certification.
Key Responsibilities
Leadership & Team Management
- Lead and mentor DevOps, Site Reliability Engineering (SRE), and Quality Assurance
(QA) team leads.
- Foster a culture of collaboration and continuous improvement within cloud
operations teams.
Cloud Operations Strategy
- Develop and implement cloud operations strategies aligned with business
objectives.
- Drive continuous improvement in cloud infrastructure, automation, and CI/CD
processes.
Incident Management
- Own incident management and resolution processes, ensuring timely response and effective resolution.
- Collaborate with customers and internal teams to enhance service reliability and
performance.
Security & Compliance
- Ensure adherence to security and compliance standards across all cloud
operations.
Cost Optimization
- Optimize cloud costs and resource utilization through effective management and
strategic planning.
Performance Tracking
- Establish and track key performance indicators (KPIs) for cloud operations,
providing regular updates to stakeholders.
- Stay current with industry trends and emerging technologies to inform strategic
decisions.
Stakeholder Management
- Serve as a key point of contact for cloud performance and operational metrics,
building strong vendor relationships and providing regular updates to leadership.
Skills: terraform, security compliance, datadog, elk, security and compliance, web services, linux, incident management, team coordination, monitoring and logging tools, cost optimization, grafana, cloudwatch, patch management, stakeholder management, management, cloudformation, troubleshooting, monitoring tools, log management, technical guidance, nginx, web, ci/cd, windows, apache, devops, ci/cd practices, cloud operations, infrastructure as code, splunk, ci/cd pipelines, prometheus, sre principles, devops practices, new relic, aws, site reliability engineering (sre), aws services, web services management