Duties
You will be working with our SREs and Product Engineers to provide relaibility services and help improve our Tooling platforms building on automated code development and continuos integration/continuos deployment principles. You will contribute and learn multiple technology skills (cloud, SCM,CI/CD, Network, Automation, Monitoring/logging) and will be responsible to provide semaless experience to our user community by improving relaibility, security and automation of our Tooling platforms.
The job reauires support and on-call work as well to handle incidents/changes and problem management
Skills
Senior Operations Engineer I (SRE)
Job Description –
Total years of experience – Min 5 to 9
Cloud Platforms: AWS (Extensive Hands-on experience)
Good understanding of key services like CloudFormation, KMS, S3, EC2, CloudWatch, IAM, Code Commit
Secrets management service from AWS and ability to understand other secrets management systems
Ability to analyze logs and troubleshoot issue
Ability to understand costing and help in optimizing the architecture
Tools for Development – Splunk, Dynatrace, GitHub, GA, Jfrog, CircleCI (Admin on tools) – Good to have knowledge (any 2)
Good understanding of GitHub and GitHub Action workflow
Ability to work with docker and image artifactory via Jfrog
Should fully understand and have hands-on experience with SCM and CI/CD principles
Tools for Monitoring and Logging
Have hands on experience with Dynatrace & Cribl
Ability to build indexes and dashboards
Coding and Documentation Skills
Ability to debug any code and make modifications wherever needed - python and terraform.
Good documentation skills and make sure to review and edit documentation to help improve user as well as SRE documentation.
EducationBachelor's degree in Computer Science, Information Technology, or a related field. Relevant certifications such as AWS Certified DevOps Engineer, Microsoft Certified: Azure DevOps Engineer, or Google Cloud Certified - Professional Cloud DevOps Engineer are preferred.