About Company
Our client is a global technology consulting and digital solutions company that enables enterprises to reimagine business models and accelerate innovation through digital technologies. Powered by more than 84,000 entrepreneurial professionals across more than 30 countries, it caters to over 700 clients with its extensive domain and technology expertise to help drive superior competitive differentiation, customer experiences, and business outcomes.
Job Description
Supporting a reliable application suite for the environment in order to meet the development and maintenance requirements of systemsplatforms
Working as part of the development team to evaluate the health stability and reliability of applications
Utilizing monitoring s dashboards and management tools to ensure the availability reliability and performance of applications and services
Constantly working to improve and implement automation of applications tasks
Providing technical support for systemsplatforms according to application SLAs
Responsible for developing resiliency in the application code troubleshooting incidents engaging with squads to address failure patterns and participating in incident management
Responsibilities
6 or more years of handson experience as a Site Reliability Engineer or related technical engineering capacity
Knowledge of open systems and platforms ability to work with open systems architecture open formats underlying standards and development and support of software for open environments
Experience handling large numbers of diverse systems with configuration management systems like Puppet Chef Ansible
Knowledge of software engineering ability to deliver new or enhanced feebased software products
Proficient in one or more of the following scripting languages JavaScript Nodejs Python Ansible Bash etc
Knowledge of agile methodologies and the agile development lifecycle ability to use formal agile methodologies disciplines practices and techniques for the delivery of new and enhanced applications
Knowledge of production applications ability to monitor application functions and resolve issues to maintain optimal conditions for system applications
Strong experience with monitoring and ing systems like Prometheus Grafana Datadog
Knowledge of concepts values and tools applied in building Continuous Integration CI Continuous Delivery and Continuous Deployment CD pipeline ability to design build implement and maintain CICD pipelines to achieve the automation of software delivery process
Experience engineering software within an Amazon Web Services AWS cloud infrastructure
Experience in containerized workloads and management platforms such as Docker or Kubernetes
Understanding of standard networking protocols and components such as HTTP DNS ECMP TCPIP ICMP the OSI Model Subnetting and Load Balancing strategies
Knowledge of the theories and methodologies of reliability engineering ability to design develop and support various tools services and applications to maintain a reliable site environment
Embraces a diverse set of people thinking and styles
Consistently makes safety and security of self and others the priority
High School diploma GED or High School Equivalency
What Will Give You a Competitive Edge Preferred Qualifications
Bachelors Degree in Computer Science Information Systems or related technical field
Skills
DEVOPS-SITE-RELIABILITY-ENGINEERING
Job Location: Pan India
Work Experience: 6 to 12 Years
Work Mode: Hybrid
Employment Type: Full Time
Notice period : 0 to 30 Days