You’ll be working in a team responsible for the development, configuration, optimization, and support of the product's cloud infrastructure and the services deployed to customers. As the DevOps Engineer, you will review the server environment and carry out initiatives to reduce failures and improve overall performance. You will work with the engineering team to ensure its systems scale, increase the instrumentation and automation of the tests that validate its environments, and coordinate and resolve issues across the entire stack, including hardware, software, application, server, and network.
Job Responsibilities:
- Automate build, test, and deployment pipelines by orchestrating automated deployment processes for backend services. Conduct reviews to make sure the work is well designed, elegantly coded, and effectively tuned, and assess the overall quality of delivered components.
- Implement, deploy, and maintain high-availability systems with features such as auto-scaling, load balancing, and data backup and restoration. Install alerting and monitoring on services and infrastructure to protect data and system integrity, health, and security.
- Administer and maintain database servers, including log management.
- Implement and analyze key performance indicators at both the application and infrastructure levels.
- Collaborate with the team on technical considerations and the feasibility of features.
- Drive changes in the product to promote stability, security, performance, and modularity.
- Ensure adoption of new and standard technologies through research and implementation.
- Automate the provisioning of environments on hybrid infrastructure, using configuration tools like Ansible or container platforms like Docker.
- Share knowledge across teams to influence team members and broader engineering groups to adopt DevOps best practices.
- Troubleshoot problems, involving the appropriate resources and driving issues to resolution with a focus on minimizing the impact on running applications.
Essential Requirements (Required):
- Experience with cloud platforms such as AWS, Azure, or Google Cloud, including infrastructure provisioning and management.
- Proficiency with automation tools like Ansible, Terraform, Puppet, or Chef for provisioning and configuration management.
- Hands-on experience with containerization technologies such as Docker and orchestration platforms like Kubernetes.
- Strong understanding of CI/CD pipelines, including experience with tools like Jenkins, GitLab CI/CD, or CircleCI.
- Knowledge of monitoring and alerting systems, such as Prometheus, Grafana, Nagios, or the ELK stack, to ensure system health and security.
- Experience with scripting languages such as Bash, Python, or PowerShell for automation and integration tasks.
- Expertise in Linux/Unix system administration, with a focus on troubleshooting, performance tuning, and log management.
- Familiarity with database administration, including backups, restoration, and performance optimization for databases like MySQL, PostgreSQL, or MongoDB.
- Strong analytical and problem-solving skills, with the ability to diagnose issues across the entire stack (hardware, software, application, server, and network).
Preferred Qualifications (Nice-to-Have):
- Experience with hybrid cloud infrastructure setups, including integration of on-premises and cloud resources.
- Familiarity with security practices such as firewalls, VPNs, and identity management.
- Knowledge of Agile development practices and collaboration in cross-functional teams.
- Experience with version control systems like Git, and best practices for branching and merging workflows.
- Background in scaling high-availability systems with features like auto-scaling, load balancing, and disaster recovery mechanisms.
- Awareness of DevOps cultural practices, including team collaboration, feedback loops, and iterative development.