JOB DUTIES & RESPONSIBILITIES:
• Diagnose and resolve software application issues. Monitor systems for potential issues and address them proactively before they become major problems.
• Participate in a 24/7 on-call rotation, respond to alerts promptly, and escalate issues as needed. Provide off-hours support as needed.
• Oversee and coordinate code deployment activities across several product lines – both internal and customer-facing
• Directly manage a team of several TechOps Engineers
• Coordinate incident response and support for production environments
• Own and administer CI/CD infrastructure and tools (Jenkins servers, code repositories, artifact storage, SonarQube, CodeQL)
• Be a point of contact for several engineering teams; identify impediments and build cross-functional collaboration to resolve them.
• Support Engineering Teams in troubleshooting software builds and environmental issues.
• Identify and implement environment configuration changes and optimizations.
• Manage work intake for the DevOps team; clarify and prioritize tasks
• Provide guidance and mentoring for the team
• Lead by example: perform hands-on tasks as needed (code deployments, troubleshooting, configuration changes)
WORK EXPERIENCE AND EDUCATION REQUIREMENTS:
- Master of Science in a relevant field such as Computer Science, Statistics, Mathematics, or Engineering.
- 10+ years of experience in supporting production systems and deployments.
- 3+ years of management experience.
- Strong knowledge of Kubernetes, RabbitMQ, Kafka, Rancher, GitHub, Jenkins, Ansible, Linux, and Windows.
- Proficiency in Shell/PowerShell scripting.
- Excellent communication skills, with the ability to explain complex concepts to non-technical stakeholders.
- Strong problem-solving skills and attention to detail.
- Strong experience with agile development methodologies and version control tools such as Git.
PREFERRED QUALIFICATIONS:
- Experience working with Trino.
- Knowledge of cloud computing platforms such as AWS, GCP, or Azure.