We are seeking a Technical Lead with extensive experience in AWS, Linux, and Windows environments, as well as expertise in troubleshooting processes across a large number of servers. The ideal candidate will have a strong background in web services (Apache, Nginx) and will be responsible for leading a managed services team, focusing on routine issues, patch management, and system reliability.
Key Responsibilities
Troubleshoot and resolve issues in AWS, Linux, and Windows environments, ensuring high availability and performance.
Lead incident response efforts, identifying root causes and implementing long-term solutions to prevent recurrence.
Oversee the deployment and management of web services using Apache and Nginx.
Implement and run patch management processes to keep systems up to date and secure.
Assign tasks and coordinate team activities to ensure timely project delivery.
Collaborate with offshore and onsite teams to align on project goals and objectives.
Provide technical guidance and mentorship to team members, fostering an environment of continuous learning and improvement.
Conduct code reviews and ensure adherence to best practices and coding standards.
Participate in the architectural design and planning of new features and systems.
Maintain documentation related to processes, architectures, and workflows.
Drive Site Reliability Engineering (SRE) practices, focusing on system performance, reliability, and monitoring.
Qualifications
Bachelor’s degree in Computer Science, Engineering, or a related field.
3-4 years of experience in a leadership role, managing medium- to large-scale projects.
Strong troubleshooting experience with AWS, Linux, and Windows systems.
Proficient in web services management using Apache and Nginx.
Excellent communication and interpersonal skills, with the ability to work effectively with diverse teams.
Experience coordinating between offshore and onsite teams is highly desirable.
Strong organizational skills and the ability to manage multiple tasks simultaneously.
Preferred Skills
Knowledge of CI/CD practices and tools.
Familiarity with AWS Compute services and architecture.
Experience with monitoring and logging tools for system reliability.
Understanding of SRE principles and practices, including SLAs, SLOs, and error