Location: NYC (3x per week on site)
Duration: Permanent
Salary: $150-200k/year
Must Haves:
- 5+ years' experience as a site reliability engineer or similar.
- Strong coding skills in Python and PowerShell and the ability to comprehend C# and write basic applications in the language.
- Experience with Windows and Linux based operating systems, Cloud-based services and infrastructure (AWS), Infrastructure as Code (Terraform) and Configuration Management (Ansible)
- Experience with container technologies: Docker, Kubernetes, AWS EKS and ECS
- Self-motivated individual with great communication and interpersonal skills, and a very strong sense of ownership
Day to Day:
For $150-200k/year, as an Site Reliability Engineer, ensuring the uptime and reliability of these systems is crucial for our Global Macro business operations and trading strategies. You will optimize existing systems and infrastructure through strict adherence to automation and tooling. Your day-to-day will involve a mix of tasks, from building and maintaining the technical backbone of our SRE program to working closely with our development and quant teams. You'll ensure that any changes we make align with our service level objectives (SLOs), keeping everything running efficiently. You'll also keep an eye on system performance and capacity, spotting and fixing potential issues before they become problems. Reviewing and providing feedback on automation code will be part of your routine, helping us maintain high standards. When issues do arise, you'll troubleshoot and resolve them, analyzing their impact on our infrastructure and services. Plus, you'll participate in design reviews, helping us choose the best technologies and strategies to keep improving. It's a dynamic role where your contributions will directly impact our business operations and trading strategies.