Cloud DevOps / Site Reliability Engineer
NYC | Hybrid 4 days | Mid–Senior
We're a fast-growing fintech platform, and we need someone to own the infrastructure underneath it: cloud architecture, CI/CD, container orchestration, observability, and standards enforcement.
This isn't a ticket-taker role. We need a builder and enforcer who understands how things work at depth, not just how to configure them.
You should be able to speak to:
- How Docker isolation actually works at the kernel level (namespaces for isolation, cgroups for resource limits)
- CI/CD pipelines built as reusable systems, not one-offs — full code-to-cloud ownership
- Enforcing standards in the pipeline: linters, security scanning, policy-as-code
- Cloud architecture decisions you can justify on performance, security, cost, and ROI
- Observability as a discipline you've enforced, not just implemented
Strong signal: if you've implemented Open Policy Agent (OPA) in a production pipeline, or you describe OpenTelemetry as a specification rather than a tool, say so. We'll want to talk immediately.
Required: 5+ years in infrastructure, DevOps, or SRE roles; AWS, Terraform, Python, and Linux; experience running distributed systems in production.
AI workload infrastructure is on our roadmap. If that interests you, there's room to grow into it.
Tools and OS are secondary. Depth of thinking is not.