Senior Site Reliability / DevOps Engineer
Location: Hybrid
Department: Platform Engineering / Infrastructure
Overview
We are seeking a Senior Site Reliability / DevOps Engineer to design, implement, and operate modern platform infrastructure that enables fast, reliable software delivery at scale. This role focuses on edge delivery platforms, CI/CD systems, artifact lifecycle management, and AI-assisted developer productivity tooling.
You will help drive our MCP strategy, progressive delivery pipelines, and artifact platform modernization, including initiatives such as Nexus-to-SaaS migration, canary build deployments, and AI-powered developer tooling integration.
This role requires a strong combination of SRE principles, DevOps automation, and platform engineering mindset.
Key Responsibilities
Platform & Reliability Engineering
- Design and maintain highly reliable infrastructure supporting critical services and applications.
- Implement SRE best practices, including SLOs, SLIs, error budgets, and automated incident response.
- Build observability frameworks for proactive monitoring, alerting, and performance optimization.
Edge & Delivery Infrastructure
- Architect and manage edge delivery solutions using Fastly for high-performance application delivery.
- Optimize caching, traffic routing, and security controls at the edge.
- Implement canary and progressive deployment strategies to safely release changes in production environments.
CI/CD & Developer Platforms
- Design and maintain CI/CD pipelines using GitLab and GitHub ecosystem tools.
- Integrate AI-assisted development workflows leveraging tools such as Cursor and GitHub Copilot.
- Improve developer productivity through automation, reusable pipeline templates, and infrastructure self-service.
Artifact & Dependency Management
- Administer and modernize artifact repositories using Sonatype Nexus Repository.
- Lead initiatives for Nexus-to-SaaS migration and artifact lifecycle governance.
- Implement security scanning, dependency control, and artifact promotion workflows.
Platform Modernization
- Define and execute MCP (Modular/Managed Cloud Platform) strategy to standardize infrastructure components.
- Build scalable, reusable platform services for internal engineering teams.
- Drive adoption of infrastructure as code and GitOps practices.
Automation & Infrastructure as Code
- Build automated infrastructure provisioning and configuration management pipelines.
- Develop tooling and scripts to improve operational efficiency and reliability.
- Integrate platform automation into CI/CD workflows.
Required Qualifications
- 6+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering
- Strong experience with CI/CD platforms and pipeline automation
- Hands-on experience with edge delivery platforms (Fastly preferred)
- Experience managing artifact repositories such as Nexus
- Deep understanding of progressive delivery techniques (canary, blue/green, feature flags)
- Experience integrating AI-assisted development tools into engineering workflows
- Strong experience with:
- Infrastructure as Code
- Observability systems
- Cloud infrastructure (AWS, GCP, or Azure)
- Linux-based environments
Preferred Qualifications
- Experience leading artifact platform modernization or SaaS migrations
- Experience implementing GitOps-based delivery models
- Knowledge of software supply chain security
- Experience supporting large-scale microservices or container platforms
- Experience building developer platform tooling and internal developer portals
Key Skills
- Platform Reliability & SRE Practices
- CI/CD Pipeline Engineering
- Edge Delivery & CDN Optimization
- Artifact Repository Management
- Infrastructure as Code & Automation
- Progressive Delivery (Canary, Blue/Green)
- Developer Experience (DevEx) Enablement