About Brillio:
Brillio is the partner of choice for many Fortune 1000 companies seeking to turn disruption into a competitive advantage through innovative digital adoption. Backed by Bain Capital private equity, and growing at nearly 60% YoY since its inception, Brillio is one of the fastest growing digital technology service providers. We help clients harness the transformative potential of the four superpowers of technology – cloud computing, internet of things (IoT), artificial intelligence (AI), and mobility. Born digital in 2014, we apply Customer Experience Solutions, Data Analytics and AI, Digital Infrastructure and Security, and Platform and Product Engineering expertise to help clients quickly innovate for growth, create digital products, build service platforms, and drive smarter, data-driven performance.
With delivery locations across the United States, Romania, Canada, Mexico, and India, our growing global workforce of over 6,000 Brillians blends the latest technology and design thinking with digital fluency to solve complex business problems and drive competitive differentiation for our clients. Brillio was awarded ‘Great Place To Work’ in 2021 and 2022. Learn more www.Brillio.com.
Role Summary
We are seeking a highly experienced SRE & DevOps Architect to lead and scale our DevOps & Reliability Center of Excellence (CoE). This role will define enterprise-wide DevOps, SRE, and platform engineering standards, drive reliability at scale, and partner with Engineering, Cloud, Security, and Business teams to enable high-performing, resilient, and cost-efficient platforms.
The ideal candidate brings deep technical expertise, strong architectural thinking, and proven CoE leadership across tools, processes, governance, and enablement.
Experience
- Experience: 15–20 years, including 6+ years in Architecture and DevOps/SRE CoE leadership roles.
- RFP/RFI Expertise: Strong experience in leading and contributing to RFP/RFI responses, solutioning, and proposal development.
- P&L Management: Proven ability to manage P&L, drive revenue growth, optimize margins, and ensure financial accountability.
Key Responsibilities
🔹 Architecture & Strategy
- Define enterprise DevOps & SRE architecture, reference frameworks, and best practices.
- Design scalable, highly available, fault-tolerant platforms across cloud-native and hybrid environments.
- Establish reliability engineering principles including SLOs, SLIs, error budgets, and capacity planning.
- Lead adoption of platform engineering and Internal Developer Platforms (IDP).
🔹 DevOps & SRE CoE Leadership
- Build and operate the DevOps/SRE Center of Excellence.
- Define CoE operating model, governance, maturity models, and success metrics.
- Standardize CI/CD pipelines, IaC, observability, security, and release practices.
- Act as an internal consultant for product and engineering teams.
🔹 Cloud & Infrastructure
- Architect and govern solutions across AWS / Azure / GCP.
- Drive Infrastructure as Code using Terraform, CloudFormation, ARM, or equivalent.
- Enable container platforms using Kubernetes, OpenShift, and service mesh technologies.
- Establish cloud cost optimization and FinOps practices.
🔹 Observability & Reliability
- Architect enterprise observability using Prometheus, Grafana, ELK, Splunk, OpenTelemetry, Datadog, etc.
- Drive proactive monitoring, alerting, incident response, and root-cause analysis.
- Lead blameless postmortems and continuous reliability improvement initiatives.
🔹 CI/CD & Automation
- Define and standardize CI/CD platforms using tools like GitHub Actions, GitLab, Jenkins, Azure DevOps, Argo CD.
- Champion GitOps, pipeline-as-code, and automated quality gates.
- Integrate security (DevSecOps) into pipelines.
🔹 Security & Compliance
- Embed DevSecOps practices across SDLC.
- Work with security teams to enable secrets management, vulnerability scanning, and policy-as-code.
- Ensure compliance with enterprise and regulatory requirements.
🔹 Stakeholder & Leadership Engagement
- Partner with Engineering Heads, Cloud CoE, Security, and Enterprise Architecture teams.
- Mentor DevOps and SRE engineers; build upskilling programs and communities of practice.
- Influence leadership through metrics, ROI, and business outcomes.
Required Skills & Experience
- Strong experience in SRE, DevOps, Cloud Architecture, and Platform Engineering.
- Containers & Orchestration: Docker, Kubernetes, Helm.
- CI/CD: Jenkins, GitHub Actions, GitLab, Azure DevOps, Argo CD.
- Cloud Platforms: AWS / Azure / GCP.
- IaC & Config Management: Terraform, Ansible, CloudFormation.
- Observability: Prometheus, Grafana, ELK, Open Telemetry.
- Scripting: Python, Shell, Go (preferred).
- Security & DevSecOps toolchains.
Architecture & CoE Experience
- Proven experience building and leading DevOps/SRE CoE.
- Defining standards, reusable assets, and reference implementations.
- Large-scale enterprise transformation experience.
- Multi-team and multi-geography delivery exposure.
Soft Skills
- Strong stakeholder management and communication skills.
- Influencing without authority.
- Mentorship and thought leadership mindset.
- Ability to balance governance with developer velocity.
Certifications (Preferred)
- AWS / Azure / GCP Architect Professional.
- CKA / CKAD / CKS.
- SRE or DevOps certifications.