TechGrove is the Centre of Excellence for Banyan Software, based in Chennai, India. It plays a key role in supporting Banyan’s global businesses through technology, security, and software development. TechGrove brings together India’s deep pool of technical talent with Banyan’s long-term approach to growth, creating a trusted, developer-focused environment where people can do their best work.
Overview
We are seeking a highly experienced and hands-on Principal Cloud Architect & DevOps Engineer, Lead to drive engineering, architecture, and operational excellence for the core AI Application Modernization Factory platform. This is a deeply technical role, critical to designing, building, and delivering the highly scalable, secure, and resilient cloud infrastructure and automated software delivery systems that form the foundation of the factory. The ideal candidate will possess expert-level cloud mastery, a deep-seated commitment to engineering excellence, and a proven ability to translate complex technical requirements into operational realities while adhering to established architectural standards.
Key Responsibilities
- Platform Architecture & Strategy: Own the technical architecture, design, and implementation of the AI Factory's core cloud infrastructure, ensuring it is secure, highly available, and resilient across major cloud providers (primarily AWS & Azure).
- Infrastructure-as-Code (IaC) Leadership: Define and enforce the IaC strategy, making significant hands-on contributions and serving as the technical authority on the use of Terraform to provision and manage large-scale, distributed cloud environments.
- CI/CD Pipeline Mastery: Design, build, and optimize the factory's core CI/CD pipelines (e.g., GitHub Actions, GitLab CI), automating build, test, and deployment processes to achieve maximum engineering velocity and operational efficiency.
- Engineering Excellence & Standards: Serve as a top technical authority, defining and enforcing engineering standards, DevSecOps practices, and quality benchmarks for the platform team, aligning with Twelve-Factor App principles and modern design patterns.
- Security, Observability & Reliability: Implement secure-by-design principles and integrate robust cloud-native observability tooling (monitoring, logging, tracing) to debug, optimize, and maintain the complex distributed systems and data pipelines of the factory.
- Hands-on Problem Solving: Act as the highest technical escalation point for engineering and operational challenges related to the platform. Apply strong analytical skills to resolve complex infrastructure, network, and automation issues with the ability to navigate technical ambiguity.
- Mentorship & Coaching: Lead and mentor senior and junior DevOps Engineers, fostering a culture of technical ownership, continuous improvement, and knowledge sharing in cloud-native and automation best practices.
Required Qualifications & Experience
- Experience: 8+ years of progressive experience in Software Engineering and DevOps, with significant time spent in a Principal Engineer, Staff Engineer, or Architect role focused on platform engineering.
- Expert Cloud Mastery: Expert-level experience with Amazon Web Services (AWS) (e.g., EC2, Lambda, EKS, S3, RDS) and Microsoft Azure (e.g., Container Apps, AKS, Container Storage), plus proven mastery of Infrastructure-as-Code (IaC) using Terraform for large-scale cloud environments.
- DevSecOps & Automation: Extensive hands-on experience setting up and optimizing CI/CD platforms (GitHub Actions, GitLab CI) and embedding DevSecOps practices directly into the development workflow.
- Containerization & Serverless: Deep expertise in container technologies (Docker/Kubernetes) and serverless architectures (AWS Lambda) for building highly scalable and resilient distributed systems.
- Architectural Understanding: Solid understanding of modern architectural patterns (microservices, event-driven architecture, distributed systems), cloud networking and security implementation, and experience implementing security and observability patterns at scale.
- Hands-on Technical Depth: Proven history of making significant code contributions in modern programming languages (e.g., Python, TypeScript, Go, or similar) to build and maintain automation tooling and platform components.
- Communication & Collaboration: Exceptional communication, presentation, and collaboration skills, with a proven ability to define technical vision and manage expectations with both technical and business stakeholders.
Preferred Skills (A Plus)
- Prior experience building and operating a multi-tenant SaaS (Software as a Service) solution at scale.
- Architecture and operational experience implementing compliance frameworks (SOC2, HIPAA, PCI) within a cloud platform environment.
- Experience with advanced data infrastructure and processing pipelines for GenAI-related systems (e.g., Vector Databases, knowledge graphs).
- Direct experience training or mentoring other engineers in platform-level technologies or methodologies.
- Familiarity with advanced cloud security tools such as Wiz, Prisma Cloud, and Checkov.
Beware of Recruitment Scams
We have been made aware of individuals fraudulently posing as members of our Talent Acquisition team and extending fake job offers. These scams may involve requests for personal information or payment for equipment.
Protect yourself by following these steps:
- Verify that all communications from our recruiting team come from an @banyansoftware.com email address.
- Remember, employers will never request payment or banking information during the hiring process.
- If you receive a suspicious message, do not respond. Instead, forward it to careers@banyansoftware.com and/or report it to the platform where you received it.
Your safety and security are important to us. Thank you for staying vigilant.