Synechron intends to extend its production environment, currently built on Azure Kubernetes Service (AKS) and Postgres as a Service, to support batch operations in addition to existing live traffic services. These batch operations are running flawlessly currently in Non-Prod environment.
Key considerations:
- Ensure proper isolation between batch and live workloads through dedicated Postgres instances and node pools.
- Implement deployment enhancements for simplified node pool dedication between live and batch operations.
- We use Jsonnet configuration code and convert it into Kubernetes YAML manifests.
- Adapt and configure Airflow python DAGs for batch processing orchestration.
- Collaborate closely with the internal team to ensure a smooth transition of sensitive processes.
Team Responsibility
The selected engineers will be responsible for:
- Maintaining and scaling Kubernetes clusters in Azure.
- Ensuring proper (no shared VM) isolation between live traffic and batch operations.
- Supporting deployment improvements for reliability and scalability.
- Coordinating with engineering teams to maintain production stability.
- Enhancing batch orchestration workflows in Airflow.
Key Responsibilities of the selected candidate
- Maintain Kubernetes clusters on AKS.
- Ensure isolation between live traffic services and batch workloads via dedicated node pools and Postgres instances.
- Extend Jsonnet code responsible for Kubernetes YAML manifest generation for deployment consistency.
- Implement deployment enhancements for easier node pool dedication and improved reliability.
- Adapt and enhance Airflow DAGs to support batch processing in production.
- Collaborate with internal teams for smooth process transitions.
- Support data synchronization between Non-Prod and Prod environments.
- Participate in release management, including dry-runs, parallel runs, and controlled cutovers.
- Troubleshoot and resolve issues to ensure stability of ESG Climate workloads.
Skills and experience:
- 8+ years of experience in Cloud Infrastructure / DevOps Engineering.
- 5+ years of hands-on experience with Kubernetes (AKS preferred) — covering cluster setup, scaling, networking, RBAC, and workload isolation.
- Strong expertise in Azure services: AKS, Networking (VNet, NSG, Subnets), Storage Accounts, Azure Container Registry (ACR), Key Vault, PostgreSQL as a Service.
- Proven experience in creating and maintaining deployment manifests.
- Experience in workflow orchestration with Airflow, including adapting DAGs for batch processing.
- Hands-on experience with Infrastructure as Code (Terraform)
- Strong knowledge of CI/CD pipelines and deployment automation.
Must to have skillsets:
- Strong hands-on experience with Kubernetes (deployment, scaling, networking, security, troubleshooting).
- Proven expertise with Azure Cloud services, especially AKS, managed databases, and networking.
Good to have skillsets
- Airflow (workflow orchestration) experience.
- Terraform (infrastructure as code) exposure.
- Postgres (database management) experience.
- Familiarity with Jsonnet templating.
Soft skills:
- Strong problem-solving and troubleshooting abilities.
- Ability to work independently and collaboratively in distributed teams.
- Clear communication and documentation skills.
- Flexibility and adaptability in dynamic environments.