Zenith Row has partnered with a fast-growing technology organization to hire a Cloud Infrastructure Engineer (Azure) to support and scale a production Azure environment powering an advanced AI platform.
This role sits within the Cloud Infrastructure and Deployment team and focuses on advanced Azure infrastructure, escalation support, and platform stability. The engineer will operate as a Tier 2 technical escalation point, helping reduce reliance on senior architects while strengthening operational resilience across the platform.
This is an infrastructure-first role, ideal for engineers who prefer deep platform troubleshooting, Azure networking, and infrastructure automation to application development.
In This Role, You Will
- Act as the Tier 2 escalation point for complex Azure infrastructure issues
- Troubleshoot and resolve advanced Azure networking and platform challenges
- Develop and maintain infrastructure patches, updates, and improvements
- Support and maintain the Azure infrastructure running a large-scale AI platform
- Provide technical guidance to administrator-level engineers across the team
- Participate in technical client discussions when deep infrastructure expertise is required
Responsibilities
- Own and troubleshoot Azure platform infrastructure issues across networking, compute, and container services
- Support platform reliability and infrastructure stability
- Maintain automation and scripting frameworks for infrastructure management
- Assist with platform recovery and redeployment scenarios
- Provide technical oversight and escalation support across the infrastructure team
- Contribute to improving deployment processes and operational efficiency
Requirements
- 5+ years of experience in cloud infrastructure, platform engineering, or site reliability
- Strong hands-on experience building and operating Azure infrastructure
- Advanced troubleshooting experience across Azure networking and platform services
- Strong scripting and automation experience, including:
  - PowerShell (required)
  - Bicep (required)
  - Python (nice to have)
- Clear understanding of the full infrastructure lifecycle (design, deployment, operations, troubleshooting)
- Must be an infrastructure-focused engineer, not primarily an application developer
Platform-Specific Experience
- Experience supporting Azure Container Apps, including:
  - Failed or partial revisions
  - Instance outages following problematic updates
  - Recovery and redeployment scenarios
- Ability to navigate rapidly evolving Azure platform services
Tooling & Environment
- Azure-native environment
- Infrastructure tracking and escalation managed through ServiceNow
- Platform built entirely on Azure services supporting AI workloads
Nice to Have
- Strong QA-style issue documentation and reporting
- Experience mentoring or supporting junior engineers
- Exposure to Azure AI services