We are seeking a Cloud Operations Engineer to serve as the subject matter expert for enterprise cloud operations, Azure infrastructure, automation, and emerging AI cloud initiatives. This individual will support day-to-day cloud operations while acting as a strategic liaison between Cloud Operations and the Research Department for AI-driven and grant-supported projects.
This role combines traditional cloud engineering responsibilities with hands-on support for Microsoft AI Foundry, chatbot deployments, AI model deployment and monitoring, and secure cloud infrastructure operations.
- No C2C candidates
- Candidates must currently reside in the Atlanta, GA area
- No relocation assistance available
Top Priorities
- Strong Azure cloud engineering and operational support experience
- Advanced Terraform and Infrastructure as Code (IaC) expertise
- Working knowledge of Microsoft AI Foundry and AI-enabled cloud services
- Experience supporting AI, chatbot, or machine learning initiatives
- Deep understanding of Azure networking, security, governance, and compliance
- Strong automation, monitoring, troubleshooting, and production support skills
Core Responsibilities
- Serve as the primary cloud operations resource for AI and research initiatives
- Partner with research teams supporting grant-funded cloud and AI projects
- Configure, deploy, monitor, and maintain AI models and chatbot platforms within Microsoft AI Foundry
- Support secure cloud infrastructure deployments aligned with enterprise governance standards
- Act as the technical liaison between Research and Cloud Operations teams
- Manage infrastructure deployment lifecycle activities across development, test, and production environments
- Support operational monitoring, troubleshooting, incident response, root cause analysis, and post-deployment validation
- Implement automation, CI/CD pipelines, reusable Terraform modules, and cloud governance controls
- Maintain secure resource provisioning patterns that enforce private/internal-only access
- Support change control, release management, rollback planning, and operational readiness processes
Required Experience
- 5+ years of experience in cloud operations, infrastructure engineering, or technical support
- 1+ year managing enterprise cloud infrastructure environments
- Strong Azure experience preferred (AWS/GCP experience acceptable)
- Hands-on experience with Terraform, PowerShell, Bash, Azure CLI, Bicep, or Ansible
- Experience supporting virtualization technologies, including VMware, Hyper-V, or Azure Stack HCI
- Working knowledge of ITIL operational frameworks
- Strong experience with:
- Azure networking (VNets, vWAN, VPNs, ExpressRoute, private endpoints, gateways)
- Azure security and governance tools, including Key Vault, Security Center, Purview, Firewall, RBAC, Managed Identities, PIM, and Azure Policy
- CI/CD automation and Git workflows using Azure DevOps, GitHub Actions, Helm, or Argo CD
- Monitoring and observability solutions such as Datadog, Grafana, Prometheus, Azure Monitor, or Log Analytics
- Kubernetes, Docker, AKS, and cloud-native deployment technologies
- Active Directory, Entra ID, Linux, and Windows Server administration
- High-availability, multi-region, disaster recovery, and business continuity strategies
- Event-driven and telemetry platforms such as Event Hub, Kafka, Redis, or Snowflake
- Enterprise troubleshooting, incident management, and operational support in 24x7 production environments
Preferred Qualifications
- Microsoft AZ-900 certification
- Experience with Azure AI Foundry
- Exposure to AI/ML technologies, TensorFlow, cognitive computing, or document intelligence solutions
- Experience supporting scalable microservices and distributed systems architectures
- ServiceNow/CMDB experience
- Experience implementing vulnerability management, dependency scanning, and release validation controls
- Experience collaborating across engineering, infrastructure, security, operations, and research teams
- Experience leading technical workshops, knowledge-sharing sessions, or cloud enablement initiatives
Ideal Candidate Profile
The ideal candidate will have strong ownership of enterprise Azure cloud environments, deep experience with Terraform and cloud automation, expertise supporting Kubernetes-based production systems, and the ability to support emerging AI platform initiatives while maintaining enterprise-grade security, scalability, governance, and operational excellence.
TM Floyd & Company is an equal opportunity employer and values diversity. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability.
We offer a generous array of benefits, depending on the length of assignment. We also offer a referral bonus of up to $1,000. Ask us for more details!
TM Floyd & Company participates in E-VERIFY.
AAP, EEO
26-00084