What will you be responsible for?
As a Cloud Platform Engineer, you will be responsible for administering our Azure cloud platform to sustain the availability and operational efficiency of the environment. We host critical applications on Azure infrastructure using multiple Azure services, e.g., Azure SQL, ADF, AKS, and Azure VM, so platform availability and reliability are of the utmost priority. You will be responsible for the availability, reliability, and performance of the Azure cloud infrastructure.
As a Sr. Cloud Engineer, you will:
- Play a key role in Azure infrastructure administration, ensuring a highly available and stable environment with sustained performance.
- Apply your Azure administration experience across services such as AKS, Azure SQL, Azure VM, and ADF, including understanding network configuration, following best practices for deploying services, automating with an Infrastructure as Code (IaC) approach, and troubleshooting production issues.
- Install, configure, and maintain Azure infrastructure services using the Azure Portal/Console, and learn the Providence cloud application to deploy resources.
- Use Azure DevOps, Terraform, and ARM templates to automate Azure infrastructure installation and deployment.
- Manage on-call rotations across geo-locations, using a follow-the-sun model.
- Apply sound troubleshooting skills and participate in severity (SEV) incidents and CODE RED calls.
- Closely monitor Azure resource utilization and the cost of each resource.
- Use telemetry solutions, e.g., Azure Insights, Azure Log Analytics, and Datadog, to set up appropriate monitoring and alerting.
- Continuously support the team's advanced data analytics and intelligence demands, working with specific Azure services such as Azure Databricks, Azure AI Services, APIM, and OpenAI.
- Bring cloud computing knowledge; Microsoft Azure is preferred.
- Collaborate extensively with Product teams to help them run reports successfully and to resolve any issues with AKS, Azure VM, Azure Storage Accounts, ADF, Azure SQL, and other Azure infrastructure services.
- Respond immediately to alerts and to SEV and CODE RED incidents.
What would your day look like?
- Monitor and address all incidents and user requests associated with Azure infrastructure and its services.
- Discuss application architecture with Product teams, provide solutions, and address operational and performance issues.
- Collaborate with the Enterprise Infrastructure team, Network team, and Infosec/CYBR team to implement enterprise-level policies and remediate any security violations.
- Work with MSFT support on severity issues and escalations to achieve resolution.
Who are we looking for?
- Bachelor's degree or equivalent in Engineering.
- 5+ years of experience in cloud infrastructure administration, with at least 3 years in Azure administration.
- Strong knowledge of Azure administration concepts, including installation, deployment, and configuration of Azure infrastructure and network services, e.g., AKS, Azure SQL, Azure VM, ADF, Storage Account, and Key Vault.
- Experience deploying Infrastructure as Code using Terraform or Azure ARM templates.
- Experience with Azure DevOps, CI/CD, and configuration as code using Ansible playbooks.
- Working experience with system reliability: designing, implementing, and maintaining systems to ensure high availability and reliability of services and the platform.
- Experience with Azure Databricks, Azure AI Services, APIM, OpenAI, etc. is preferred.
- Experience developing and managing monitoring, dashboards, and alerts to proactively identify and address issues using Datadog or Azure telemetry tools.
- Experience participating in incident management, root cause analysis, and optimization of application health.
- Experience with source code control systems such as Git/GitHub & ADO.
- Experience with application deployment to containers or Kubernetes Services via code.
- Database integration development experience using SQL/NoSQL.
- Experience with agile methodologies and tools such as Azure DevOps, TFS, and Jira.
- Proven track record of working both independently and collaboratively as part of a multi-disciplined team.
- Experience implementing and integrating with IaaS, PaaS, and SaaS data platforms and other cloud infrastructure services such as Azure AD.
- Strong critical thinking skills, and the ability to think on your feet.
- Ability to adapt quickly and maintain a positive attitude.
- Excellent verbal and written communication skills.
- Ability to take ownership of issues, work independently or escalate as needed, and find creative ways to resolve problems.
- Good collaborative skills for working with local and global teams; a strong team player who helps create a one-team, one-company culture.