Primary Skills
Cloud Architecture (IBM Cloud VPC, Networking, Security), Infrastructure as Code (Terraform + Automation), Programming / Scripting (Python / Go / TypeScript), Kubernetes & Container Orchestration, Platform Engineering & GitOps (ArgoCD, Istio), Observability & Monitoring (Prometheus, Grafana, Dynatrace), Platform Resiliency & SRE Practices (HA, Scaling, Reliability), Networking & Troubleshooting (Traffic analysis, diagnostics), System Design & Architecture Thinking
Job Description
Job Requirements:
- Strong understanding of cloud computing principles
- IBM Cloud Expertise
- Programming / IaC Tools
- Platform & Observability Tools
Key Responsibilities
- :Develop, test, and document technical solutions (code, scripts, processes) as per organizational standard
- sDeliver high-quality engineering outputs and mentor junior engineers on technical best practice
- sSolve complex technical problems and build reusable components/libraries with wider impac
- tDesign integrated, scalable, and reliable systems aligned with organizational best practice
- sAnticipate and mitigate scaling, latency, and durability challenge
- sDrive platform resiliency improvements across system
- sConduct root cause analysis (RCA) for system issues and drive corrective action
- sIdentify improvement areas and implement enhancements in team delivery practice
- sIntegrate security best practices early in system design in collaboration with security team
- sEvaluate technical risks and guide teams on mitigation strategie
s