DevOps Engineer
Cloud & Infrastructure
We are looking for someone who takes charge when others log off. Someone who doesn't wait for instructions. Someone who sees a problem, decides, and fixes it.
At Techdome, DevOps is not a support function. It is ownership. We build systems that don't just work; they stay working. Quietly. Reliably. At scale.
WHAT YOU'LL OWN
This is not a checklist role. This is a decision-making role. You will own production end to end.
- Run, monitor, and harden live cloud infrastructure across production environments. This is your environment, not someone else's
- Own CI/CD pipelines end to end: build, maintain, improve, and be the last line of defence before code hits prod
- Execute Blue/Green deployments with precision: zero-downtime releases, traffic switching, and instant rollback when things go sideways
- Be the bridge between Developers and QA: catch what they miss, enforce standards, and make sure only production-ready code ships
- Lead incident response with clarity and speed: diagnose fast, communicate clearly, restore first, root-cause second
- Drive observability: alerting, dashboards, SLOs, and proactive system-health checks that catch problems before users feel the impact
- Document shifts, handoffs, and incidents so the day team inherits clarity, not chaos
WHAT WE EXPECT
Ownership: You don't escalate problems; you solve them.
Accountability: Production is yours during your shift. No excuses.
Logical Thinking: You break down problems fast and act with clarity.
Independence: You don't need supervision. You need responsibility.
Calm Under Pressure: When systems fail, you don't panic. You execute.
Communication: Async-first. Shift logs are thorough. Handoffs leave no surprises.
WHAT YOU BRING
Technical Requirements
- 2–5 years in DevOps, SRE, or Infrastructure engineering in real production environments, not sandboxes
- Strong hands-on experience with Azure: VNets, NSGs, App Services, ACI, private endpoints, Cost Management
- Deep ownership of CI/CD pipelines (GitHub Actions, Jenkins, or equivalent): pipelines you built and maintained, not just used
- Proven Blue/Green deployment experience: traffic routing, canary releases, instant rollback, zero-downtime releases
- Production-grade Terraform: state management, modules, drift handling, remote backends
- Real experience with monitoring and alerting: Prometheus, Grafana, Datadog, Azure Monitor, or ELK
- Scripting fluency in Python, Bash, or PowerShell: not tutorials, real production scripts
- Docker in production: image optimization, multi-stage builds, container debugging
- Strong debugging instincts across infrastructure and application layers, including catching issues that slipped past Dev and QA
Strong Advantage If You Have
- Kubernetes production experience: Deployments, HPA, CrashLoopBackOff debugging, real incidents
- Built a monitoring stack from scratch and designed an alerting philosophy around it
- Managed real production incidents end-to-end, with post-mortems to show for it
- Worked night shifts or PST-offset roles with structured async handoffs
- Thrived in fast-paced startup or scale-up environments
Domain Experience — Highly Appreciated
Our systems operate in environments where precision and compliance are non-negotiable. Experience in either of these domains gives you an immediate edge.
- Healthcare — HIPAA-aware infrastructure, HL7/FHIR integrations, EMR/EHR systems, audit trails, and patient data security practices
- Payments & Fintech — PCI-DSS compliant environments, high-availability transaction systems, fraud detection pipelines, and financial data sensitivity
WHY TECHDOME
Real Ownership: You won't "assist." You will own production systems that matter.
Real Systems: Production-scale challenges, real stakes, no sandbox.
Real Growth: You learn by doing. You improve by solving hard problems under pressure.
Real Impact: What you ship and fix is felt immediately — by real users, in real time.
OUR INTERVIEW PROCESS
As fast as we are!
Step 1: AI Interview on our in-house platform, JustInterview.ai
Step 2: 1:1 discussion with Leadership
Get shortlisted, meet the decision-makers, and you're done.