DevOps Engineer
Cloud & Infrastructure
Some engineers ship code. Some manage systems. The best ensures nothing breaks - even at 3 AM.
At Techdome, DevOps is not a support function. It is ownership. We build systems that don't just work they stay working. Quietly. Reliably. At scale.
We are looking for someone who takes charge when others log off. Someone who doesn't wait for instructions. Someone who sees a problem, decides, and fixes it.
WHAT YOU'LL OWN:
This is not a checklist role. This is a decision-making role.
- You will own production — end to end.
- Run, monitor, and harden live cloud infrastructure across production environments
- Own CI/CD pipelines — build, maintain, and improve them for reliability and speed
- Lead incident response with clarity, speed, and a calm head under pressure
- Drive observability — alerting, dashboards, SLOs, and proactive system health
- Work side by side with engineers to keep systems stable and deployments safe
- Document shifts, handoffs, and incidents so the day team inherits clarity, not chaos
WHAT WE EXPECT:
Ownership — You don't escalate problems — you solve them.
Accountability — Production is yours during your shift. No excuses.
Logical Thinking — You break down problems fast and act with clarity.
Independence — You don't need supervision. You need responsibility.
Calm Under Pressure — When systems fail, you don't panic. You execute.
Communication — Async-first. Shift logs are thorough. Handoffs leave no surprises.
WHAT YOU BRING:
Technical Requirements
- 2–5 years in DevOps, SRE, or Infrastructure engineering
- Strong hands-on experience with Azure — VNets, NSGs, App Services, ACI, private endpoints, Cost Management
- Deep ownership of CI/CD pipelines using GitHub Actions, Jenkins, or equivalent
- Production-grade Terraform — state management, modules, drift handling, remote backends
- Real experience with monitoring and alerting — Prometheus, Grafana, Datadog, Azure Monitor, or ELK
- Scripting fluency in Python, Bash, or PowerShell — not tutorials, real production scripts
- Docker in production — image optimization, multi-stage builds, container debugging
- Strong debugging instincts across infrastructure and application layers
Strong Advantage If You Have
- Kubernetes production experience — deployments, HPA, CrashLoopBackOff, real incidents
- Built a monitoring stack from scratch and designed alerting philosophy around it
- Managed real production incidents end-to-end, with post-mortems to show for it
- Worked night shifts or PST-offset roles with structured async handoffs
- Thrived in fast-paced startup or scale-up environments
THE TEAM YOU'RE JOINING:
We don't hire average. We build A-players.
People who take decisions, not approvals. Who fix problems before they grow. Who think in systems, not tasks. Who care about reliability as much as speed.
▸ Ownership Mindset ▸ System Thinkers ▸ Incident-Hardened ▸ Async-First
WHY TECHDOME:
Real Ownership — You won't "assist." You will own production systems that matter.
Real Systems — Production-scale challenges, real stakes, no sandbox.
Real Growth — You learn by doing. You improve by solving hard problems under pressure.
Real Impact — What you ship and fix is felt immediately — by real users, in real time.
If you can take control of systems during the toughest hours, make decisions without waiting, and keep production stable when it matters most, we should talk.
How to Apply — Two Steps
Step 1: Apply for the role and complete the AI Interview (10–15 minutes) on JustInterview.ai, our in-house AI interview platform.
Step 2: If shortlisted our team will land in Jet speed