About The Role
Smart Bricks is looking for a highly skilled DevOps Engineer to design, manage, secure, and optimize our cloud infrastructure and deployment systems. You will play a critical role in ensuring high availability, scalability, security, monitoring, and operational excellence across our platforms and services.
You will work closely with backend, frontend, data, and product engineering teams to support rapid development cycles, reliable deployments, and scalable infrastructure.
Responsibilities
Cloud Infrastructure & Platform Engineering
- Design, implement, and manage scalable infrastructure on Google Cloud Platform (GCP)
- Manage Kubernetes clusters and containerized workloads in production environments
- Optimize infrastructure for high performance, scalability, reliability, and cost efficiency
- Maintain secure networking, DNS, domain management, SSL certificates, and WAF configurations
- Implement and maintain Infrastructure as Code (IaC) using Terraform or similar tools
CI/CD & Deployments
- Build and maintain CI/CD pipelines for frontend and backend deployments
- Automate application deployment, rollback, scaling, and operational workflows
- Improve deployment reliability, release speed, and developer productivity
- Support zero-downtime deployments and blue/green or canary release strategies
Monitoring, Observability & Reliability
- Implement and manage monitoring, logging, and alerting systems using Datadog and GCP tools
- Create dashboards, alerts, incident response workflows, and operational runbooks
- Proactively identify bottlenecks, failures, and performance issues
- Ensure high availability and uptime of systems and services
Database & Data Platform Operations
- Manage and optimize PostgreSQL databases for performance, backup, replication, and reliability
- Support BigQuery infrastructure, integrations, and performance optimization
- Implement database monitoring, maintenance, and disaster recovery strategies
Security & Compliance
- Implement cloud security best practices across infrastructure and applications
- Manage IAM, secrets, access control, firewalls, WAF policies, and network security
- Conduct infrastructure hardening and vulnerability mitigation
- Ensure operational compliance and security standards are maintained
Developer Enablement
- Support engineering teams with infrastructure, deployment, debugging, and operational tooling
- Improve developer workflows and platform self-service capabilities
- Collaborate closely with frontend, backend, and data engineering teams
Requirements
Technical Skills
- Strong experience with Google Cloud Platform (GCP)
- Hands-on experience with Kubernetes and container orchestration
- Experience with Datadog monitoring and observability platforms
- Experience with GitHub Actions
- Strong knowledge of CI/CD pipelines and deployment automation
- Experience with PostgreSQL administration and optimization
- Experience working with BigQuery or large-scale data systems
- Strong understanding of networking, DNS, SSL, WAF, and domain management
- Experience with Infrastructure as Code (Terraform preferred)
- Knowledge of Linux systems administration and shell scripting
- Experience with frontend and backend deployment pipelines
DevOps & Reliability
- Strong understanding of scalable distributed systems
- Experience designing highly available and fault-tolerant architectures
- Experience with monitoring, alerting, logging, and incident management
- Understanding of security best practices and cloud infrastructure hardening
Nice to Have
- Experience with multi-region deployments
- Experience with GitHub Actions, GitLab CI, or Jenkins
- Knowledge of service mesh technologies
- Experience with cost optimization and cloud governance
- Experience supporting AI/data platforms
Qualifications
- Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
- 5+ years of DevOps / SRE / Platform Engineering experience
- Strong troubleshooting and communication skills
- Ability to work in fast-paced, high-growth environments