About the Role
Our client is a growing payment platform company looking to hire a DevOps & Site Reliability Engineer to support, maintain, and scale their production systems. This role will work closely with engineering teams to ensure system reliability, performance, and security.
Key Responsibilities
• Manage and maintain cloud infrastructure (AWS / Azure / GCP)
• Build, maintain, and improve CI/CD pipelines
• Ensure high availability, performance, and reliability of systems
• Monitor system health, troubleshoot incidents, and perform root cause analysis
• Automate infrastructure provisioning and deployment using IaC tools
• Support application deployments and production releases
• Work closely with developers to improve system scalability and resilience
• Ensure security best practices across infrastructure and deployments
Requirements
• 5+ years of experience in DevOps, SRE, or Infrastructure Engineering
• Strong experience with cloud platforms (AWS preferred)
• Hands-on experience with CI/CD tools (e.g. Jenkins, GitHub Actions, GitLab)
• Experience with containerization and orchestration (Docker, Kubernetes)
• Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK, etc.)
• Scripting experience (Bash, Python, or similar)
• Experience supporting production systems in a high-availability environment
• FinTech or payments industry experience is a plus