Are you ready to join a fun, nimble team that thrives on collaboration and innovation?
At Azul, we are dedicated to advancing our technology and infrastructure, and we are looking for passionate individuals to be part of our journey. As a member of our team, you will have the opportunity to work alongside talented Engineers who are committed to building and maintaining a secure and high-performance cloud infrastructure.
What You'll Do (aka the Responsibilities)- Manage connectivity between within and across multiple Cloud providers (AWS, GCP, and Azure)
- Support Cloud and on-premise Kubernetes stacks
- Design and implement IT infrastructure
- You will develop and support CI/CD pipelines
- Develop and support observability and alerting infrastructure
- Work with a team of Cloud Operations Engineers to help build and maintain the systems and code that allow us to provide an always available, secure, and performant cloud infrastructure
- Work with internal Engineering Teams to support the deployment and monitoring of their products
- Automate monitoring of cloud infrastructure using Open Telemetry, Prometheus, Grafana and other observability tools
- Deploy/provision new cloud infrastructure using automation like terraform, argocd, helm, ansible, boto3 (Python)
- Develop automated remediation for system faults to remove points of failure in cloud infrastructure
- Evaluate and make recommendations about stacks, tooling, and engineering best-practices
What You'll Bring (aka Education and Experience)- Bachelor's degree in computer science, Engineering, or a related field, or equivalent work experience.
- 5+ years of experience in a DevOps or Site Reliability Engineering (SRE) role, with a proven track record of managing large-scale infrastructure.
- Linux proficiency
- Familiarity with OpenStack
- A strong understanding of networking. The ability to diagnose and understand network issues. (BGP, IPsec, VXLAN, Geneve, 802.1Q, etc.)
- Expertise in AWS, Azure, GCP, and cloud-native technologies.
- In-depth knowledge of CI/CD tools (Jenkins, GitLab CI, ArgoCD, etc.) and best practices.
- Experience with infrastructure-as-code tools, such as Terraform, CloudFormation, Ansible, etc.
- Experience with containerization (Docker) and orchestration tools (Kubernetes, OpenShift).
- Familiarity with observability tools (OpenTelemetry, Prometheus, Grafana, Loki ELK, Splunk, etc.).
- Proficiency in scripting and programming languages, e.g. Python, Bash, Go, and Rust.
- Experience with microservices architecture.
- Familiarity with serverless technologies.
- Certifications in relevant cloud platforms (e.g., AWS Certified DevOps Engineer, Azure DevOps Engineer).
- Knowledge of infrastructure security practices (e.g., IAM, security groups, etc.).
What You'll Bring (aka Skills)- Ability to adapt to different teams and priorities, juggle multiple tasks
- Confidence in decision-making to pursue company goals
- A desire to learn and continually develop and expand your skillset
- Curiosity
- Excellent problem-solving skills and the ability to troubleshoot complex issues in production environments.
- Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams.
What We Offer- Equity Program - be part of the company success.
- Annual bonus based on company performance.
- Referral Program - earn referral bonuses and bring your colleagues.
- IT Equipment - MacBook Pro or any other HW according to your preferences.
- Work-life balance - generous holidays, sick time, flexible working hours, 100% work from home also possible.
- Most importantly, you will work with top experts worldwide who contribute to the Java ecosystem!