Required Skills:
- 7-8 years of experience in DevOps, preferably with exposure to AI/ML systems.
- Expertise in cloud infrastructure management on AWS or Azure.
- Proficiency in containerization (Kubernetes, Docker) and IaC (Terraform, Ansible).
- Strong scripting skills in Python, Typescript, or Bash.
- Experience with SRE and maintaining strict SLOs in customer-centric environments.
Ideal Candidate:
- Thrives in a 0-1 environment, rapidly progressing from proof-of-value to iteration.
- Delivers high release rates with minimal defects.
- Is passionate about AI/ML infrastructure and enterprise-level deployment.
Perks:
- Work with a top-tier team of veteran engineers.
- Significant influence on product and technical decisions.
- Opportunity to build AI infrastructure from scratch.
What would you do here
Realfast.ai is developing AI agents to enhance Salesforce implementation, aiming to improve delivery speed by at least 50%. The company’s long-term vision includes creating a platform for AI agent developers to build and deploy customized AI workflows throughout the IT services ecosystem.
Responsibilities:
- Architect and manage cloud infrastructure for Vayu, our AI platform, using IaC tools like Terraform and Ansible.
- Build CI/CD pipelines suitable for rapid iteration in a pre-PMF (pre-product-market fit) environment.
- Set up monitoring and observability for AI systems with tools like Prometheus, Grafana, and the ELK Stack.
- Implement security best practices for AI model deployment and access control.
- Support AI/MLOps with deployments of LLM-based solutions.