Job Title: Software Engineer - DevOps
Location: New York, NY (Hybrid)
Type: Full-time
About Us: We are an innovative AI SaaS startup based in New York, focused on building industry leading AI Agents-based automation solutions to empower businesses, especially starting in financial services. Our cloud-native platform leverages the latest in AI and machine learning to deliver scalable, intelligent automation. As we rapidly scale, we're looking for a talented Cloud DevOps Engineer to help optimize and secure our infrastructure while ensuring continuous delivery of high-quality software.
Role Overview: As a Cloud DevOps Engineer, you will play a critical role in designing, implementing, and maintaining our cloud infrastructure. You will work closely with the software engineering and AI/ML teams to automate deployments, monitor performance, and ensure security and scalability. You will drive the adoption of DevOps best practices, including CI/CD pipelines, cloud orchestration, and infrastructure as code, ensuring our AI platform is reliable, efficient, and scalable.
Key Responsibilities
- Design, build, and maintain scalable, secure, and high-performance cloud infrastructure (AWS/Azure/GCP).
- Automate infrastructure management using Infrastructure-as-Code (OpenTofu, Pulumi, etc.).
- Develop, implement, and manage CI/CD pipelines to streamline the software development lifecycle.
- Monitor, troubleshoot, and optimize system performance, availability, and security, including on-call support.
- Collaborate with software engineering and AI/ML teams to align infrastructure with our AI product goals.
- Ensure infrastructure meets the highest security standards and compliance requirements.
- Establish and maintain logging, monitoring, and alerting systems to support rapid response to issues.
- Conduct cloud cost optimization and ensure efficient use of cloud resources.
- Enable rapid scaling of infrastructure in response to growing data and user demands.
- Contribute to disaster recovery planning, backup strategies, and fault-tolerant designs.
Requirements
- Bachelor's or Master's degree in Computer Science.
- 5+ years of experience in a Cloud DevOps role, working in cloud environments like AWS, Google Cloud, or Azure.
- Expertise in cloud infrastructure management tools like Terraform/OpenTofu, CloudFormation, Ansible or Pulumi.
- Strong experience with CI/CD tools like Github Workflows, Jenkins, GitLab, or equivalent.
- Experience with containerization (Docker, Kubernetes, ECS, etc.) and orchestration tools.
- Solid understanding of networking, security best practices, and monitoring in cloud environments.
- Proficiency in scripting languages like Python, Bash, or PowerShell.
- Familiarity with logging/monitoring solutions like Prometheus, Grafana, Datadog, or CloudWatch.
- Knowledge of cloud-native technologies, microservices architecture, and serverless computing.
- Experience with cloud security management, including IAM policies, VPC configuration, and data encryption.
- Excellent problem-solving skills and ability to work in a fast-paced, agile environment.
Preferred Qualifications
- Experience in AI/ML environments, supporting machine learning operations (MLOps/LLMOps) and data pipelines.
- AWS/GCP/Azure certifications.
- Knowledge of GitOps practices and tools (Flux, ArgoCD, etc.).
- Understanding of regulatory and compliance standards (GDPR, SOC 2, ISO 27001, etc.).
What We Offer
- Competitive salary and equity options.
- Flexible working environment with hybrid options.
- Opportunity to work with leading-edge AI technologies in a fast-growing startup.
- Collaborative, inclusive, and innovative team culture.
- Professional development opportunities, including certifications and training.
- Health insurance and other benefits.
How to Apply: If you're passionate about cloud infrastructure, DevOps best practices, and want to help shape the future of AI technology, we'd love to hear from you. Please submit your resume and a brief cover letter outlining your experience and interest in the role at https://jobs.ashbyhq.com/artian.