Title: Software Engineer, DevOps
Level: L2
Location: Los Angeles, CA
Work setup: 3 days in office, 2 remote
Company
Talent Systems, LLC is the leading provider of casting and auditioning technology for the entertainment industry. Casting directors and agents worldwide use Talent Systems’ portfolio of products to source and manage talent across film, television, commercials, theater, and digital projects, powering an unparalleled global casting software ecosystem.
We are headquartered in Los Angeles and operate in the US, Canada, the UK, Australia, and India. Our portfolio brands include Casting Networks, Spotlight, Cast It Systems, Cast It Talent, Casting Frontier, Staff Me Up, Cast It Reach, and Tagmin.
Company Core Values
Job Description
We are seeking a skilled Software Engineer, DevOps to support and enhance the infrastructure, automation, and deployment pipelines for our cloud-based SaaS applications. Working with modern technologies such as Node.js, TypeScript, microservices, GraphQL, and AWS services, you will play a critical role in enabling platform and product engineering teams to deliver scalable, reliable, and high-performing solutions. This role requires close collaboration with these globally distributed teams to ensure seamless integration and operational excellence.
Job Responsibilities
- Manage and optimize AWS cloud resources, including EC2, S3, Lambda, API Gateway, and DynamoDB, for cost efficiency and scalability.
- Develop and maintain monitoring and logging systems using tools like Grafana.
- Ensure high availability and performance of production systems by defining and tracking SLIs and SLOs and by meeting SLAs.
- Design, implement, and maintain CI/CD pipelines for seamless code integration and deployment.
- Identify and resolve bottlenecks, incidents, and system failures proactively.
- Work closely with platform teams to support shared infrastructure and ensure seamless integration with product engineering workflows.
- Collaborate with globally distributed product teams to align DevOps practices with development needs and timelines.
- Act as a bridge between engineering and operations, ensuring smooth communication and alignment of priorities.
- Automate deployment, scaling, and recovery processes to minimize manual intervention.
- Optimize build and deployment processes for speed, efficiency, and reliability.
- Implement tools and scripts to improve developer productivity and system maintainability.
- Ensure the implementation of security best practices in infrastructure, deployment pipelines, and cloud resources.
- Collaborate with security teams to address vulnerabilities and maintain compliance with industry standards.
Qualifications
- Required:
- 3+ years of experience in DevOps/SRE engineering or a similar role, supporting cloud-based applications, especially on AWS.
- Knowledge of distributed streaming platforms such as Kafka or Redpanda.
- Proficiency with Terraform and infrastructure-as-code practices.
- Hands-on experience with AWS services such as EC2, Lambda, S3, DynamoDB, AWS Elastic Transcoder, AWS Elemental MediaConvert, and CloudFront.
- Proficiency in CI/CD tools such as GitHub Actions and Flux CD or Argo CD.
- Experience with containerization and orchestration tools like Docker and Kubernetes.
- Familiarity with Node.js, TypeScript, PHP, .NET, Ruby on Rails, microservices, and GraphQL in the context of deployment and operations.
- Experience with monitoring and observability tools such as Mimir, Loki, Grafana, and OpenTelemetry.
- Preferred:
- Strong knowledge of networking concepts, load balancing, and CDN management.
- Strong knowledge of emerging AI technologies.
- Soft skills:
- Excellent collaboration and communication skills, with experience working in globally distributed teams.
- Proactive problem-solving and troubleshooting abilities.
- A strong sense of ownership, accountability, and urgency in delivering results.
- Ability to adapt to a fast-paced and dynamic environment.