Job Title: Senior DevOps Engineer
Location: Palo Alto (Hybrid)
Duration: 6+ months with possibility of extension and FTE conversion
Pay rate: $80-85/hr on W2 or C2C
Responsibilities
Follow-the-Sun Incident Response: Provide SEV-1/SEV-2 incident coverage during PST/PDT hours (JP team off-hours), ensuring the contracted 2-hour initial response SLA is met around the clock.
● Infrastructure Management: Deploy and maintain cloud-based infrastructure on AWS (S3, Aurora/Postgres, IAM, Route 53, WAF, Cloudfront) leveraging IaC practices (Terraform) for scalability and reliability.
● Pipeline Management: Build, maintain, and improve CI/CD pipelines (GitHub Actions) to ensure efficient and consistent delivery of software across ADAS tenant environments (dev, stage, pre-production, production).
● Platform Stability & Uptime: Monitor cross-account AWS infrastructure and Kubernetes workloads (Stargate-based) to maintain the 99.5% monthly uptime target aligned with AWS/Stargate SLAs.
● Monitoring & Incident Response: Monitor application performance and infrastructure health using Sentry, Prometheus, Grafana, and related tools; respond to incidents to maintain uptime and reliability
● Automation: Identify manual processes and implement automation solutions to streamline workflows, reduce deployment times, and minimize operational overhead.
● Security Compliance: Integrate security best practices within CI/CD pipelines and infrastructure, ensuring adherence to compliance standards and IaC-only provisioning policies.
● Documentation: Document processes, configurations, runbooks, and best practices to ensure knowledge sharing and maintain operational continuity across JP/NA time zones.
● Documentation: Document processes, configurations, runbooks, and best practices to ensure knowledge sharing and maintain operational continuity.
Required (MUST)
● 3+ years of professional DevOps / SRE experience.
● CI/CD Pipelines: Proficiency in setting up, maintaining, and troubleshooting CI/CD pipelines (e.g., GitHub Actions).
● Containerization: Solid experience with Docker and Kubernetes, including deployment, scaling, and management.
● Cloud Providers: Hands-on experience with AWS (strongly preferred). Strong understanding of IaaS and PaaS offerings, IAM, and networking within cloud environments.
● Infrastructure as Code (IaC): Proficiency with Terraform for managing cloud infrastructure.
● Monitoring & Logging: Experience with monitoring and logging tools (e.g., Prometheus, Grafana, Sentry, ELK stack) for performance tracking and troubleshooting.
● Incident Management: Experience with on-call rotations, incident triage, and follow-the-sun support models.
● Strong communication skills in cross-functional environments involving engineers, product owners, and leadership — particularly across time zones (JP/NA coordination).
Nice to Have (WANT)
● Solid experience working with service mesh (e.g., Istio).
● Security Knowledge: Understanding of DevSecOps principles, including secure deployment practices, vulnerability scanning, and incident response.
● Solid understanding of software development lifecycle (SDLC) and agile delivery (Scrum / Kanban).
● Prior experience in multi-tenant or enterprise-scale platforms.
● Experience with backup/restore automation and disaster recovery procedures
Familiarity with Nx monorepo tooling and multi-tenant architectures.
BayOne is an Equal Opportunity Employer and does not discriminate against any employee or applicant for employment because of race, color, sex, age, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any federal, state, or local protected class. This job posting represents the general duties and requirements necessary to perform this position and is not an exhaustive statement of all responsibilities, duties, and skills required. Management reserves the right to revise or alter this job description.