Senior DevOps Engineer
Contract
Ann Arbor, MI (Onsite or Hybrid)
This position pays around $60/hr - $65/hr on w2
The primary responsibility of this role is to drive operational efficiency, automation, and reliability for the ADAS infrastructure. This role focuses on building sustainable automation, monitoring systems, and security practices that enable the ADAS platform to scale reliably as the tenant engineering group grows.
Responsibilities
- Automation: Identify manual processes and implement automation solutions to streamline workflows, reduce deployment times, and eliminate repetitive tasks that could overwhelm the DevOps team during peak periods.
- Infrastructure Management: Deploy and maintain cloud-based infrastructure on AWS (S3, Aurora/Postgres, IAM, Route 53, WAF, Cloudfront) leveraging IaC practices (Terraform) for scalability and reliability.
- Monitoring & Observability: Build and maintain monitoring systems to track application performance and infrastructure health. Ensure production observability meets the standards required for 99.5% uptime targets.
- Pipeline Management: Build, maintain, and improve CI/CD pipelines (GitHub Actions) to ensure efficient and consistent delivery of software across ADAS environments.
- Security Patch Deployment: Apply critical security patches and core module updates within 5 days of official release. Notify tenants within 24 hours of security advisories.
- Incident Response: Provide extended NA coverage for SEV- 1/SEV-2 Incidents during EST/EDT hours, serving as backup to the West Coast engineer during peak incident periods.
- Documentation: Document processes, configurations, runbooks, and best practices to ensure knowledge sharing and maintain operational continuity.
Required Skills:
- 3+ years of professional DevOps / SRE experience.
- CI/CD Pipelines: Proficiency in setting up, maintaining, and troubleshooting CI/CD pipelines (e.g., GitHub Actions).
- Containerization: Solid experience with Docker and Kubernetes, including deployment, scaling, and management.
- Cloud Providers: Hands-on experience with AWS (strongly preferred). Strong understanding of IaaS and PaaS offerings, IAM, and networking within cloud environments.
- Infrastructure as Code (IaC): Proficiency with Terraform for managing cloud infrastructure.
- Monitoring & Logging: Experience with monitoring and logging tools (e.g., Prometheus, Grafana, Sentry, ELK stack) for performance tracking and troubleshooting.
- Strong communication skills in cross-functional environments involving engineers, product owners, UX designers, and leadership.
Nice to Have Skills:
- Solid experience working with service mesh (e.g., Istio).
- Security Knowledge: Understanding of DevSecOps principles, including secure deployment practices, vulnerability scanning, and incident response.
- Solid understanding of software development lifecycle (SDLC) and agile delivery (Scrum / Kanban).
- Prior experience in multi-tenant or enterprise-scale platforms.
- Experience with backup/restore automation and disaster recovery procedures.
- Familiarity with Nx monorepo tooling and multi-tenant architectures.