About ARAs Client
Ara's client is a fast-growing technology company transforming warehouse and distribution center operations through advanced, AI-powered robotics solutions. Founded in 2019, the organization builds compact, flexible, and high-performance mobile robotic systems designed for modern logistics environments. Backed by marquee investors, They are focused on enabling faster deployment, significant space optimization, and step-change productivity improvements for enterprise customers.
About the Role
We are looking for a Software Engineer – DevOps / Systems Operations who thrives at the intersection of OS internals and release engineering. This role is responsible for the deployment, reliability, and on-prem infrastructure of a large-scale distributed robotics system.
Unlike traditional "Cloud DevOps" roles, this position requires deep UNIX/Linux expertise. You will be expected to diagnose issues at the kernel and application layers, manage complex release pipelines, and ensure system stability in high-load, Wi-Fi-dependent environments without relying on cloud-provider dashboards.
Roles & Responsibilities:
• Linux Systems Mastery: Operate and tune bare-metal Linux servers (CPU, memory, disk, network). Perform deep-dive troubleshooting using system logs and OS-level diagnostics.
• Release & Pipeline Management: Own the end-to-end CI/CD process, coordinating releases and executing staged production rollouts with robust rollback strategies.
• Container Excellence: Create, layer, and debug Docker images. Integrate containerized stacks into on- prem workflows and troubleshoot runtime issues.
• Automation: Develop sophisticated Bash or Python scripts for log parsing, build automation, and custom system tooling.
Requirements:
• Deep UNIX/Linux Expertise: Expert-level command of Linux internals (specifically Ubuntu Server). Comfortable reading logs and managing processes without high-level abstractions.
• Release Engineering: Proven experience in versioning, rollback strategies, and managing the lifecycle of a release.
• Containerization: Proficient with Docker (image optimization, debugging, and layering).
Hands-on Scripting: Strong proficiency in Bash and Python for automation and systems-level tooling.
Good to Have:
• SRE/Build Engineering Background: Experience as a Systems Reliability Engineer or Build/Release Engineer.
• On-Prem Kubernetes: Experience managing K8s in a bare-metal or edge environment.
• Database Ops: Operational familiarity with Postgres, MongoDB, or Redis (availability and backups).
What We Are NOT Looking For
• Pure Cloud Specialists: Candidates whose experience is limited to configuring AWS/Azure/GCP services
(EKS, Lambda, etc.) without deep OS-level troubleshooting skills.
• Dashboard-Reliant Engineers: We need experts who can diagnose a system failure from a terminal when
the high-level dashboard doesn't provide the answer