Location: Remote (U.S.)
About Vehlo:
We started Vehlo in 2019 with a simple goal: to be the industry’s favorite provider of repair shop technology.
Across every part of the auto repair industry, Vehlo is igniting vehicle service success with software and financial solutions that unlock your potential. Our founder-led products power the entire service lane experience and keep customers coming back with streamlined tools that help you handle communication, workflow automation, touchless payments, valet pickup, and much more. We’re out to simplify the customer journey from start to finish and give power back to the people under the hood, making their jobs easier and your shop more profitable. Just ask our more than 30,000 customers, who generate more than 60M repair orders and process over $12B in payment volume annually. At Vehlo, our only purpose is your success, and together, we’re reaching your goals faster than ever.
Being a Veep comes with more than a comprehensive benefits package. Our biggest benefit is opportunity: opportunity to make an impact, opportunity for growth, and opportunity for recognition and rewards. This is not a mega-corporation where you wonder what people are doing all day; every Veep is moving the ball forward day in, day out for our customers or for each other.
About the Role:
We're hiring a platform engineer to deliver our 2026 strategy across IT performance visibility, vulnerability and risk management, APM, and FinOps. In practice, that means rolling out New Relic across our AWS accounts, migrating to GitHub Enterprise Advanced Security, building the Jira and Confluence foundation for risk and performance reporting, and standardizing the Terraform modules and GitHub Actions workflows a small team relies on. Day to day, you'll also handle the operational work that keeps the platform running, including Dependabot triage, alert tuning, deploy support, and one-off brand requests. This is an all-hands-on-keyboard team where every engineer ships code, writes Terraform, resolves incidents, and helps teammates through hard problems rather than guarding private corners.
What You’ll Do:
- New Relic & APM Implementation: Deploy and manage New Relic APM, infrastructure, and browser agents across AWS services (ECS, Elastic Beanstalk, Lambda, EC2, EKS). Establish standardized alert policies and dashboards using Terraform. Optimize telemetry ingest, manage drop rules, and control observability costs.
- GitHub Enterprise & Vulnerability Management: Lead migration to GitHub Enterprise, implementing SSO, branch protections, CODEOWNERS, and Advanced Security features. Develop reusable GitHub Actions workflows to streamline CI/CD. Operationalize vulnerability data into actionable Jira workflows with SLA tracking and brand-level reporting.
- Jira, Confluence & Reporting: Design and manage Jira projects, workflows, automation rules, and permissions. Administer Confluence spaces, templates, and backups. Build centralized reporting to provide leadership with visibility into delivery performance, risk, and application health (APM).
- FinOps & Cost Visibility: Create domain-level cost dashboards leveraging AWS, New Relic, and SaaS data. Drive cost optimization initiatives (e.g., S3 Intelligent-Tiering, lifecycle policies, telemetry drop rules, resource decommissioning). Support vendor renewal evaluations and cost analysis.
- Standardization & Enablement: Develop reusable Terraform modules, GitHub Actions workflows, and engineering templates. Author reference documentation and promote adoption of best practices across teams.
- Infrastructure & Daily Operations: Own shared AWS infrastructure, including provisioning, access management, networking, and ongoing maintenance. Triage Dependabot PRs, fine-tune alerts, support team migrations, participate in on-call rotations, and create and run operational runbooks.
Travel Requirement: less than 5%.
Duties, responsibilities, and activities may change at any time with or without notice, in accordance with applicable laws.
Qualifications
What You Bring:
- Experience: 3–6 years of experience in Cloud Engineering, DevOps, Site Reliability Engineering (SRE), Platform Engineering, or Developer Productivity.
- Observability Expertise: Hands-on experience with observability platforms at scale (e.g., New Relic, Datadog, or similar), including agent deployment, alerting, dashboards, ingest management, and integrations.
- GitHub Administration & Automation: Experience with GitHub at an organizational level, including teams, SSO, branch protection, OIDC, and reusable GitHub Actions workflows. Exposure to GitHub Advanced Security is a plus. You should be willing to grow into admin-level ownership.
- Atlassian Tools Familiarity: Working knowledge of Jira and Confluence as a user; familiarity with project configuration, workflows, and collaboration. Administrative experience is a plus but not required.
- AWS Cloud Proficiency: Production experience with AWS services such as IAM, S3, Lambda, and at least one compute platform (e.g., ECS, EC2, EKS). Comfortable operating in multi-account environments with SSO.
- Infrastructure as Code: Experience using Terraform (or equivalent IaC tools), including authoring reusable modules from scratch and maintaining them over time.
- Scripting & Automation: Proficiency in at least one scripting or programming language such as Python or Bash.
- Communication Skills: Strong written communication skills with experience creating runbooks, technical design documents, and stakeholder-facing reports.
Eligible employees may receive:
- Medical, dental, vision, and life insurance
- 401(k) with company match
- Paid time off and holidays
Compensation is based on experience, knowledge, and skills and represents a good-faith estimate in accordance with applicable laws.
Work Environment & Physical Requirements:
This role may be performed in a remote, hybrid, or office-based environment depending on business needs.
- Ability to remain in a stationary position (sitting or standing) for extended periods
- Ability to operate a computer and standard office equipment (e.g., keyboard, mouse, headset)
- Ability to view and interpret information on a computer screen for extended periods
- Ability to communicate effectively via phone, video, and written communication
- Ability to participate in virtual meetings with or without reasonable accommodation
Remote Work Expectations (if applicable):
- Maintain a dedicated, safe, and distraction-free workspace
- Maintain a reliable high-speed internet connection sufficient for video conferencing and job-related systems
- Ability to maintain productivity in a remote environment
- Must reside in a state where the company is authorized to employ workers
- Must be able to work core hours aligned to U.S. business hours
Additional Information:
- Reasonable accommodations may be made to enable individuals with disabilities to perform essential job functions.
- Employment may be contingent upon a background check in accordance with applicable laws.
Note: This job description is intended to outline the general responsibilities and requirements of the role. It is not an exhaustive list of all duties, tasks, or responsibilities that may be required. Responsibilities and priorities may evolve over time, and the company reserves the right to make changes at any time with or without notice.
Vehlo is an equal opportunity employer that is committed to diversity and inclusion in the workplace. We prohibit discrimination and harassment of any kind based on race, color, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other protected characteristic as outlined by federal, state, or local laws.
This policy applies to all employment practices within our organization, including hiring, recruiting, promotion, termination, layoff, recall, leave of absence, compensation, benefits, training, and apprenticeship. Vehlo makes hiring decisions based solely on qualifications, merit, and business needs at the time.