DevOps Engineer
Frisco Onsite
$135k-$150k
Stelvio are working with a client who builds technology solutions for the Intelligent Transportation Systems sector, supporting safer, smarter, and more connected transport infrastructure.
The team operates with the pace and ownership of a smaller startup environment, while being part of a larger organisation. They are building and scaling a SaaS platform that supports real-time data processing across cloud, on-premise, and edge environments.
This role is suited to a hands-on DevOps Engineer who enjoys working with modern infrastructure, distributed systems, data platforms, and high-availability environments.
Responsibilities
- Implement and manage production-grade Kubernetes clusters
- Use Argo CD for GitOps deployments and Terraform for Infrastructure as Code
- Build and maintain scalable infrastructure across cloud and on-premise environments
- Develop and support CI/CD pipelines for containerised applications
- Monitor infrastructure performance and ensure high availability for critical systems
- Support S3-based Data Lake infrastructure integrated with Dagster
- Manage and optimise NATS messaging systems for real-time event streaming
- Support Numaflow pipelines for real-time stream processing and reliable data flow
- Deploy and manage on-premise and edge computing infrastructure
- Support hybrid cloud solutions connecting edge deployments with central cloud infrastructure
- Work closely with development teams deploying Go and Python applications
- Manage and improve build systems, including Bazel or similar tooling
- Integrate security practices into CI/CD processes
- Support compliance with security standards and internal policies
- Implement monitoring and observability across distributed systems, data pipelines, and message brokers
- Identify and resolve performance, reliability, and data processing issues
- Support disaster recovery and system reliability initiatives
Qualifications
- 5+ years of experience in DevOps, IT infrastructure, or software engineering
- 3+ years of hands-on production Kubernetes experience
- Strong experience managing Kubernetes clusters, including networking, storage, security, and reliability
- Strong Terraform experience for Infrastructure as Code
- Experience building and maintaining CI/CD pipelines
- Experience with GitOps practices, ideally using Argo CD
- Experience managing Data Lakes or Data Warehouses, using technologies such as Hadoop, Spark, Snowflake, BigQuery, or similar
- Experience working in high-availability or performance-critical environments
- Knowledge of monitoring and observability tools such as Grafana, Prometheus, or OpenTelemetry
- Experience diagnosing and resolving performance and reliability issues
- Familiarity with cloud, on-premise, or hybrid infrastructure environments
- Experience with large-scale build systems such as Bazel, Buck, Nx, Turborepo, Pants, or similar
- Understanding of networking, load balancing, and security protocols
- Experience with microservices and container orchestration
- A Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent work experience
Preferred experience includes:
- Edge deployment or hybrid cloud architecture experience
- Background in transportation or another environment involving deployed and distributed software
- AWS Certified Solutions Architect, Certified Kubernetes Administrator, or similar certification