About the Company
At Kognitos, we are looking for a Lead Infrastructure Engineer to own and scale the core systems that drive developer velocity and platform reliability. If you're passionate about Terraform, scalable cloud infrastructure, and empowering high-performance engineering teams, this is your chance to lead from the front and push the boundaries of modern infrastructure.
About the Role
In this hybrid role, you’ll be at the intersection of Developer Productivity and Site Reliability Engineering (SRE). You’ll design, implement, and maintain the infrastructure that enables our teams to move fast and operate with confidence. Whether it’s codifying infrastructure with Terraform, leading DevOps initiatives, or building internal tools from scratch, you'll play a critical role in shaping how Kognitos ships software and scales systems. We need someone who thrives on innovation and isn’t afraid to challenge conventional approaches with cutting-edge tools and thinking.
Responsibilities:
- Own and evolve our Infrastructure as Code (IaC) systems using Terraform.
- Design and maintain scalable, cloud-native infrastructure (primarily AWS).
- Build and improve CI/CD pipelines using GitHub Actions and other tools.
- Lead monitoring and observability efforts using tools like Prometheus, Grafana, Signoz, and Datadog.
- Improve internal tooling to streamline developer workflows and automate manual processes.
- Champion reliability, security, and performance across all systems.
- Mentor engineers and promote DevOps best practices across the organization.
- Drive cost optimization and ensure infrastructure efficiency.
- Support compliance frameworks and standards within the infrastructure environment.
Qualifications:
9+ years of experience in Infrastructure, SRE, DevOps, or Developer Productivity roles.
Required Skills:
- Deep experience with Terraform and other Infrastructure as Code tools.
- Strong knowledge of cloud platforms (AWS, GCP, or Azure).
- Proficiency with container technologies (Kubernetes, Docker).
- Solid scripting or automation experience (Python, Bash, or Go).
- Experience with CI/CD systems and practices (GitHub Actions, Jenkins, etc.).
- Hands-on experience with monitoring tools like Signoz, Datadog, and Prometheus.
Preferred Skills:
- Prior experience with compliance frameworks and cost optimization in cloud environments.
- Ability to take ownership and lead infrastructure projects end-to-end.
- Strong communicator who can work across teams and technical levels.
- Highly organized, detail-oriented, and adaptable in a fast-paced environment.
- Curious and forward-thinking, actively exploring and implementing emerging technologies to stay ahead of the curve.
- Experience with AI-driven observability and debugging techniques.
Equal Opportunity Statement:
Kognitos is committed to fostering a diverse and inclusive workplace. We believe that our differences make us stronger and encourage applicants from all backgrounds to apply.