Job Description
Job Title: Senior DevOps Engineer (Infrastructure/SRE)
Department: Technology
Location: Gurgaon
Work Mode: On-site
Working Hours: 10 AM - 7 PM
Terms: Permanent
Experience: 4-6 years
Education: B.Tech/MCA
Notice Period: Immediate
About Us
Infra360.io is a next-generation cloud consulting and services company committed to delivering comprehensive, 360-degree solutions for cloud, infrastructure, DevOps, and security. We partner with clients to transform and optimize their technology landscape, ensuring resilience, scalability, cost efficiency, and innovation.
Our core services include Cloud Strategy, Site Reliability Engineering (SRE), DevOps, Cloud Security Posture Management (CSPM), and related Managed Services. We specialize in driving operational excellence across multi-cloud environments, helping businesses achieve their goals with agility and reliability.
We thrive on ownership, collaboration, problem-solving, and excellence, fostering an environment where innovation and continuous learning are at the forefront. Join us as we expand and redefine what’s possible in cloud technology and infrastructure.
Role Summary
We are looking for a Senior DevOps Engineer (Infrastructure/SRE) to design, automate, and manage cloud-based and datacentre infrastructure for diverse projects. The ideal candidate will have deep expertise in at least one public cloud platform (AWS, GCP, or Azure), with a strong focus on cost optimization, security best practices, and infrastructure automation using tools like Terraform and CI/CD pipelines.
This role involves designing scalable architectures (containers, serverless, and VMs), managing databases, and ensuring system observability with tools like Prometheus and Grafana. Strong leadership, client communication, and team mentoring skills are essential. Experience with VPN technologies and configuration management tools (Ansible, Helm) is also critical. Multi-cloud experience and familiarity with APM tools are a plus.
Ideal Candidate Profile
4-6 years of solid experience as a DevOps engineer, with a proven track record of architecting and automating solutions in the cloud.
Experience in troubleshooting production incidents and handling high-pressure situations.
Strong leadership skills and the ability to mentor team members and provide guidance on best practices.
Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
Extensive experience with Kubernetes, Terraform, ArgoCD, and Helm.
Strong expertise in at least one public cloud (AWS, GCP, or Azure).
Strong grasp of cost optimization and security best practices.
Strong skills in infrastructure automation using Terraform and in CI/CD automation.
Strong skills in configuration management using Ansible, Helm, and similar tools.
Good at designing architectures (containers, serverless, VMs, etc.).
Hands-on experience working on multiple projects.
Strong client communication and requirements-gathering skills.
Database management experience.
Good experience with Prometheus, Grafana, and Alertmanager.
Able to manage multiple clients and take ownership of client issues.
Experience with Git and coding best practices.
Proficiency in cloud networking, including VPCs, DNS, VPNs (OpenVPN, Openswan, Pritunl, site-to-site VPNs), load balancers, and firewalls, to ensure secure and efficient connectivity.
Strong understanding of cloud security best practices, identity and access management (IAM), and compliance requirements for modern infrastructure.
Good to have
Multi-cloud experience across AWS, GCP, and Azure.
Experience with APM and observability tools such as New Relic, Datadog, and OpenTelemetry.
Proficiency in scripting languages (Python, Go) for automation and tooling to improve infrastructure and application reliability.
Key Responsibilities
Design and Development:
Architect, design, and develop high-quality, scalable, and secure cloud-based software solutions.
Collaborate with product and engineering teams to translate business requirements into technical specifications.
Write clean, maintainable, and efficient code, following best practices and coding standards.
Cloud Infrastructure:
Develop and optimise cloud-native applications, leveraging cloud services like AWS, Azure, or Google Cloud Platform (GCP).
Implement and manage CI/CD pipelines for automated deployment and testing.
Ensure the security, reliability, and performance of cloud infrastructure.
Technical Leadership:
Mentor and guide junior engineers, providing technical leadership and fostering a collaborative team environment.
Participate in code reviews, ensuring adherence to best practices and high-quality code delivery.
Lead technical discussions and contribute to architectural decisions.
Problem Solving and Troubleshooting:
Identify, diagnose, and resolve complex software and infrastructure issues.
Perform root cause analysis for production incidents and implement preventative measures.
Continuous Improvement:
Stay up-to-date with the latest industry trends, tools, and technologies in cloud computing and software engineering.
Contribute to the continuous improvement of development processes, tools, and methodologies.
Drive innovation by experimenting with new technologies and solutions to enhance the platform.
Collaboration:
Work closely with DevOps, QA, and other teams to ensure smooth integration and delivery of software releases.
Communicate effectively with stakeholders, including technical and non-technical team members.
Client Interaction & Management:
Serve as a direct point of contact for multiple clients.
Handle the unique technical needs and challenges of two or more clients concurrently.
Combine direct interaction with clients and internal team coordination.
Production Systems Management:
Manage, monitor, and debug production environments, drawing on extensive prior experience.
Troubleshoot complex issues and ensure that production systems run smoothly with minimal downtime.