Vision
We aim to build the operating system for commerce, through a combination of world-class infrastructure, logistics operations of the highest quality, and cutting-edge engineering and technology capabilities.
We are looking for a Principal Engineer / DevOps Architect to support application development, infrastructure, and security from the start by automating workflows so that the DevOps process does not slow down. The ideal candidate thinks in terms of automation with built-in security, not security that functions as a perimeter around apps and data. The candidate will work closely with the architects, quality assurance, and development teams throughout the application lifecycle to achieve application availability, scalability, and operational effectiveness in the most secure way that is aligned with automation. The role also involves achieving and maintaining the uptime, performance, and scalability of very low-latency, high-traffic apps hosted on microservice platforms. The candidate will work with cutting-edge technology, including containers, Kubernetes, Jenkins, SaltStack, Kafka, Redis, RabbitMQ, Elasticsearch, Git, service mesh, Prometheus, Loki, etc.
JOB DESCRIPTION
Role & Responsibilities:
- Work with the engineering team to define best practices and processes that support the entire infrastructure lifecycle (Plan, Build, Deploy, and Operate), including automating lifecycle activities such as self-service, orchestration and provisioning, and configuration management.
- Lead and define application analysis for infrastructure design, capacity planning, monitoring and operational requirements.
- Provide project leadership and ownership over all aspects of service delivery.
- Partner closely with development teams to assess the feasibility, complexity and scope of new capabilities and solutions.
- Collaborate with the architecture and infrastructure delivery teams to ensure consistent solution design and integration.
- Prepare technical quality entry/exit gates for infrastructure delivery across design, integration testing, deployment and operations.
- Drive continuous improvement including design, service strategy, and standardization of processes and methodologies.
- Provide architecture leadership across multiple infrastructure teams.
- Continuously review the current environment to detect deficiencies, and recommend, plan and implement improvements.
- Provide leadership for strategic planning or complex problem-solving for programs and projects including the creation, maintenance, and governance of roadmaps.
- Review SOPs and design documents to ensure that they follow the standards.
- Develop and enforce security policies, standards, and best practices for cloud and Kubernetes environments. Ensure compliance with industry regulations and organizational security requirements.
- Implement network security controls, such as firewalls, network security groups, and Virtual Private Cloud (VPC) configurations to protect communication between cloud and Kubernetes resources.
- Protect data at rest and in transit by implementing encryption, data masking, and access controls. Secure data storage solutions, databases, and data pipelines.
- Ensure the security of containers and containerized applications by using container runtime security tools, vulnerability scanning, and secure image registries.
- Set up logging and monitoring systems to detect and respond to security incidents. Implement alerting and anomaly detection for both cloud and Kubernetes resources.
- Ensure that cloud and Kubernetes environments comply with relevant security and privacy regulations (e.g., PCI-DSS) and industry standards (e.g., CIS benchmarks).
- Develop and maintain an incident response plan to quickly respond to and recover from security incidents. Conduct tabletop exercises and drills to test the plan.
- Implement security automation through infrastructure as code (IaC) and DevSecOps practices. Use tools like Terraform, Ansible, and Kubernetes Operators to ensure security is built into the deployment process.
- Assist with security audits and compliance assessments conducted by internal or external auditors.
- Promote a Secure DevOps culture, fostering collaboration between development, operations, and security teams.
Experience & Skills
- 8+ years of experience designing and implementing cloud infrastructure & software applications.
- Bachelor’s and/or Master’s degree in Computer Science.
- Experience designing and deploying high-performance systems with reliable monitoring and logging practices for distributed systems.
- Effectively work across team boundaries to establish overarching architecture, and provide
- Excellent communication skills, both written and verbal.
- Expertise in cloud infrastructure solutions like Microsoft Azure, Google Cloud or AWS.
- Hands-on experience in managing and deploying containerized environments using Docker and Kubernetes is a must.
- Hands-on experience in service mesh and advanced traffic routing.
- Experience with data stores and messaging systems like MongoDB, Cassandra, Elasticsearch, RabbitMQ, and Kafka.
- Awareness of data platform technologies like Kafka, Spark, Airflow, Superset, Trino, Pinot, KSQL, Kafka Connect, etc.
- Experience in designing highly efficient in-house observability platforms using open-source tools like Loki, Mimir, Thanos, Prometheus and Grafana.
- Experience in cloud infrastructure security, k8s security and IaC.