Skills:
Continuous Deployment, Infrastructure as Code, Containerization, Automation, Monitoring, Cloud Computing, Scripting, SRE,
Required Experience
- 2+ years experience in designing and implementing complex application and systems solutions
- Mandatory; solid experience with working on UNIX/Linux platforms.
- Strong architectural background in Public/Private/Hybrid Cloud platforms like Amazon Web Services, MS Azure, Google Cloud Platform, Openstack etc.
- Experience in DevOps tools like JIRA, Rally, Confluence, Jenkins, Git, SVN, Maven, Nexus, Docker, Ansible, Chef, Puppet, Kubernetes etc.
- Ability to write code in at least one language (Go, Rust, Python, Perl, BASH, Java, JavaScript)
- Experience with automated deployment and release management (CI/CD, Jenkins, Artifact management system, Ansible, Puppet or an equivalent)
- Good understanding of offerings like Iaas, PaaS, SaaS, CaaS etc. relevant design experience is an added advantage
- Experience with automating monitors and system performance reporting using ELK, Logstash, Kafka, and Prometheus, TICK, Grafana etc.
- Familiar with security concerns-> networking concerts (DNS, two factor authentication, SSL, VPN, VPC) will be added advantage.
- Working experience on databases like Casandra, Hadoop, Redis, MySQL and Oracle.
- Added Advantage if worked on IoT device validation for key areas like NB-IoT protocol, FOTA (Firmware over the Air), Device Management, LWM2M, MQTT.
Responsibilities.
- Managing large scale production environments (50+ VMs, multiple services) on the cloud (GCP/Azure preferable).
- In production: deploy, support, and monitor systems on the cloud (2+ years)
- Knowledge of best practices and IT operations in an always-up, always-available service
- Develop and maintain scripts to monitor the environment and to automate tasks
- Partner with the engineering team to ensure that the applications are designed with operability in mind
- Available 24X7 on need basis, if there are any second-tier escalations.
- Improve monitoring, alerting and resilience of the system.
- Manage the Continuous integration platform.
- Implement appropriate release procedures
- Design & planning of cloud platform architecture
- Strong network administration and troubleshooting skills