Main Mission
Design, deploy, manage, and optimize the cloud-based Kubernetes infrastructure for a callbot application integrating Asterisk, Python microservices, Redis, ChromaDB, and Whisper. Ensure scalability, security, high availability, and overall performance of the platform.
Key Responsibilities
Infrastructure & Cloud
• Deploy and manage Kubernetes clusters (EKS, GKE, AKS, or self-hosted).
• Provision and maintain cloud resources (VMs, networking, storage, load balancers).
• Configure VPC, subnets, firewall rules, and secure access.
Containerization & Services
• Write and optimize Dockerfiles for all services (Asterisk, Python services, Whisper, Redis, ChromaDB).
• Maintain private container registries (Docker Hub, AWS ECR, GCP Artifact Registry).
• Implement CI/CD pipelines for build and deployment automation.
Kubernetes Orchestration
• Create and maintain manifests (Deployments, Services, ConfigMaps, StatefulSets).
• Deploy and manage Redis clusters with persistence and scalability.
• Configure persistent volumes for Redis, ChromaDB, Whisper models, and audio storage.
• Optionally integrate a service mesh (Istio, Linkerd) for observability and security.
Scalability & Observability
• Configure load balancers for SIP/RTP and HTTP traffic.
• Set up horizontal and vertical pod autoscaling.
• Deploy monitoring and metrics tools (Prometheus, Grafana).
• Integrate centralized logging (ELK/EFK stack).
• Perform system integration and load testing.
Security & Backup
• Enforce RBAC, Network Policies, and Pod Security standards.
• Manage secrets securely (Kubernetes Secrets, Vault).
• Implement backup and disaster recovery strategies (Velero, Redis/ChromaDB snapshots).
• Conduct security audits and resilience testing.
Required Technical Skills
• Cloud Platforms: GCP (preferred), AWS, Azure.
• Orchestration: Kubernetes (EKS, GKE, AKS, or self-hosted clusters).
• Telephony/VoIP: Asterisk in containers, SIP/RTP traffic management.
• Databases & Caching: Redis (cluster mode, persistence), ChromaDB.
• AI/ML: Deployment and optimization of Whisper models.
• DevOps: Docker, CI/CD (GitLab CI, GitHub Actions, ArgoCD).
• Monitoring & Logging: Prometheus, Grafana, ELK/EFK stack.
• Security: RBAC, Network Policies, HashiCorp Vault.
• Backup & Recovery: Velero, snapshots, disaster recovery strategies.
• Networking: Load balancing (NGINX Ingress, cloud LBs), session affinity.