Infrastructure Engineer
Imizizi
Reference: JHB001411-NS-1
ESSENTIAL SKILLS
- Cloud infrastructure engineering (Azure preferred) with a focus on high-availability, scalable AI platforms (Kubernetes, container orchestration, networking, IAM).
- Strong hands-on experience with Kubernetes (AKS), Helm, and platform-level CI/CD pipelines.
- Solid understanding of conversational AI architectures (LLM-based services, APIs, grounding layers, vector stores).
- Infrastructure-as-Code expertise (Terraform, ARM/Bicep) for reproducible and compliant environments. Security-by-design mindset: identity, secrets management, network isolation, and secure service communication.
- Observability fundamentals: logging, metrics, tracing for AI workloads (latency, token usage, cost drivers). Strong collaboration skills with Dev, Data Science, and Product to translate functional requirements into resilient infrastructure.
ADVANTAGEOUS SKILLS
- Experience operating enterprise-grade AI platforms under regulatory, data protection, and compliance constraints.
- Knowledge of cost optimisation for AI workloads (GPU/CPU trade-offs, scaling strategies, usage-based charging).
- Exposure to MLOps / LLMOps concepts (model deployment, versioning, prompt lifecycle, evaluation).
- Familiarity with event-driven architectures (Kafka, Azure Event Hub) in AI-driven systems.
- Experience with cross-region / multi-environment setups (DEV, INT, PREPROD, PROD).
ROLE & RESPONSIBILITIES
- Own the end-to-end infrastructure landscape for Conversational AI platforms (e.g. ConvAIS).
- Design, implement, and operate secure, scalable, and cost-efficient AI runtime environments.
- Act as the technical backbone between product, development, data science, and operations.
- Define infrastructure standards, reference architectures, and guardrails for conversational AI use cases.
- Ensure platform reliability, performance, and availability, including incident support and root cause analysis.
- Enable fast but compliant onboarding of new AI use cases and teams.
- Drive automation first: CI/CD, environment provisioning, monitoring, and recovery.
- Contribute to platform roadmap discussions with a strong infrastructure and operational perspective. Act as a multiplier: mentoring engineers and spreading infrastructure and DevOps maturity.
QUALIFICATIONS/EXPERIENCE
- Degree in Computer Science, Information Technology, Engineering, or comparable practical experience.
- 5+ years of professional experience in infrastructure, platform engineering, or DevOps roles.
- Proven track record operating mission-critical platforms in an enterprise environment.
- Hands-on experience with cloud-native architectures and Kubernetes in production.
- Practical exposure to AI/ML platform support is required; conversational AI experience is a strong plus.
- Experience working in cross-functional, international teams.
- Strong problem-solving mindset with the ability to balance speed, stability, and compliance.
Submit your CV to: ***email_hidden*** and Subject line