Website PickTwo
**About PickTwo**
We provide platform engineering expertise that empowers AI teams to move fast without breaking things.
**Role Snapshot**
– Build secure, automated cloud infrastructure that supports data and ML workloads
– Implement observability, reliability, and cost-optimisation practices
– Enable a world-class developer experience for hybrid squads
**What You Will Do**
– Design and manage infrastructure as code (Terraform/Pulumi) across AWS/Azure/GCP
– Stand up Kubernetes, serverless, and data platform components with strong guardrails
– Instrument logging, metrics, and alerting (Prometheus, Grafana, OpenTelemetry)
– Partner with security to embed DevSecOps controls and incident response plans
**What You Bring**
– 5+ years in DevOps/SRE/platform roles supporting production workloads
– Deep understanding of CI/CD, networking, secrets management, and identity
– Experience supporting ML/AI pipelines or high-throughput data systems
– Calm approach to incidents and a love for documentation and knowledge sharing
**Why This Role Matters**
Reliable infrastructure keeps mission-critical AI services online. Your work multiplies the impact of every delivery squad.
To apply for this job email your details to support@picktwo.africa.