Our client is seeking an AI Operations Platform Consultant for a 6-month contract. The consultant will work a hybrid schedule and be responsible for deploying, managing, operating, and troubleshooting containerized services on Kubernetes.
Requirements
- Ability to pass an in-depth background check
- Experience deploying, managing, operating, and troubleshooting containerized services at scale on Kubernetes (OpenShift) for mission-critical applications
- Experience deploying, configuring, and tuning LLMs using TensorRT-LLM and Triton Inference Server
- Experience deploying and troubleshooting LLMs on a containerized platform, including monitoring, load balancing, and related operational tasks
- Experience with standard processes for operating mission-critical systems: incident management, change management, event management, etc.
Benefits
- Excellent benefits
- Compensation packages