At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are seeking individuals passionate about tackling challenges and are driven by execution.
Requirements
- Engineering degree in Electrical Engineering, Computer Engineering, Computer Science, or related field, with 10+ years of relevant experience in AI/ML hardware, software and infrastructure
- Strong background in deep learning and neural networks, in particular generative AI and Academic experience in computer architecture, hardware software co-design, performance modeling
- Proven experience analyzing and tuning inference performance on GPUs.
- Experience with processor and system-level performance modeling.
- Experience with common deep learning software packages like PyTorch, vLLM, etc. as well as understanding of model compilation and execution stack.
- Experience with OpenAI Triton and/or CUDA.
- Strong programming skills in C++, Python.
- Excellent communication and presentation skills
- Experience in customer engineering and field support for enterprise-level AI and datacenter products, with a focus on AI/ML software and generative AI inference, preferred.
Benefits
- Equal Opportunity Employment Policy