As an AI Kernel Optimization Engineer, you will play a key role in pushing the limits of AI inference performance on Openchip RISC-V platforms.
Requirements
- MSc or PhD in Computer Engineering or Computer Science, or equivalent practical experience
- 3+ years of experience in performance optimization for AI inference or HPC use cases
- Strong background in low-level performance optimization (vectorization, memory access optimization, loop unrolling, instruction scheduling, data tiling, etc.)
- Proficiency in C/C++ and a good understanding of assembly-level optimizations (SIMD, intrinsics, compiler flags)
- Solid understanding of CPU/GPU/AI accelerator architecture (pipelines, caches, memory hierarchies, compute units)
- Experience with RISC-V architectures or other custom ISAs
- Experience with profiling and performance analysis tools (perf, VTune, nvprof, etc.)
- Strong knowledge of parallel programming (SIMD, multithreading, OpenMP, CUDA, or similar)
- Solid software engineering skills (version control, CI/CD, testing)
Benefits
- Meal vouchers
- Premium health coverage
- Sustainable mobility incentives
- Generous paternity leave