Vyro is a rapidly growing Gen-AI and SaaS-focused company that empowers creativity across industries with state-of-the-art tools. The company is looking for an LLM Engineer to design, develop, and fine-tune LLMs, build agentic AI workloads, and deploy scalable solutions.
Requirements
- 4+ years of industry experience in Machine Learning or NLP
- Bachelor’s degree in Computer Science (BSCS) or a related field
- Frontier Model Orchestration
  - Deep experience leveraging closed-source SOTA models from OpenAI, Anthropic, Google, and xAI
  - Strong understanding of complex reasoning, tool use, and multi-step AI pipelines
- Advanced Architectures
  - Expert grasp of transformer variants and Mixture-of-Experts (MoE) architectures
  - Proven hands-on experience with open-weight SOTA models such as Llama 3.x, Mistral Large, Qwen 2.5, and Phi-4
- Agentic Frameworks
  - Mastery of multi-agent orchestration using frameworks like LangGraph (stateful agents), AutoGen, or CrewAI
  - Experience implementing DSPy for declarative, self-optimizing prompt pipelines
- Production RAG & Memory Systems
  - Implementation experience with GraphRAG and hybrid retrieval strategies
  - Expertise with vector stores (Qdrant, Milvus, Weaviate) and semantic caching for long-term agent memory
- Inference Optimization
  - Experience deploying high-throughput models using vLLM, TensorRT-LLM, or SGLang
  - Familiarity with FlashAttention-2, KV caching, and quantization techniques (AWQ, EXL2)
Benefits
- Competitive salary
- Benefits package