Vyro is a rapidly growing Gen-AI and SaaS-focused company that empowers creativity across industries with state-of-the-art tools. The company is looking for an LLM Engineer to design, develop, and fine-tune LLMs, build agentic AI workloads, and deploy scalable solutions.

Requirements

4+ years of industry experience in Machine Learning or NLP
Bachelor’s degree in Computer Science (BSCS) or a related field
Frontier Model Orchestration
Deep experience leveraging closed-source SOTA models from OpenAI, Anthropic, Google, and xAI
Strong understanding of complex reasoning, tool-use, and multi-step AI pipelines
Advanced Architectures
Expert grasp of transformer variants and Mixture-of-Experts (MoE) architectures
Proven hands-on experience with open-weight SOTA models such as Llama 3.x, Mistral Large, Qwen 2.5, Phi-4, etc.
Agentic Frameworks
Mastery of multi-agent orchestration using frameworks like LangGraph (stateful agents), AutoGen, or CrewAI
Experience implementing DSPy for declarative, self-optimizing prompt pipelines
Production RAG & Memory Systems
Implementation experience with GraphRAG and hybrid retrieval strategies
Expertise with vector stores (Qdrant, Milvus, Weaviate) and semantic caching for long-term agent memory
Inference Optimization
Experience deploying high-throughput models using vLLM, TensorRT-LLM, or SGLang
Familiarity with FlashAttention-2, KV caching, and quantization techniques (AWQ, EXL2)

Benefits

Competitive salary
Benefits package

Requirements

4+ years of industry experience in Machine Learning or NLP
Bachelor’s degree in Computer Science (BSCS) or a related field
Frontier Model Orchestration
Deep experience leveraging closed-source SOTA models from OpenAI, Anthropic, Google, and xAI
Strong understanding of complex reasoning, tool-use, and multi-step AI pipelines
Advanced Architectures
Expert grasp of transformer variants and Mixture-of-Experts (MoE) architectures
Proven hands-on experience with open-weight SOTA models such as Llama 3.x, Mistral Large, Qwen 2.5, Phi-4, etc.
Agentic Frameworks
Mastery of multi-agent orchestration using frameworks like LangGraph (stateful agents), AutoGen, or CrewAI
Experience implementing DSPy for declarative, self-optimizing prompt pipelines
Production RAG & Memory Systems
Implementation experience with GraphRAG and hybrid retrieval strategies
Expertise with vector stores (Qdrant, Milvus, Weaviate) and semantic caching for long-term agent memory
Inference Optimization
Experience deploying high-throughput models using vLLM, TensorRT-LLM, or SGLang
Familiarity with FlashAttention-2, KV caching, and quantization techniques (AWQ, EXL2)

Benefits

Competitive salary
Benefits package

LLM Engineer

About the Company

Job Description

Requirements

Benefits

Similar Jobs

LLM Engineer

AI Prompt Engineer

AI Prompt Intern - (Engineering, Design, Business & Arts)

LLM Engineer

About the Company

Job Description

Requirements

Benefits

Similar Jobs

LLM Engineer

AI Prompt Engineer

AI Prompt Intern - (Engineering, Design, Business & Arts)

Job Details

About Vyro