Are you passionate about open-source LLMs like LLaMA, DeepSeek, or Qwen? We're on the lookout for an Associate AI Engineer eager to work hands-on with cutting-edge models, conduct exploratory research, and deliver impactful AI-driven products.
What You'll Be Doing
- Working with open-source LLMs (DeepSeek, Qwen, LLaMA, etc.) and deploying them both locally and in the cloud
- Fine-tuning models with domain-specific knowledge
- Implementing model distillation and quantization
- Preparing and managing high-quality training datasets
- Designing scalable and efficient ML architectures
- Engaging in traditional ML model development and training
- Splitting your time 50/50 between exploratory R&D and delivering production-grade solutions
Required Skills
- Strong proficiency in Python with hands-on AI/ML experience
- Experience deploying models on local hardware (NVIDIA GPUs or CPU-only setups)
- Comfortable in Linux environments and using tools like Docker, Git, and other basic DevOps tools
- Ability to read research papers and rapidly prototype experimental ideas
Bonus Skills
- Experience with transformer-based models using Hugging Face Transformers, PyTorch, or TensorFlow
- Understanding of prompt engineering, fine-tuning strategies (LoRA, PEFT, QLoRA), and evaluation techniques
- Familiarity with quantization and model compression (bitsandbytes, GGML, GPTQ)
How to Apply
Please send us:
Links to GitHub repositories, fine-tuning notebooks, or any LLM projects you’ve worked on (Please add links in your CV)
A short paragraph about your favorite recent LLM release and why it excites you
---------------------------------------------------------------------------
Generating Apply Link...