Join our dynamic team as a Large Language Model Developer/Machine Learning Engineer. If you are passionate about pushing the boundaries of natural language processing (NLP) and developing state-of-the-art language models, this opportunity is for you. We’re a team of innovators dedicated to solving real-world problems in healthcare using generative AI and other cutting-edge technologies. In this role, you’ll play a pivotal part in driving technological advancements within our company.

  • Develop and fine-tune advanced language models using HuggingFace Transformers, PEFT (Parameter-Efficient Fine-Tuning), and TRL (Transformer Reinforcement Learning) 
  • Implement efficient model training and deployment strategies using PyTorch, HuggingFace Accelerate, quantization libraries, and NVIDIA Triton Inference Server 
  • Collaborate with cross-functional teams to integrate language models into healthcare applications and optimize their performance 
  • Stay up to date with the latest advancements in NLP, experiment with emerging techniques, and contribute to the development of novel algorithms and architectures

Basic Qualifications:

  • Master’s or Ph.D. in Computer Science, Machine Learning, or a related field 
  • At least 3 years of experience in developing and deploying deep learning models, particularly in NLP 
  • Proficiency in Python and deep learning frameworks such as PyTorch or TensorFlow 


Preferred Qualifications:

  • Experience in developing and fine-tuning large language models using HuggingFace Transformers or similar libraries 
  • Hands-on experience with PEFT techniques for efficient model fine-tuning and adaptation 
  • Knowledge of text retrieval and ranking techniques, such as learning to rank or dense passage retrieval
  • Experience with distributed training and optimization techniques using frameworks like HuggingFace Accelerate 
  • Familiarity with deploying models using NVIDIA Triton Inference Server or similar model serving platforms 
  • Experience with prompt engineering techniques for synthetic data generation, and with curating training/evaluation datasets with human feedback for further fine-tuning and model alignment


