ML PhD Intern - LLMs & Generative AI

Truveta is the world’s first health provider-led data platform with a vision of Saving Lives with Data. Our mission is to enable researchers to find cures faster, empower every clinician to be an expert, and help families make the most informed decisions about their care. Achieving Truveta’s ambitious vision requires an incredible team of talented and inspired people with a special combination of health, software, and big data experience who share our company values.

Our headquarters are in the greater Seattle area, but we celebrate and embrace a remote culture. Participation in the internship program requires that you be physically present in the United States for the duration of the internship.

Who We Need 

We are seeking a highly motivated and talented Machine Learning PhD Intern to join our AI research team and contribute to our innovative projects in the fields of large language models (LLMs) and clinical data analysis. Beyond core capabilities, we are seeking problem solvers, passionate and collaborative teammates, and those willing to roll up their sleeves while making a difference. If you are interested in the opportunity to pursue purposeful work, join a mission-driven team, and build a rewarding career while having fun, Truveta may be the perfect fit for you.

Internship Details

Our Research PhD Internship is designed for candidates who have finished their coursework and are working only on the research needed to complete their PhD. Candidates must be within 1 year of their graduation date from their PhD program. Candidates are expected to demonstrate both independence in defining their research strategy within a domain and an ability to apply innovative solutions to our products. Our internships are designed to last a minimum of 10 weeks, with the opportunity to extend beyond the initially agreed term based on the company’s needs and the candidate’s desires.

This Opportunity

We are looking for machine learning experts who can apply their applied science and software development skills to building our Foundation Models, which help us address some of the hardest problems on the way to our vision of Saving Lives with Data. You will work in an exciting and fast-paced environment, collaborating closely with multiple teams across the company. You will work as part of an organization that brings together talent from diverse backgrounds including software engineering, big data, machine learning and AI, clinical informatics, and medicine, making our team an exciting place to work. We value and encourage diversity in the belief that our differences make us and our products better.

In this role, you will:

  • Collaborate with researchers and engineers to design, develop, and refine large language models and generative models for various applications.
  • Utilize your expertise in machine learning and natural language processing to develop novel algorithms and methodologies for generative modeling tasks.
  • Implement, train, and fine-tune LLM and GPT-like models on large-scale datasets to ensure optimal performance and accuracy.
  • Stay up to date with the latest research advancements and techniques in the field of language modeling, generative modeling, and machine learning.
  • Deliver the next generation of innovation in trustworthy healthcare.

Key Qualifications

  • Currently pursuing a Ph.D. in Computer Science, Electrical Engineering, or a related field, with a focus on machine learning, natural language processing (NLP), Large Language Models (LLMs), multi-modal foundation models, and generative AI
  • Strong theoretical and practical background in NLP including experience with state-of-the-art architectures
  • Proficiency in deep learning frameworks (e.g., PyTorch, TensorFlow, etc.) and libraries commonly used in NLP and Generative AI
  • Solid programming skills in Python and the ability to write clean, efficient, and well-documented code
  • Excellent problem-solving and troubleshooting abilities, along with a strong analytical mindset and persistence in resolving problems
  • Strong communication skills and the ability to work effectively in a collaborative research environment

Preferred Qualifications

  • Experience with distributed parallel training, large-scale multi-modal foundation and generative models
  • Familiarity with parameter-efficient tuning techniques, Reinforcement Learning from Human Feedback (RLHF), and prompt engineering techniques
  • Familiarity with training multi-modal foundation models
  • Familiarity with cloud-based infrastructure and experience deploying large-scale machine learning models in production environments
  • A track record of publications and contributions to the machine learning and natural language processing communities

This internship opportunity offers a unique chance to work on state-of-the-art language models and contribute to transformative research with the vision of Saving Lives with Data. You will be part of a dynamic team of researchers and engineers who are passionate about pushing the boundaries of machine learning and natural language understanding in the healthcare domain. Join us and make a significant impact on the future of healthcare and patient well-being.

Why Truveta? 

Be a part of building something special. Now is the perfect time to join Truveta. We have strong, established leadership with decades of success. We are well-funded. We are building a culture that prioritizes people and their passions across personal, professional and everything in between. Join us as we build an amazing company together. 

We offer: 

  • Competitive compensation
  • Company-issued laptop and equipment
  • Opportunities for future full-time positions
  • Hourly pay of $45

Truveta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

 
