Job Application for Data Engineer at RA Capital Management, LLC

Imagine if you had the skills, knowledge, and teammates to both understand the root of the world’s most pressing problems and build the technologies and companies best positioned to solve them. RA Capital has done exactly that for more than two decades, backing bold ideas in medicines to further human health and now expanding into Planetary Health to improve how efficiently we utilize the world’s precious resources.

RA Capital is among the leading providers of capital and services to the most promising innovators in the world. We invest flexibly (seed to IPO and beyond, anywhere in the world) with $10B+ under management and a culture that prizes curiosity, rigor, and collaborative debate. We are investors who not only fund companies but get elbow-deep in building them. From helping them recruit talent, to helping them recruit patients for their studies, to matching them with strategic partners, and even going to Washington to win reforms, RA Capital’s large team has people with nearly every relevant expertise one might need to turn an idea into a cure that actually helps people.

If you live for first-principles problem-solving with great colleagues, thrive on complexity, and want to do meaningful work that ripples across industries and ecosystems, you’ll feel at home at RA Capital. Here, questions are welcomed, ideas are tested, and victories are shared. Even our lawyers are creative and engaging. And don’t get us started on our compliance team’s wicked sense of humor; nothing about what we do is boring.

Are you ready to bring your creativity, discipline, and collaborative spirit to help us invent the future? Join us and you’ll collaborate daily with investors, founders, physicians, biologists, engineers, economists, and reform advocates who think in systems and act with urgency.

Join us to invent a happier, healthier, more productive future - and have fun doing it.

About the Team

RA Capital’s Data Engineering team is responsible for ensuring high-quality, reliable, and accessible data throughout the organization. We emphasize data integrity, compliance, and usability to support strategic decision-making across RA Capital. Our team oversees the complete data lifecycle—partnering with internal stakeholders and external vendors—to build scalable data infrastructure that fuels a data-driven culture.

About the Role

We are seeking a skilled Data Engineer with a strong interest in AI/LLM-powered data access to join our Data Engineering team. This role is pivotal in designing and maintaining robust data pipelines and in extending data accessibility through AI-driven solutions.

The ideal candidate will possess deep technical knowledge in data engineering and a working understanding of large language model (LLM) systems and the Model Context Protocol (MCP). You’ll help bridge structured enterprise data with AI interfaces that power self-service and natural language query workflows.

Responsibilities

Design, build, and optimize end-to-end enterprise data pipelines for ingesting and integrating vendor data.
Develop and maintain robust ETL processes and data integrations between data warehouses (e.g., Databricks) and downstream applications.
Write production-level Python and SQL code to standardize, reconcile, and match healthcare data, applying NLP and ML techniques when needed.
Develop scalable data models in Databricks to support efficient reporting and analytics across clinical, financial, and operational datasets.
Implement rigorous data quality controls and validation checks to ensure data accuracy and compliance.
Collaborate with external data vendors to define delivery specifications and transformation logic.
Partner with internal IT, analytics, and business stakeholders to align data efforts with organizational objectives.
Work closely with AI/ML engineers and product teams to support LLM-based data access layers above Hasura or similar GraphQL engines.
Contribute to the integration and evaluation of Model Context Protocol (MCP) in real-world applications, enabling scalable, secure, and interpretable LLM usage.
Document data architectures, pipelines, workflows, and processes for both technical and non-technical audiences.
Provide Tier 1 support for monitoring data flows and resolving pipeline or integration issues.
Ensure ongoing compliance with data governance and security standards.

Key Skills & Experience

1–2+ years of experience in a data engineering role.
Expertise in building scalable ETL/ELT pipelines and data integration workflows.
Strong skills in Python, SQL, and Spark. Experience with Java is a plus.
Hands-on experience with Databricks; familiarity with AWS (S3, EC2, EBS) preferred.
Strong understanding of data validation, quality assurance, and compliance practices.
Exposure to LLM applications and AI-driven data interfaces, particularly in structured enterprise data environments.
Familiarity with Model Context Protocol (MCP) and how it supports contextual integrity, auditability, and chain-of-thought in AI/LLM-based data access.
Proven ability to manage external data vendors and collaborate on schema, format, and delivery improvements.
Ability to clearly convey technical details to non-technical stakeholders and align data projects with business needs.

Key Requirements

Master’s degree or higher from a top Computer Science or Data Science program.
1–2+ years of experience in data engineering, software development, and managing production-grade pipelines.
Must be based in the Boston area.
Ability to work a hybrid schedule in our Boston office.
Must be authorized to work in the United States.

RA Capital is an equal opportunity employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or veteran status. If you require an accommodation during the interview process, please reach out to careers@racap.com for assistance.

Do you have a Bachelor’s and/or Master’s degree in Computer Science, Data Science, or a related field?

Are you currently based in Massachusetts? This is a local on-site/hybrid position in our downtown Boston office. Relocation assistance is not provided.

Can you describe your hands-on experience with AI/ML, big data, data pipelines, and AWS?

Are you authorized to work in the United States without sponsorship?

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in RA Capital Management, LLC’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Voluntary Self-Identification of Disability

Form CC-305

OMB Control Number 1250-0005

Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

Alcohol or other substance use disorder (not currently using drugs illegally)
Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
Blind or low vision
Cancer (past or present)
Cardiovascular or heart disease
Celiac disease
Cerebral palsy
Deaf or serious difficulty hearing
Diabetes
Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
Epilepsy or other seizure disorder
Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
Intellectual or developmental disability
Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
Missing limbs or partially missing limbs
Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
Partial or complete paralysis (any cause)
Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
Short stature (dwarfism)
Traumatic brain injury

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.
