Data Engineer II

San Francisco

About Knit Health

Knit Health is building a novel clinical foundation model to improve the way healthcare is delivered. We combine expertise in AI with deep clinical knowledge to develop safe, trustworthy systems that improve care, expand access, and reduce waste. Knit is led by a founding team from the University of California, Berkeley, who have developed an AI architecture that learns to reason like physicians. We’re now closing the loop and using our foundation model, together with frontier clinical LLMs, to build a next-generation clinical intelligence platform for providers. We are venture-backed and have partnered with multiple US-based health systems and data providers.

What you’ll do 

As a Data Engineer II, you'll be a foundational member of a small, high-impact team building the data backbone of our clinical AI platform. Your work will directly enable the research, products, and decisions that shape where the company goes next.

  • Design, build, and maintain the data pipelines and infrastructure that power both our product and research applications — from ingestion through analytics-ready delivery
  • Partner closely with our data science and ML teams to integrate, structure, and scale the stack as our needs evolve
  • Help establish and uphold standards for data quality, testing, documentation, and observability across the stack
  • Navigate the complex and often ambiguous landscape of healthcare data, bringing clarity, organization, and thoughtful structure to messy problem spaces
  • Contribute to architectural decisions that will shape how we work with data at scale

Minimum qualifications

We're looking for candidates who meet one of the following:

  • 4-6 years of professional experience specifically in data engineering (building data pipelines, ETL/ELT workflows, data modeling, and warehouse architectures)
  • An advanced degree (MS or PhD) in data science, computer science, computer engineering, or an adjacent technical discipline, paired with demonstrable data engineering project work
  • A combination of internships, research, and substantial project experience that clearly demonstrates equivalent data engineering capability

Regardless of path, you should be able to demonstrate proficiency in SQL and Python and hands-on experience with at least one major cloud platform (Azure, AWS, etc.).

What we're looking for

  • Engineering fundamentals: comfort with version control (Git), code review, testing, and the habits of writing code others can read, maintain, and trust
  • SQL: strong command of joins, window functions, CTEs, and aggregate logic; a basic understanding of query performance and when to worry about it
  • Python: fluency writing clean, modular code for data manipulation, transformation, and scripting; familiarity with common libraries such as pandas and at least one testing framework (pytest or similar)
  • ML data processing: an understanding of basic machine learning and AI concepts and of typical AI/ML data workflows
  • Spark / distributed processing: working familiarity with PySpark and an understanding of how distributed compute differs from single-machine workflows
  • Cloud platforms: hands-on experience with at least one major cloud provider; Azure and Databricks preferred, but strong experience with AWS or GCP translates
  • Data engineering concepts: a solid grounding in batch and streaming processing, data modeling, orchestration, data quality, governance, and database fundamentals (both relational and columnar)
  • Communication: the ability to explain technical tradeoffs clearly, in writing and in conversation, to both engineers and non-engineers
  • Healthcare: prior exposure to healthcare data or the healthcare domain more broadly

Nice-to-haves

  • Familiarity with healthcare interoperability standards such as FHIR and HL7
  • Awareness of healthcare privacy and compliance frameworks (HIPAA, BAAs, and similar)
  • An eye for compute cost structures and the instincts to build with efficiency in mind

Your first year

In your first few months, you'll get deep exposure to our existing data infrastructure, our healthcare data sources, and the research and product workflows your pipelines support. By the end of your first year, we'd expect you to:

  • Own meaningful pieces of our data platform end-to-end, from design through production
  • Lead the integration of a new data source or domain, including its modeling, quality safeguards, and downstream interfaces
  • Have raised the bar somewhere — whether in testing, documentation, cost, reliability, or developer experience
  • Be a trusted collaborator to our data science and ML teams, shaping how they work with data rather than just responding to requests

Team structure

  • You'll report to our Director of Data Engineering
  • You'll work alongside the broader data science team on shared infrastructure, tooling, and data problems
  • You'll partner closely with our core model AI team, i.e. the engineers and researchers who consume your data for model training, in a tight feedback loop where data quality directly shapes model performance
  • You'll have real visibility into how your work lands downstream and the impact it has on foundation model training

Salary Range

Knit Health offers a competitive compensation package that includes base salary, equity, and opportunities for advancement. The starting salary range for the Data Engineer II role is approximately $110,000 to $120,000 per year.

Benefits

Generous benefits for full-time employees include: medical, dental, and vision coverage with 100% of premiums paid for employees and dependents (full coverage for dental, vision, and our Gold medical plan; employees may choose to buy up to Platinum); coverage begins on the first day of employment. Additional benefits include a 401(k) plan and 24 days of PTO annually.

Final Notes

Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without notice.

Knit Health is an equal opportunity employer and is committed to a diverse workplace. People from diverse racial, ethnic and cultural backgrounds, women, LGBTQ+ individuals, and persons with disabilities are highly encouraged to apply. 

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in Knit Health’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed Forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Voluntary Self-Identification of Disability

Form CC-305
Page 1 of 1
OMB Control Number 1250-0005
Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.