Data Engineer
About us
Thesis is building the AI-powered care team platform for infinitely scalable clinical capacity. We radically increase access and improve quality of care by combining AI agents with clinical experts to take on high-impact clinical operations and care management activities for healthcare organizations.
We’re based in NYC, are growing rapidly, and are backed by $60 million in funding from Oak HC/FT, CRV, Black Opal Ventures, and experienced C-level healthtech angel investors.
About the role
We are looking for a highly driven Data Engineer to help design, build, and scale the data foundations that power Thesis’s AI agents, including owning critical client integrations. You will work closely with the product, engineering, and AI/ML teams to ensure Thesis’s data architecture is scalable, reliable, secure, and actionable. This is a broad-scope, high-impact role for a technical builder who thrives in ambiguity and wants meaningful ownership over modern data infrastructure in an early-stage healthcare startup.
Responsibilities
- Build core data infrastructure: Design, implement, and maintain scalable, data pipelines and architectures across ingestion, transformation, and storage layers
- Own client integrations and data ingestion: Structure and maintain durable integrations and ETL workflows to reliably onboard and normalize client data.
- Design robust data models: Build and maintain clean, well-documented data models that support reporting, analytics, and downstream automation.
- Enable product and analytics outcomes: Ensure data flows accurately from source systems to products, analytics, and AI/ML use cases.
- Drive data quality and reliability: Implement monitoring, testing, and observability to ensure high data quality and system uptime.
We expect you to have:
- Strong technical foundation: 4-6+ years of experience in Data Engineering or Software Engineering with a data focus; strong proficiency in Python and SQL.
- Modern data stack experience: Hands-on experience with cloud data platforms (AWS and/or GCP), data warehouses (Snowflake, Redshift, or BigQuery), orchestration tools (Airflow or similar), and transformation tools (dbt preferred).
- Healthcare data chops: Familiarity integrating with healthcare data standards and systems (e.g., FHIR/HL7, EHR APIs), including working with messy clinical/claims data.
- Data modeling expertise: Experience designing data models for different sources, workflows, and business processes.
- Clear communication: Ability to explain complex technical concepts to both technical and non-technical stakeholders.
- NYC-based: You are based in New York and excited to be in-office 3+ days per week.
Target compensation for this role is $200-$250k, plus equity and a generous benefits package.
Thesis Care is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.
Create a Job Alert
Interested in building your career at Thesis? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field
.png?1774412043)