Back to jobs
New

Senior Data Engineer – Healthcare Data & AI Systems

Remote

Job Title: Senior Data Engineer – Healthcare Data & AI Systems

Location: Remote (Colorado-based)

Role Type:  Full time

Reports to: Chief Technology Officer

Company: SideBy Care

About SideBy Care

SideBy Care is the first AI-powered virtual care service for GI practices and their patients with Disorders of Gut-Brain Interaction (DGBI). Disorders of Gut-Brain Interaction (like IBS) affect over 100 M people in the US alone. 

Our gut and brain are closely connected through a network of nerves, hormones, and chemicals. They constantly communicate with each other, which is why stress or anxiety can cause symptoms like cramping or diarrhea. Similarly, gut problems can affect mood and stress levels, creating a cycle that impacts both your digestive and mental health.

SideBy Care provides gut-brain therapy with diet & lifestyle support in a virtual care service covered by insurance. 92% of patients report symptom improvement by 6 weeks!

We are seeking a Senior Data Engineer to build and manage the data backbone of our platform, spanning AI Agents, AI-driven insights, reporting and robust warehouse infrastructure in Snowflake.

Role Overview

As a Senior Data Engineer, you will lead the architecture and implementation of complex data systems that power our analytics, clinical decision-making, and care optimization efforts. You’ll design pipelines to move, transform, and validate data from EMRs and other healthcare systems into a warehouse environment optimized for analysis, reporting, and machine learning.

You will also work at the frontier of healthcare AI—enabling predictive models and reasoning systems using tools like TensorFlow, LLMs (e.g., Claude, LLaMA, GPT), and advanced analytics frameworks.

This is a highly technical and strategic role with visibility into product, clinical, and executive decision-making.

Responsibilities

  • Architect and implement robust data pipelines between EMRs, internal systems, and Snowflake, ensuring scalability, reliability, and data provenance
  • Lead the design of warehouse schemas for multiple use cases: transactional processing, reporting (BI), and statistical/ML analysis
  • Define and enforce standards for data semantics, integrity, quality, lineage, and access control
  • Collaborate with data scientists and ML engineers to enable production-grade ML workflows (e.g., TensorFlow pipelines, model monitoring, A/B testing infrastructure)
  • Experiment with and support the deployment of LLMs to enable reasoning, summarization, and classification on structured and unstructured data (e.g., clinical notes)
  • Build monitoring and alerting around pipeline health and data trustworthiness
  • Integrate and normalize complex healthcare data sources (FHIR/HL7, custom APIs, third-party vendors) into a unified analytics model
  • Partner with engineering and product teams to deliver data-driven features, dashboards, and insights

Requirements

  • 5+ years of experience in data engineering or backend systems, with senior or staff-level contributions
  • Deep Python proficiency, with production experience in ETL, data validation, and orchestration frameworks (e.g., Airflow, Dagster, dbt)
  • Strong experience with data warehouse design, including star/snowflake schemas, denormalization strategies, and performance optimization
  • Strong understanding of data privacy and security practices, especially in healthcare (HIPAA, de-identification, audit logging, etc.)
  • Proven experience managing complex integrations with EMRs or clinical systems
  • Familiarity with LLM and ML development tools (e.g., TensorFlow, PyTorch, LangChain, transformers, vector DBs)
  • Experience deploying or supporting predictive models in production environments
  • Expertise in Snowflake or similar cloud data platforms (e.g., BigQuery, Redshift)
  • Strong grasp of data modeling, provenance, and semantics for analytical and AI purposes
  • Experience working with AWS services such as S3, Lambda, Batch, Event Bridge, Cloud Front, EC2, etc

Nice to Have

  • Experience working with graph-based reasoning engines or healthcare ontologies
  • Knowledge of analytics frameworks like Superset or Looker
  • Familiarity with HL7, FHIR, or other clinical interoperability standards
  • Exposure to real-time or streaming data systems (Kafka, Pulsar)

Why Join Us?

  • Help make a new form of AI-driven virtual care available for millions of people with gut-brain conditions (like Irritable Bowel Syndrome and more than 30 other conditions)
  • Be a foundational contributor to a modern healthcare data stack and AI platform
  • Shape how LLMs and ML are responsibly deployed in real-world clinical settings
  • Work with a small, fast-moving, mission-driven team of engineers and clinicians
  • Competitive pay
  • Flexible remote work culture

 

The compensation package is based on the candidate's professional experience and certifications. 

Sideby Care Compensation Range

$80 - $110 USD

Apply for this job

*

indicates a required field

Resume/CV

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf