Back to jobs
New

Data Engineer

Rockville, MD
(ID: 2026-2532)

 


Axle is a bioscience and information technology company that offers advancements in translational research, biomedical informatics, and data science applications to research centers and healthcare organizations nationally and abroad. With experts in biomedical science, software engineering, and program management, we focus on developing and applying research tools and techniques to empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH).

 

Benefits We Offer:

  • 100% Medical, Dental & Vision Coverage for Employees
  • Paid Time Off and Paid Holidays
  • 401K match up to 5%
  • Educational Benefits for Career Growth
  • Employee Referral Bonus
  • Flexible Spending Accounts:
    • Healthcare (FSA)
    • Parking Reimbursement Account (PRK)
    • Dependent Care Assistant Program (DCAP)
    • Transportation Reimbursement Account (TRN)

We are seeking a Data Engineer to support biomedical science, clinical research data integration, and advanced data analysis initiatives. In this role, you will design, build, optimize, and maintain data pipelines and data workflows that support the ingestion, transformation, harmonization, validation, and delivery of complex biomedical datasets. You will collaborate closely with scientists, researchers, data scientists, bioinformaticians, application developers, and technical stakeholders to ensure data is accessible, well-structured, secure, documented, and reusable for biomedical research, analytics, reporting, and discovery. 

The ideal candidate will have strong experience with Python, SQL, ETL/ELT development, data modeling, data quality practices, and research data lifecycle support. This role requires the ability to work with complex multi-source datasets, support analytics and application-facing data products, and contribute to scalable, well-governed data solutions that align with the Data Science Client Services branch priorities for data accessibility, interoperability, reproducibility, modernization, and secure research enablement. 

Key Responsibilities 

  • Data Pipeline Development: Design, build, test, and maintain data pipelines to ingest, transform, harmonize, and integrate diverse biomedical and research data sources, including clinical, genomic, experimental, imaging, biospecimen, operational, and other scientific datasets. Develop reusable transformation logic and curated datasets that support analytics, reporting, dashboards, applications, APIs, and downstream research workflows. 

  • Data Integration and Lifecycle Support: Support the full research data lifecycle by enabling reliable data movement from source systems and storage environments into structured, analysis-ready formats. Assist with data ingestion, curation, metadata capture, data refreshes, source-to-target mapping, schema management, and long-term maintainability of data products and workflows. 

  • Collaboration: Work closely with data scientists, bioinformaticians, researchers, application developers, project managers, and government stakeholders to gather requirements and deliver practical data solutions. Translate scientific and operational data needs into technical specifications, data models, transformation logic, and reusable datasets that accelerate biomedical research workflows and support informed decision-making. 

  • Quality & Governance: Implement data validation checks, reconciliation routines, testing practices, and monitoring processes to ensure data accuracy, completeness, consistency, and integrity. Follow data governance and security best practices, including documentation of transformations, lineage, assumptions, access requirements, and compliance considerations related to sensitive, regulated, de-identified, or access-controlled research data. 

  • Dashboarding & Integration: Create or support interactive dashboards, reporting layers, APIs, and application-ready datasets that allow researchers and stakeholders to visualize, explore, and analyze data. Support integration between data pipelines, databases, cloud platforms, analytics environments, and approved application platforms to enable scalable and secure data access. 

  • Operational Support and Modernization: Troubleshoot data pipeline failures, source system inconsistencies, data quality issues, schema changes, access issues, and performance bottlenecks. Contribute to modernization efforts by improving automation, documentation, scalability, reproducibility, and platform readiness across environments. 

Required Qualifications 

  • Education & Background: Bachelor’s degree in Computer Science, Data Science, Bioinformatics, Biomedical Informatics, Information Systems, Engineering, or a related field, or equivalent practical experience. Proven experience as a Data Engineer, Analytics Engineer, Data Integration Developer, Bioinformatics Engineer, or similar data-intensive role, preferably supporting analytics, biomedical research, healthcare, scientific computing, or research data teams. 

  • Data Engineering Expertise: Strong proficiency in Python and SQL for data manipulation, transformation, scripting, automation, and analysis. Hands-on experience building ETL/ELT processes and data pipelines to support large, complex, multi-source datasets. Familiarity with scalable data processing approaches, including Spark/PySpark or similar frameworks, for high-volume or complex transformations is required. 

  • Analytical Skills: Solid understanding of data modeling, relational databases, data warehouses, data lakes, metadata, and database concepts. Ability to work with complex, multi-modal datasets, including structured, semi-structured, and unstructured data, and optimize data workflows for reliability, performance, usability, and long-term maintainability. 

  • Best Practices: Knowledge of software engineering and data engineering best practices, including version control using Git, code review, automated testing, documentation, peer review, and change management. Experience ensuring data quality and using lineage, provenance tracking, audit trails, or documentation practices to support transparency, reproducibility, and data flow traceability. 

  • Collaboration & Communication: Excellent problem-solving skills and the ability to communicate effectively with both technical and non-technical stakeholders. Comfortable working in an interdisciplinary environment with biomedical researchers, analysts, developers, and project teams. Capable of translating domain-specific needs into technical solutions and explaining technical risks, limitations, and dependencies in clear stakeholder-focused language. 

  • Domain Alignment: Strong interest in biomedical science, clinical research, healthcare data, and scientific discovery. Ability to quickly learn domain-specific concepts, data structures, terminology, and research workflows. Demonstrated awareness of sensitive data handling, privacy, access control, data governance, and regulatory or compliance expectations associated with biomedical and clinical research data. 

Preferred Qualifications (Plus Skills) 

  • Platform-as-a-Service and Data Platform Experience: Hands-on experience building data solutions in modern data platforms or platform-as-a-service environments such as Snowflake, Databricks, Palantir, cloud data warehouses, data lakes, or similar platforms. Experience supporting integrations across databases, cloud storage, APIs, analytics platforms, dashboards, and application environments is preferred. 

  • Research and Application Enablement: Experience preparing curated datasets for dashboards, APIs, web applications, reporting tools, notebooks, or scientific computing environments. Familiarity with research-facing tools and platforms such as Posit Connect, R/Shiny, Streamlit, Jupyter, Galaxy, Code Ocean, or similar analytics and application delivery environments is a plus. 

  • Cloud, Storage, and Automation Experience: Experience working with cloud or hybrid data environments, object storage such as S3, relational databases such as Postgres, automated data refreshes, scheduled jobs, API-based integrations, and secure data movement across controlled environments. 

  • Biomedical Domain Knowledge: Previous experience in biomedical research, healthcare analytics, clinical research, public health, pharmaceutical research and development, or scientific data management. Familiarity with biomedical data standards or datasets, such as clinical trial data, clinical imaging, laboratory data, biospecimen data, transcriptomics/genomic data, HL7/FHIR, CDISC, OMOP, or related standards, and an understanding of the scientific research process will help you excel in this role. 

  • Governance and Reproducibility: Experience supporting data governance, metadata management, data lineage, reproducible workflows, documentation standards, and secure handling of de-identified, sensitive, or access-controlled research datasets. 

 

Disclaimer: The above description is meant to illustrate the general nature of work and level of effort being performed by individuals assigned to this position or job description. This is not restricted as a complete list of all skills, responsibilities, duties, and/or assignments required. Individuals may be required to perform duties outside of their position, job description or responsibilities as needed.

The diversity of Axle’s employees is a tremendous asset. We are firmly committed to providing equal opportunity in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age, race, gender, religion, national origin, disability, marital status, covered veteran status, sexual orientation, status with respect to public assistance, and other characteristics protected under state, federal, or local law and to deter those who aid, abet, or induce discrimination or coerce others to discriminate.

Accessibility: If you need an accommodation as part of the employment process please contact: careers@axleinfo.com

This role has a market-competitive salary with an anticipated base compensation range listed below. Actual salaries will vary depending on a candidate’s experience, qualifications, skills, and location.

Create a Job Alert

Interested in building your career at Axle? Get future opportunities sent straight to your email.

Apply for this job

*

indicates a required field

Phone
Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf


Select...
Select...
Select...
Select...
Select...
Select...
Select...

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in Axle’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

Select...
Select...
Race & Ethnicity Definitions

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Select...

Voluntary Self-Identification of Disability

Form CC-305
Page 1 of 1
OMB Control Number 1250-0005
Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury
Select...

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.