
Data Engineer
Company Overview
Outer Biosciences’ mission is to improve human skin health by discovering and developing novel bioactives that are more effective and safer than anything available today. Founded by a scientific team with deep experience in engineering advanced organ systems, our fully-integrated technology platform combines complex, long-lasting, clinically-relevant ex vivo skin models with multimodal data analysis and predictive machine learning.
Role Overview
Outer Biosciences is seeking a Data Engineer to join our data science team and help drive the development of our data platform in support of our innovative bioactive discovery efforts. This critical position requires proven skills in data infrastructure engineering. We are a small, motivated team looking for an individual deeply committed to and motivated by collaboration and learning. The role provides a unique opportunity to help steer the development and implementation of a visionary strategy that will play a crucial role in transforming skin health through pioneering bioactive discovery. A hybrid working arrangement is preferred.
Key responsibilities
- Design, implement, and support scalable data storage solutions for large volumes of heterogeneous scientific data.
- Develop reusable code and tools to improve the efficiency and scalability of Outer Bio’s data platform
- Contribute to the development and implementation of data quality control and governance policies and procedures in accordance with best practices
- Optimize, deploy, and maintain data pipelines for next-generation sequencing (NGS) analysis, histology analysis, and machine learning.
- Make data exploration broadly accessible by developing and supporting interactive data visualization dashboards
- Develop and execute new methods and protocols, and proactively incorporate new technology or techniques into practice with minimal supervision.
- Maintain strong collaboration with Outer's executive, biology, and business teams to drive integrated research and development efforts in support of company-wide initiatives
Required Qualifications
- Degree in Engineering, Computer Science, Bioinformatics, Computational Biology, or a related discipline
- Demonstrated experience (3+ years) managing data systems in a production setting as a data engineer in a biopharmaceutical or life sciences company
- Understanding of data science workflows in bioinformatics and/or ML-driven research
- Advanced proficiency in Unix/Linux, Python, and modern developer tools
- Highly productive, agile, and resilient with exceptional troubleshooting skills.
- Energetic self-starter and independent thinker, with strong attention to detail.
- Excellent written, verbal, and interactive communication skills. Fluency in English.
- Strong teamwork and collaboration skills with the ability to work effectively in a diverse group.
- Strong organizational skills with the ability to prioritize and manage multiple projects simultaneously
Bonus points
- Familiarity with scientific data standards, ontologies, and best practices for metadata capture
- Experience with workflow orchestration software (e.g., Nextflow, MLflow)
- Experience with creating user-facing data visualization
- Experience in advanced statistics, AI/ML, and/or predictive modeling
- Proficiency in R
- Knowledge of cloud computing services such as AWS or GCP
- Experience working with NGS data and analysis pipelines
We are deeply committed to diversity, equality, and inclusion in all its forms and practices. As a company, we promote inclusion and desire to work with people from all walks of life. As we grow our team, we strive to be the change we want to see. If your experience is somewhat different from what we've described and you believe you can bring value and contribute in the role, we'd be honored to learn more about you.