Data Scientist
Snapdocs is a rapidly growing company that is disrupting the residential mortgage market, bringing scalable and sophisticated software to a pillar of the US economy that still relies on fax machines and manila envelopes. Today, 20% of real estate transactions are processed through our platform. Our products rely on carefully designed workflows, AI-based automations, and empathetic user experiences to deliver best-in-class customer experiences. We are backed by investors like Sequoia, Y Combinator, and F-Prime.
We are an innovative team. As we expand our product offering to serve more customers in more ways, we need to grow our team with smart, hungry, and curious people. That’s where you come in…
About the Role
Snapdocs is looking for a Data Scientist to help us improve and scale our Document Quality Control (QC) services. You’ll partner with our lead data scientists to develop and optimize solutions for document classification, information extraction, and annotation detection—across hundreds of document types.
This is a high-impact role where you’ll apply the latest in NLP and Generative AI to real-world use cases, helping us build reliable, intelligent systems that make sense of messy, unstructured document data.
What You’ll Do
- Improve the performance and generalizability of Document QC services, including classification, extraction, and annotation detection
- Apply and test cutting-edge generative AI methods to improve outcomes across varied document types
- Optimize model performance for new customer document sets
- Partner with DS Operations to ensure high-quality training and evaluation data
- Contribute to service repositories and help productionize models in collaboration with Engineering
Initial Priorities
Your first project will be either:
- Testing, implementing, and optimizing a general classification methodology on a new customer dataset
or - Deploying and refining an information extraction pipeline for a new customer use case
What We’re Looking For:
- 3+ years in data science, preferably working on ML-driven products
- Experience with a wide range of machine learning approaches, including LLMs and traditional supervised/unsupervised learning
- Background in document classification or information extraction (text and images)
- Experience collaborating with Product and Engineering to ship models into production
Technical Skills:
- Expert in Python and ML libraries (e.g., Scikit-learn, HuggingFace)
- Comfortable with SQL and working in production data pipelines
- Hands-on experience with cloud platforms (AWS, GCP, or Azure)
- Understanding of service architecture and working with APIs
- Familiar with CI/CD practices and software engineering fundamentals
Nice to Have:
- Experience with semantic similarity techniques (e.g., BERT, RAG) for classification
- Background in prompt engineering and LLM observability
- Experience monitoring model drift and performance post-deployment
Who You Are
- A collaborative, thoughtful problem solver
- Curious and passionate about ML, NLP, and Generative AI
- Comfortable working on ambiguous problems with imperfect data
- Driven to turn research into scalable, production-ready systems
Apply Now
If you're excited to build next-gen AI systems that operate at real-world scale, we’d love to hear from you.
At Snapdocs, we believe our differences make us stronger. We’re building a team of curious, driven people from all backgrounds who are united by a shared desire to solve meaningful problems and build something that matters. We value trust, autonomy, and the kind of collaboration that brings out the best ideas—and the best in each other.
To support our team, we offer a comprehensive & thoughtful benefits package for all full-time employees, which includes:
- Excellent medical, dental, and vision coverage
- 401(k) with up to 4% company match
- 16 weeks of paid parental leave
- Flexible Paid Vacation Time Off + 10 Sick Days for exempt roles
- Generous Accrued Paid Vacation Time Off + 10 sick days for non-exempt roles
- Summer & Winter Break (~1-week each) + 9 Holidays per year
- Healthcare and Dependent Care FSA
- HSA Employer Contribution ($75-150 for individuals, $150-$250 for families)
- $15K Family Building Benefit (lifetime limit)
- Life and Disability Insurance
- $1,500 Annual Lifestyle Stipend to support your well-being
Please note: Part-time employees are not eligible for benefits at this time
Snapdocs is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. If you have a disability or special need that requires accommodation, please let us know.
California residents applying for positions at Snapdocs are subject to our candidate privacy policy. (www.snapdocs.com/california-candidate-privacy)
Apply for this job
*
indicates a required field