
Data Engineer
About 10a Labs: 10a Labs is an applied research and AI security company trusted by AI unicorns, Fortune 10 companies, and U.S. tech leaders. We combine proprietary technology, deep expertise, and multilingual threat intelligence to detect abuse at scale. We also deliver state-of-the-art red teaming across high-impact security and safety challenges.
In this role, you will:
- Design, implement, and optimize end-to-end data pipelines for scraping and processing structured and unstructured data using Google Cloud Platform (or similar) and best practices.
- Automate red teaming, including developing automated workflows for prompt generation, model evaluation, and execution of AI experiments.
- Conduct ad hoc web scraping and data collection to support research and intelligence initiatives.
- Design and automate workflows and research experiments, including for data curation, storage, and organization.
- Brainstorm novel research approaches to both known and emerging problems involving AI, data, and the internet.
- Implement robust error handling, logging, and monitoring.
- Design and maintain database schemas and pipeline infrastructure.
- Prepare data for further analysis, including data cleaning, transformation, anonymization, and masking.
- Contribute to the development of internal and external APIs, following best practices.
- Collaborate with ML engineers, data engineers, and software developers to deliver actionable insights and functional tools, including internal and external dashboards, APIs, and data dumps.
Requirements:
- Degree (or equivalent work experience) in Computer Science, Engineering, Information Science, Data Science or a related field (graduate degree preferred).
- 2+ years of professional experience in data engineering or a closely related field.
- Ability to communicate complex technical ideas clearly to non-technical audiences
- Proficiency in Python, SQL.
- Experience with web scraping/crawling (e.g., Beautiful Soup, Selenium, Scrapy).
- Familiarity with Google Cloud Platform (or similar), including storage and database services (e.g., Cloud Storage, CloudSQL, Cloud Spanner) and workflow orchestration (e.g., Cloud Composer/Airflow, Cloud Run, Pub/Sub).
- Experience building and managing data pipelines, especially for text data.
- Comfort working in fast-moving, high-impact environments, such as startups, AI research labs, or security-focused teams.
- Experience deploying APIs on cloud platforms (GCP, AWS, Azure) with robust testing, CI/CD, and performance monitoring practices.
Compensation & Benefits:
- Salary Range: $105K–$125K, depending on experience and location.
- Bonus: Performance-based annual bonus.
- Professional Development: Support for conferences, continuing education, or leadership training.
- Work Environment: Fully remote, U.S.-based.
- Health Benefits: Comprehensive health, dental, and vision coverage.
- Time Off: Generous PTO and paid holiday schedule.
- Retirement: 401(k) plan.
Work With Us: 10a Labs is committed to building an inclusive, equitable workplace where diverse backgrounds, experiences, and perspectives are valued. We encourage applications from candidates of all identities and walks of life, and we believe our work is strongest when it reflects the world we serve.
Create a Job Alert
Interested in building your career at 10a Labs? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field