Back to jobs
New

Staff Data Engineer

Seattle, Washington, United States

ABOUT LVT

LVT is redefining how businesses operate in the physical world, moving beyond traditional security solutions to deliver AI-driven, actionable intelligence that makes sites smarter, safer, and more secure. Since pioneering our first mobile, solar-powered units, our commitment to scrappy, hands-on innovation has made us an established leader and one of the fastest-growing companies in intelligent site technology. We are building the next generation of solutions—from our physical units in the field to a powerful Agentic AI platform—that allows our customers to gain unprecedented visibility and control over safety, compliance, and operations. This is your chance to join a cutting-edge team that isn't just watching the world change, but actively building the technology that is changing it.

We’re a team that’s focused on growth and innovation, and we’re proud that our crew, products, and leadership are being recognized for it.

  • A Top-Tier Growth Company: Named one of the Financial Times’ Fastest Growing Companies 2025 and #10 on the Inc. 5000 Rocky Mountain Regional list for 2025.
  • Innovative Leadership: Our CEO, Ryan Porter, was named an EY Entrepreneur of the Year 2025, and our CTO, Steve Lindsey, was inducted into the Silicon Slopes CTO Hall of Fame in 2024.
  • Product & Software Excellence: We were named one of The Software Report’s Top 100 Software Companies of 2023 and are a winner of the Security Today Govies Award for 2025.

 

ABOUT THIS ROLE

LVT's AI systems are only as good as the data behind them. As we move toward Physical AI, the binding constraint shifts from model architecture to the data flywheel.

We are seeking a Staff Data Engineer to own that flywheel end to end including logs, sensor telemetry, labels and annotations, evaluation and benchmark sets. Every AI team trains and evaluates from a single stack that transforms data from the raw source through standardized, versioned, governed datasets. 

This is a senior individual-contributor and technical-leadership role; formal people management is not required. You will partner closely with AI/ML research, the ML platform / MLOps function. You own the data side of the contract that defines what a model consumes and emits and annotation, edge, and infrastructure teams. You should be equally comfortable discussing dataset schema design, storage and partitioning trade-offs for multimodal data, versioning and migration strategy, and the governance controls that keep sensitive video and sensor data safe.

ROLE RESPONSIBILITIES

  • Data Flywheel Ownership: Own the end-to-end loop that converts raw edge telemetry and video into labeled training data, frozen evaluation sets and feeds model outputs back into the next round.
  • Layered Dataset Pipelines: Build and own the pipelines that register raw source data, standardize it into a single well-defined schema, and join and aggregate it into curated datasets so every team trains, validates, and benchmarks from one consistent store through one reader, rather than copying and reformatting data per use case.
  • Labels & Annotation Data Lifecycle: Own how labels and semantic annotations are appended to datasets without rewriting source data, then versioned, quality-checked, and served, partnering with annotation and data-operations teams on label production and verification while you own the dataset, storage, and serving side.
  • Evaluation & Benchmark Sets: Own the frozen, versioned validation and benchmark datasets that make model comparisons valid over time stable enough that an accuracy delta reflects the model, not a shifting dataset including the review and scrubbing discipline required before any set is shared externally.
  • Dataset Versioning: Own schema and content versioning so producers can evolve datasets without breaking consumers opt-in versions, append-without-rewrite for new fields, and the reader/writer indirection that lets data migrate underneath clients on a controlled rollout instead of forced lockstep migrations.
  • Framework Integration & Self-Serve Access: Own the read/write libraries and integrations researchers depend on PyTorch/Lightning dataloaders, a simple record-level CRUDL API, and Spark/analytics access and self-service so AI teams stay focused on model development.
  • Governance Enforced: Make governance machine-enforced in the flywheel rather than documented after the fact classification of clips, frames, labels, and embeddings; scrubbing and anonymization in load jobs; and lineage and provenance for every dataset version, annotation campaign, and training input.
  • Technical Mentorship: Set the data-engineering standards for the flywheel schema conventions, dataset contracts, quality gates and mentor IC work toward them, growing the function as the team forms.

 

OUR IDEAL CANDIDATE

  • Data Engineering Depth: 8+ years building and operating large-scale data pipelines and data-lake or lakehouse systems in production ingestion, ETL/ELT, partitioning and storage-format decisions, and the reader/writer libraries consumers rely on.
  • ML Data Specialty: Has built data pipelines for model training and evaluation, labeled data, and evaluation/benchmark sets with a working understanding of how data quality and versioning move model results.
  • Lakehouse Architecture: Strong experience with medallion-style layered data architectures and modern table/lake formats (e.g. Iceberg, Delta, Parquet, or comparable), including schema evolution and dataset versioning.
  • Multimodal Data at Scale: Experience with large multimodal data video, image, sensor/telemetry  and the storage and access patterns that make it queryable at scale (denesting, repartitioning, binary-inline vs. reference storage).
  • Framework Integration: Hands-on with the data side of ML frameworks PyTorch/Lightning dataloaders and Spark and strong Python knowledge.
  • Governance & Provenance: Practical experience enforcing data governance in pipelines classification, access control, lineage and provenance, retention, particularly for privacy sensitive data.
  • Technical Leadership: A track record of setting data-engineering direction and leveling up engineers (technical leadership; formal management not required).
  • Education: Bachelor's or Master's in Computer Science, Engineering, or a related field, or equivalent practical experience.

PREFERRED QUALIFICATIONS

  • Streaming or near-real-time ingestion from edge/IoT sources into a data lake (e.g. Kafka, Lambda, EMR, or similar).
  • Append-without-rewrite and hash-indexed dataset techniques on open table formats, and dataset/feature-versioning systems.
  • Generative-AI data work: fine-tuning and evaluation dataset curation for LLMs/VLMs.
  • Exposing datasets to AI agents through MCP-style query interfaces, with semantic schema and plain-language documentation for retrieval.
  • Computer-vision / video annotation tooling and workflows (e.g. Encord, Labelbox, or similar).

 

COMPENSATION

The beginning annual salary range for this role is $171,900 - $221,000 USD and is determined by location, job-related experience, and education/training. Your total earning potential is amplified by a bonus structure tied to meeting goals, and you will become an owner from day one through our employee equity program.

BENEFITS

We believe you do your best work when your whole life is supported. We invest in our crew’s health, families, and financial futures with a benefits package designed to support you inside and outside the office. Full-time benefits include, but not limited to: Comprehensive health, dental and vision coverage, retirement benefits (401k match up to 4%), and flexible PTO.

LVT IS PROUD TO BE AN EQUAL OPPORTUNITY EMPLOYER. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status. All candidates must pass a drug screening and background check upon employment. Some roles may also require passing a federal background check and fingerprinting. Must be authorized to work in the U.S. If reasonable accommodation is needed to participate in the job application or interview process, and/or to perform essential job functions, please reach out to your recruiter.

Create a Job Alert

Interested in building your career at LVT? Get future opportunities sent straight to your email.

Apply for this job

*

indicates a required field

Phone
Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf


Select...
Select...
Select...
Select...
Select...

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in LVT’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

Select...
Select...
Race & Ethnicity Definitions

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Select...

Voluntary Self-Identification of Disability

Form CC-305
Page 1 of 1
OMB Control Number 1250-0005
Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury
Select...

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.