Back to jobs
New

Machine Learning Operations (MLOps) Architect - Generative Al Focus

Arlington, VA

 

Attention: Kapitus is aware that individuals posing as recruiters may be communicating with job seekers about supposed positions with Kapitus. Kapitus has received reports that the content and method of communication can vary, but messages may contain requests for payment (e.g., fees for equipment or training) and/or for sensitive financial information.

Kapitus will never ask a candidate for employment for payment or financial information during the initial application or interview process. All open positions are posted in location specific employment portals available at www.kapitus.com/careers   All legitimate Kapitus job postings on employment sites will direct candidates to complete an application through these portals before completion of the hiring process.

Candidates with additional questions or concerns regarding any recruiting communications or Kapitus’ recruiting process in general should email recruiting@kapitus.com 

 

We are seeking a senior MLOps Architect to design and scale a modern ML and Generative AI platform across AWS. This role will own the architecture for traditional ML and LLM/Generative AI pipelines, ensuring production reliability, governance, cost optimization (FinOps), and enterprise-grade security. The ideal candidate has deep expertise in AWS, SageMaker, Databricks, Atlan (data catalog/governance), and modern MLOps tooling, and understands how to operationalize LLMs, RAG systems, and foundation models within a governed, scalable MLOps stack. This is a strategic, hands-on architecture role responsible for integrating GenAI capabilities into an enterprise ML platform.

What you’ll Do:

MLOps & GenAI Platform Architecture

  • Design and implement scalable ML and LLM infrastructure on AWS (SageMaker, EKS, S3, IAM, Lambda, Step Functions, CloudWatch).
  • Architect end-to-end ML and Generative AI lifecycle workflows:
    • Data ingestion & preprocessing o Feature engineering / embedding generation o Model training & fine-tuning (traditional ML + foundation models)
    • Model evaluation & validation
    • Deployment (real-time, batch, streaming)
    • Monitoring & retraining
  • Integrate LLM pipelines (prompt workflows, RAG architectures, fine-tuning flows) into the enterprise MLOps stack.
  • Define standards for CI/CD/CT pipelines across ML and GenAI workloads.

 

Generative AI & LLM Operationalization

  • Architect Retrieval-Augmented Generation (RAG) pipelines including:
    • Embedding generation workflows
    • Vector database integration
    • Document ingestion and chunking strategies
    • Retrieval evaluation and monitoring
  • Design and deploy LLM-based services using:
    • Managed services (e.g., SageMaker endpoints, Bedrock-style APIs)
    • Containerized custom inference services

 

  • Establish prompt versioning, evaluation frameworks, and experiment tracking for LLM systems.
  • Implement guardrails for hallucination control, safety monitoring, bias detection, and usage logging.
  • Define architecture for LLM fine-tuning workflows (including data curation, evaluation, and cost controls).
  • Implement scalable orchestration of LLM pipelines using workflow engines and event-driven patterns.

 

Deployment, Monitoring & Reliability

  • Architect scalable inference patterns for:
    • Traditional ML models
    • LLM APIs
    • RAG systems
  • Implement model monitoring frameworks for:
    • Performance degradation
    • Drift detection
    • LLM output quality
    • Latency and token usage metrics
  • Define SLAs/SLOs for ML and GenAI systems.
  • Design safe deployment strategies (blue/green, canary, shadow testing).
  • Establish logging, observability, and traceability standards for GenAI systems

 

FinOps & Cost Optimization

  • Implement cost tracking for:
    • Training workloads o GPU utilization
    • Inference endpoints o Token consumption (LLM APIs)
    • Vector database storage
  • Optimize LLM workloads for cost-performance tradeoffs (model size, batching, caching strategies).
  • Design autoscaling and compute optimization strategies for GPU and CPU-based inference.
  • Partner with finance and engineering teams to forecast ML/GenAI infrastructure spend.

 

Platform Enablement & Standards

  • Define enterprise standards for:
    • Experiment tracking
    • Model registry
    • Prompt registry
    • Artifact management
    • Embedding versioning

 

  • Provide architectural guidance to data science, AI, and engineering teams.
  • Evaluate and recommend tooling across the ML/GenAI stack (MLflow, feature    stores, vector databases, orchestration tools).
  • Drive documentation and reusable patterns for ML and GenAI development.

 

What We’re Looking for

 

  • 6+ years of experience in ML engineering, data engineering, or MLOps roles.
  • Proven experience architecting ML platforms in AWS.
  • Strong hands-on experience with SageMaker (training, pipelines, deployment).
  • Experience operationalizing LLM or Generative AI systems in production.
  • Experience building RAG pipelines and integrating vector databases.
  • Experience working with Databricks in production.
  • Experience implementing data governance and catalog systems (e.g., Atlan).
  • Strong understanding of CI/CD principles for ML and GenAI.
  • Experience with containerization (Docker) and orchestration (Kubernetes/EKS).
  • Deep knowledge of infrastructure-as-code (Terraform, CloudFormation).
  • Strong understanding of observability and monitoring for ML systems.
  • Experience implementing cloud cost optimization strategies (FinOps).
  • Strong Python proficiency.
  • Experience with foundation model fine-tuning and parameter-efficient methods.
  • Experience implementing model registries and experiment tracking tools.
  • Experience designing feature stores and embedding stores.
  • Familiarity with AI risk management, bias mitigation, and safety controls.
  • Experience supporting regulated or data-sensitive environments.
  • Platform-level architectural thinking.
  • Deep understanding of how to integrate GenAI into enterprise ML ecosystems.
  • Ability to balance scalability, governance, security, performance, and cost.
  • Strong technical leadership and cross-functional collaboration skills.
  • Hands-on ability to move from architecture design to implementation

Kapitus Total Rewards Package Includes: 

  • Competitive Base Salary Range of $117,800 – $189,000 Kapitus is providing this as a good faith salary range to comply with applicable law. The applicant’s final salary will depend on a number of factors including the applicant’s geographic location, skills, and experience.
  • Annual Incentive Compensation Eligibility  Up to 10% annually
  • Health Insurance: Comprehensive medical, dental, and employer-paid vision plans through UnitedHealthcare (UHC), with various coverage levels available to meet the needs of our employees and their families. Additional perks through UHC include: Sweat Equity, free subscription to the Calm App, UHC rewards, Real Appeal, and Quit For Life.
  • Flexible Spending Account: Set aside pre-tax dollars from your paycheck to pay for qualified out-of-pocket medical, dental, vision, pharmacy or dependent care expenses. 
  • Lifestyle Spending Account: Employer sponsored post-tax benefits that allow reimbursement for expenses related to physical, mental and financial well-being. 
  • 100% Company Paid Insurances: Kapitus fully covers the cost of basic short-term and long-term disability insurance, as well as vision insurance, ensuring our employees have comprehensive protection without any personal expense.
  • Voluntary Insurance: Supplemental life insurance as well as enhanced short- and long-term disability coverage are available through Mutual of Omaha, providing additional security for our employees. Additionally, Colonial Accident and Hospitalization insurances are also available, offering further protection against unforeseen events.
  • Paid Maternity and Parental Leave: Beyond state-mandated leave policies, Kapitus provides company-paid maternity and parental leave, supporting our employees during important family milestones.
  • Commuter Benefits: We offer pre-tax benefits on parking and commuter expenses to cover travel to and from work.
  • LifeBalance Program: Enhance your lifestyle with our LifeBalance membership, which offers discounts on outdoor activities, the arts, health, and fitness. Additional benefits include: 
    • Pet and car insurance discounts.
    • Financial services such as LegalShield.
    • Relaxation and stress management tools.
  • Plum Benefits Discount Program: Access exclusive discounts on shows, travel, car rentals, and more, enriching your personal and family life.
  • Tuition Reimbursement: Pursue further education with up to $5,000 annually in tuition reimbursement, plus opportunities to attend relevant conferences and career development events. Managed through our LSA plan, Kapitus Academy. 
  • Travel Reimbursement: We also offer travel reimbursement for all work-related travel, supporting your involvement in career and personal development activities.
  • Paid Time Off and Sick Time.
  • Retirement Benefits: Our 401K plan is managed through Fidelity. To support your long-term financial goals, the company provides a 25% match on your contributions, up to 6% of your annual salary.

 

About Kapitus:

Kapitus is one of the most reliable and respected names in small business financing. As both a direct lender and a marketplace built with a trusted network of lending partners, we can provide small businesses with the financing they need when, and how it is needed. We have spent our entire existence building a culture that makes us excited to come to work in the morning. Our company is fast paced, teammates need to be self-directed and have an internal motivation to do the right thing, even when the right thing takes a lot of hard work. We show our teammates our appreciation by offering great benefits, competitive pay and solid opportunity for growth.

Company Mission: At Kapitus, our mission is to help small business owners grow their organizations by providing tailored, transparent, and ethical financing solutions. We invest in every business owner’s story and we are dedicated to building lasting relationships to champion their goals. We promise to keep the best interests of our clients at the center of the financing process by operating with transparency, fairness, and integrity.

Consideration will be given to qualified remote candidates residing in states where Kapitus and/or one of its subsidiaries has an established physical presence.

Apply for this job

*

indicates a required field

Phone
Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf


Education

Select...
Select...
Select...
Select...
Select...

Select...
Select...
Select...
Select...
Select...
Select...
Select...
Select...

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in Kapitus’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

Select...
Select...
Race & Ethnicity Definitions

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Select...

Voluntary Self-Identification of Disability

Form CC-305
Page 1 of 1
OMB Control Number 1250-0005
Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury
Select...

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.