Back to jobs
New

AI Field Engineer - Microsoft Foundry

San Mateo, CA

About Us:

At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge innovation through projects like our own function calling and multimodal models. Fireworks is a Series C company valued at $4 billion and backed by top investors including Benchmark, Sequoia, Lightspeed, Index, and Evantic. We’re an ambitious, collaborative team of builders, founded by veterans of Meta PyTorch and Google Vertex AI.

The Role

As an AI Field Engineer for Microsoft Foundry, you will be one of the technical owners of Fireworks' most strategic partnership. You’ll work closely with Microsoft's field teams, Azure-aligned ISVs, and the SIs that run enterprise AI transformation programs to make Fireworks the default inference and fine-tuning layer in every Azure AI architecture your partners touch. The role sits at the intersection of engineering, partner development, and customer delivery. You build reference architectures, run benchmarks, debug production integrations, and co-develop POCs — all while holding your own in executive-level conversations about strategy, roadmap, and business outcomes.

You spend most of your time building and enabling. You ship code, run joint POCs with Microsoft field teams, and architect deployments that span Azure Foundry and Fireworks. But you also lead discovery conversations, align partner stakeholders, and translate field signals into product improvements that compress the feedback loop from partner to roadmap. 

The Segment

As a Field Engineer aligned with our Partnerships team you own the technical relationship between Fireworks and the Microsoft ecosystem, Azure field teams, ISVs building on Azure Foundry, and the SIs that deliver AI transformation programs on Azure. The Microsoft partnership is a core go-to-market bet: clients like UIPath, Stack Blitz, Motif run via Fireworks on Foundry.. Your job is to scale that pattern across the partner ecosystem. These engagements involve large, multi-stakeholder organizations, so you will need to navigate both the enterprise buyer (IT, security, compliance) and the builder (ML engineers, platform teams, app developers), while building the trusted-advisor relationships inside Microsoft's field that multiply your reach.

What You'll Work On

Technical Delivery and Deployment

  • Be the technical lead on co-sell motions with Microsoft — joint reference architectures, Azure Foundry integration patterns, and shared POCs for strategic accounts.
  • Build end-to-end POCs and MVPs alongside partner engineering teams, working inside their codebases, infrastructure, and constraints.
  • Run load tests and establish latency, throughput, and cost baselines against realistic customer traffic profiles, and tune deployments to hit those targets.
  • Deploy and validate new model families on inference frameworks (vLLM, SGLang), determining optimal shapes, quantization configs, and serving patterns across workloads.

Model Strategy and Fine-Tuning

  • Guide Microsoft’s customers on model selection, fine-tuning strategy (SFT, DPO, RFT), and evaluation methodology.
  • Build and run fine-tuning pipelines directly with customers, navigating trade-offs between model families, compute cost, and quality targets.
  • Design and implement evaluation frameworks that measure production-quality metrics, not just benchmark scores.

Product Feedback and Platform Improvement

  • Own the feedback loop — surface partner-driven product gaps to Fireworks engineering, and translate the roadmap back into partner messaging.
  • Ship external technical content: reference architectures, integration guides, and benchmark posts that make it easy for partners to win deals with us.
  • Track pipeline health; flag risks and opportunities to Field leadership weekly.

What We're Looking For

Minimum Qualifications

  • 3+ years in a pre-sales, partner engineering, forward-deployed, or technical consulting role.
  • Demonstrated ability to build production software with customers, not just advise on it. You have shipped code running in someone else's production environment.
  • Strong Python skills. Comfortable reading, writing, and debugging production code. Familiarity with Kubernetes and infrastructure engineering.
  • Hands-on fluency with LLM inference: latency/throughput tradeoffs, batching strategies, quantization, structured outputs, function calling. You can explain why 50ms p99 matters to an enterprise CTO.
  • Real experience with fine-tuning — LoRA at minimum, RFT a strong plus. You understand when SFT is enough and when it isn't.
  • Deep familiarity with the Azure AI stack: Azure Foundry, Azure OpenAI Service, Azure ML, AKS, Entra/RBAC for AI workloads. You know where Fireworks fits and where it doesn't.
  • Exceptional communication: able to run a sharp discovery call, present to a VP, and debug a latency issue with an ML engineer in the same afternoon.

Preferred Qualifications

  • 5+ years in technical field or engineering roles where you've owned a technical relationship with a hyperscaler or major SI, not just supported one
  • Experience with inference serving frameworks (vLLM, SGLang, TensorRT-LLM) and tuning deployments for real workloads.
  • Prior role at a hyperscaler, AI-native cloud, or inference provider.
  • Experience with agentic frameworks (LangChain, LlamaIndex, or custom tool-use pipelines) — you understand how inference latency and reliability shapes agent behavior at scale.
  • Background in model evaluation — you understand why benchmark gaming is rampant and what rigorous evals actually look like.
  • You've written a technical blog post or reference architecture that people actually read.
  • Track record taking GenAI POCs from prototype to production-scale deployments.

On-Target Expectations (Plus Equity)

$280,000 - $320,000 USD

Total compensation also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location.

Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.

Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted.

On Target Earnings (Plus Equity)

$280,000 - $320,000 USD

Why Fireworks AI?

  • Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.
  • Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.
  • Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.
  • Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.

Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.

Create a Job Alert

Interested in building your career at Fireworks AI? Get future opportunities sent straight to your email.

Apply for this job

*

indicates a required field

Phone
Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf


Select...
Select...
Which office location(s) are you interested in? *

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in Fireworks AI’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

Select...
Select...
Race & Ethnicity Definitions

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Select...

Voluntary Self-Identification of Disability

Form CC-305
Page 1 of 1
OMB Control Number 1250-0005
Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury
Select...

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.