AI Capacity Planning & Procurement Manager
About Us:
Here at Fireworks, we’re building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference. We’ve been independently benchmarked to have the fastest LLM inference and have been getting great traction with innovative research projects, like our own function calling and multi-modal models. Fireworks is funded by top investors, like Benchmark and Sequoia, and we’re an ambitious, fun team composed primarily of veterans from Pytorch and Google Vertex AI.
The Role:
We are an AI company building reliable, high-performance model serving infrastructure. Our customers run mission-critical workloads and expect consistency, cost transparency, and predictable scaling. Capacity — GPU, network, and cloud economics — is existential for us. You will be one of the earliest hires focused on making sure we always have the right compute, at the right price, ahead of demand.
Key Responsibilities:
- Architect the Multi-Year Capacity Strategy: Develop and own the strategic, multi-year capacity plan by synthesizing inputs from the company strategy, business forecast, Industry trend and Engineering’s product roadmap.
- Infrastructure Cost Management: Directly manage the biggest cost on our P&L. Translate market insights, technology advancement, and forward-looking plans into rigorous financial models to enable fast business growth while minimizing Total Cost of Ownership (TCO) per unit of performance (e.g., TFLOP). Partner with infra, finance, and GTM to codify capacity strategy into budgets and KPIs
- Build market intelligence with advanced compute technology roadmap, lead time, and pricing for relevant SKUs (H200/B200/B300/GB200/GB300/MI355, networking, storage)
- Own end-to-end procurement of cloud capacity (GPUs, storage, networking, etc) across multiple vendors (cloud, bare-metal, colocation, integrators, brokers) including RFQs/RFPs, commercial Negotiation.
- Define Capacity Management discipline with processes and tools: inventory, allocation, unit economics, cost attribution and optimization
- Stand up processes for disciplined renewals, rev-share allocations, and hedging against supply shocks
Minimum Qualifications:
- 5–10+ years of experience in capacity management, sourcing, data center supply, cloud procurement, infrastructure operations, or a related field (e.g., hyperscaler, colocation provider, OEM, or LLM infrastructure startup)
- Demonstrated ability to develop and communicate multi-year capacity plans that align business, product, and financial objectives
- Strong knowledge of GPU and server SKUs, networking topologies, power and space constraints, and a high-level understanding of the global AI compute supply and demand landscape
- Proven experience negotiating contracts valued at seven figures or higher, with a solid grasp of key terms such as commit profiles, flexibility clauses, drawdowns, credits, and SLA remedies
- Proficiency in unit economics and scenario modeling, including total cost of ownership (TCO) and TFLOP-month analysis
- Ability to operate effectively in a fast-paced, low-process environment and contribute to company-defining initiatives
Preferred Qualifications:
- Established network across major capacity suppliers, including NVIDIA partners, cloud providers, brokers, integrators, and colocation vendors
- Experience connecting capacity planning with real-world model serving workload patterns and performance requirements
- Background in AI infrastructure environments, such as hyperscaler clouds or AI infrastructure startups
Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted.
Base Pay Range (Plus Equity)
$150,000 - $250,000 USD
Why Fireworks AI?
- Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.
- Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.
- Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.
- Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.
Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.
Create a Job Alert
Interested in building your career at Fireworks AI? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field
