Staff Software Engineer AI
About the Role
We are seeking an experienced Staff AI Engineer to join our AI and Data Platform team, where you'll play a pivotal role in building and scaling our next-generation AI workforce platform. You'll work on cutting-edge agent-based systems that are transforming supply chain operations for Fortune 500 companies, delivering real business value through intelligent automation.
Key Responsibilities
Technical Leadership
- Design and implement production-scale AI agent systems and orchestration frameworks (LangGraph, LangChain, similar architectures)
- Lead architecture for multi-agent systems handling complex business workflows
- Optimize deployment strategies using both LLMs and SLMs based on use case requirements
- Build natural language-configurable business process automation frameworks
- Implement multi-modal AI systems for document understanding (tables, charts, layouts)
AI/ML Implementation & Optimization
- Deploy and optimize LLMs/SLMs in production with fine-tuning techniques (LoRA, QLoRA, DPO)
- Implement quantization strategies (INT8, INT4) and model distillation for edge deployment
- Build evaluation frameworks including LLM-as-judge systems and regression testing
- Design streaming architectures for real-time LLM responses (SSE, WebSockets)
- Create semantic caching and embedding-based retrieval systems
- Develop GraphRAG and long-context handling strategies (100k+ tokens)
System Architecture & Engineering
- Design scalable microservices with comprehensive observability (LangSmith, Arize, custom telemetry)
- Build secure multi-tenant systems with prompt injection prevention and output validation
- Implement cost optimization through intelligent model routing and fallback strategies
- Develop document processing pipelines with OCR and layout understanding
- Create event-driven architectures for real-time shipment tracking and exception handling
Data & Infrastructure
- Build data pipelines for training data curation, synthetic generation, and PII masking
- Implement RLHF/RLAIF feedback loops for continuous improvement
- Design experiment tracking and model registry systems (MLflow, DVC)
- Optimize inference costs through batch processing and spot instance utilization
- Establish model governance, audit trails, and compliance frameworks
Required Qualifications
Technical Skills
- 8+ years software engineering, 3+ years in production AI/ML systems
- Expertise in Python, PyTorch/JAX, and AI frameworks (LangChain, Transformers, PEFT)
- Experience with LLMs (GPT-4, Claude, Gemini) and SLMs (Phi, Llama, Mistral)
- Hands-on experience with:
- Fine-tuning techniques (LoRA, QLoRA, DPO, RLHF)
- Model optimization (quantization, distillation, pruning)
- Vector databases and RAG architectures
- Streaming systems and real-time processing
- Security measures (prompt injection prevention, jailbreak detection)
- Strong background in distributed systems, Kubernetes, and cloud platforms
Domain Knowledge(nice to have)
- Experience with document intelligence and multi-modal AI systems
- Understanding of supply chain operations, EDI/API integrations
- Knowledge of token economics and consumption-based pricing models
- Familiarity with enterprise compliance requirements (GDPR, CCPA, SOC2)
Professional Skills
- Track record of delivering complex projects with measurable business impact
- Experience with technical sales support, POCs, and customer success
- Strong communication for technical and non-technical audiences
- Data-driven decision making for model selection and cost optimization
Preferred Qualifications
- Supply chain, logistics, or transportation management experience
- Experience with OCR pipelines and document extraction at scale
- Knowledge of GraphRAG and knowledge graph integration
- Contributions to open-source AI projects (Hugging Face, Ollama)
- Experience reducing inference costs by 50%+ through optimization
- Familiarity with MoE architectures and constitutional AI approaches
- Background in building usage-based billing and margin optimization
- Experience with specialized tools (vLLM, TGI, Triton, ONNX, TensorRT)
What You'll Work On
- Building specialized AI agents solving supply chain problems
- Fine-tuning domain-specific models for supply chain terminology
- Implementing hybrid architectures combining cloud LLMs with edge SLMs
- Creating secure document intelligence systems for Fortune 500 clients
- Developing real-time exception handling for shipment tracking
- Building observability and evaluation frameworks for agent performance
- Designing fallback strategies and multi-provider redundancy
Technical Environment
Models: GPT-4, Claude, Gemini, Llama 3, Mistral, Phi-3, custom fine-tuned models
Fine-tuning: LoRA/QLoRA, PEFT, DeepSpeed, bitsandbytes, Axolotl
Infrastructure: Kubernetes, AWS SageMaker/Bedrock, GPU clusters, edge devices
Frameworks: LangChain, LangGraph, vLLM, FastAPI, Transformers
Observability: LangSmith, Weights & Biases, custom telemetry
Data: PostgreSQL, Redis, Vector DBs, Kafka, feature stores
Impact & Growth
You'll directly contribute to AI initiatives generating millions in revenue while shaping systems processing millions of transactions daily. Lead technical decisions affecting 25+ engineers while mentoring the next generation of AI engineers. Be at the forefront of production AI optimization, balancing performance, cost, and latency for enterprise customers.
Who we are:
FourKites®, the leader in AI-driven supply chain transformation for global enterprises and pioneer of advanced real-time visibility, turns supply chain data into automated action. FourKites’ Intelligent Control Tower™ breaks down enterprise silos by creating a real-time digital twin of orders, shipments, inventory and assets. This comprehensive view, combined with AI-powered digital workers, enables companies to prevent disruptions, automate routine tasks, and optimize performance across their supply chain. FourKites processes over 3.2 million supply chain events daily — from purchase orders to final delivery — helping 1,600+ global brands prevent disruptions, make faster decisions and move from reactive tracking to proactive supply chain orchestration.
Working at FourKites
We provide competitive compensation with stock options, outstanding benefits and a collaborative culture for all employees around the globe, including:
- 5 global recharge days, in addition to standard holidays, and a hybrid, flexible approach to work.
- Parental leave for all parents, an annual wellness stipend and volunteer days also provide you with time and resources for self care and to care for others.
- Opportunities throughout the year to learn and celebrate diversity.
- Access to leading AI tools and foundation models, with the freedom to experiment and find creative ways to be more effective in your role
Create a Job Alert
Interested in building your career at FourKites? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field