
AI Engineer (Voice) - Enterprise

San Francisco & Palo Alto, CA; London

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.

Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity.

We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important.

All engineers and researchers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the Role

You will work directly with our enterprise customers, owning the strategy and execution of Voice AI solutions. You’ll act as a specialized AI startup CTO, focusing on voice-driven technologies, leading high-stakes projects, and delivering measurable impact. If you excel at combining deep technical expertise with customer-focused innovation, particularly in the Voice AI domain, we’d love to hear from you. Your day-to-day work may include:

  • Designing and building end-to-end Voice AI solutions, from understanding customer pain points to scoping product specs and deploying LLM-powered voice interfaces.
  • Benchmarking voice models, writing evaluations, or analyzing performance to identify weaknesses in speech recognition, synthesis, or natural language understanding.
  • Improving model performance through system prompt tuning, fine-tuning voice-specific models, or optimizing for low-latency voice interactions.
  • Analyzing voice request logs, prompt data, or audio inputs to enhance system accuracy and user experience.
  • Building internal tools to automate voice AI workflows, such as transcription pipelines or real-time voice processing.
  • Enhancing xAI’s Voice AI SDKs or developer documentation based on customer feedback and enterprise use cases.

Focus

  • Deep expertise in solving enterprise voice AI challenges, delivering robust and scalable voice-driven solutions.
  • Proven ability to ship high-quality code and complete voice AI projects in demanding environments. 
  • Ability to handle ambiguity, adapt to evolving requirements, and prioritize effectively in a fast-paced startup setting.
  • Exceptional communication skills to clarify voice-specific requirements with customers and drive projects to successful completion.
  • Emphasis on designing, implementing, and maintaining efficient voice AI architectures, including speech-to-text, text-to-speech, and real-time voice processing.
  • Proficiency in managing complex codebases and optimizing voice data pipelines for high-throughput, low-latency performance.
  • Define critical benchmarks for Voice AI performance: Establish key performance benchmarks tailored to enterprise voice use cases, such as speech recognition accuracy, natural language understanding, and real-time latency, reflecting customer prompt distributions.
  • Initiate human data collection for Voice AI: Design and manage campaigns to acquire high-quality voice and conversational data from diverse enterprise contexts, supporting model training and validation.
  • Drive Voice AI integration with enterprise partners: Collaborate with cross-functional teams to integrate Voice AI capabilities into enterprise workflows, enabling seamless adoption in areas like customer service, virtual assistants, and telephony systems.

Requirements

An ideal candidate meets at least the following:

  • Strong engineering background.
  • Experience interfacing between technical and customer-facing teams.
  • Excellent verbal and written communication skills in English.
  • Ability to translate business and voice-specific product needs into technical solutions.
  • Proven experience implementing voice AI or machine learning products, including APIs, back-end, and front-end voice interfaces.
  • Strong proficiency in Python and/or TypeScript.
  • Solid understanding of HTTP and real-time communication protocols (e.g., WebRTC).

Standout Experiences

Candidates may distinguish themselves with:

  • Building evaluations for voice AI capabilities, such as speech recognition accuracy or naturalness of synthesized speech.
  • Demonstrating expertise in machine learning fundamentals, including voice model evaluation, training, or fine-tuning.
  • Deploying voice AI models to production, optimizing for low-latency and high-reliability environments.
  • Writing developer documentation or creating voice-specific SDKs.
  • Working with large-scale audio datasets, optimizing voice processing pipelines, or scaling systems for enterprise-grade workloads.
  • Using infrastructure tools like Pulumi or Terraform for deploying voice AI systems.

Interview Process

After submitting your application, our team reviews your CV and Statement of Exceptional Work. If selected, you’ll be invited to a 15-minute technical phone interview where we’ll discuss your background and voice AI specialization. Successful candidates proceed to the main process:

  • 15 min Technical Screen
  • 2 x 45 min Coding Interviews (focused on voice AI or related challenges)

The Statement of Exceptional Work is a critical factor in our evaluation.

We aim to complete the main process within one week. All applications are reviewed by our technical team, not recruiters. Interviews are conducted via Google Meet or in person.

Benefits

  • Competitive cash-based compensation
  • xAI equity
  • Private health and dental insurance
  • Unlimited time off subject to prior approval

Annual Salary Range

$180,000 - $440,000 USD



xAI is an equal opportunity employer.

California Consumer Privacy Act (CCPA) Notice
