Back to jobs

AI Infrastructure Engineer

Los Angeles, San Francisco, Palo Alto, Toronto

About HeyGen

At HeyGen, our mission is to make visual storytelling accessible to all. Over the last decade, visual content has become the preferred method of information creation, consumption, and retention. But the ability to create such content, in particular videos, continues to be costly and challenging to scale. Our ambition is to build technology that equips more people with the power to reach, captivate, and inspire audiences.
Learn more at www.heygen.com.  Visit our Mission and Culture doc here

About HeyGen

HeyGen stands at the forefront of cutting-edge AI-powered platforms, revolutionizing the realm of video creation.

Position Summary:

At HeyGen, we are at the forefront of developing applications powered by our cutting-edge AI research. As an AI Infrastructure Engineer, you will lead the development of fundamental AI systems and infrastructure. These systems are essential for powering our innovative applications, including Photo Avatar, Instant Avatar, Streaming Avatar, and Video Translation. Your role will be crucial in enhancing the efficiency and scalability of these systems, which are vital to HeyGen's success.

Key Responsibilities:

  • Design, build, and maintain the AI infrastructure and systems needed to support our AI applications. Examples include
    • AI workflow scheduling system to improve GPU efficiency and throughput of our batch inference systems
    • Model optimization to improve inference performance
    • Auto Train systems to power our avatar models
    • Large scale model evaluation systems
    • Online model serving systems
  • Collaborate with data scientists and machine learning engineers to understand their computational and data needs and provide efficient solutions.
  • Stay up-to-date with the latest industry trends in AI infrastructure technologies and advocate for best practices and continuous improvement.
  • Assist in budget planning and management of cloud resources and other infrastructure expenses.

Qualifications:

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
  • 5+ years of experience
  • Proven experience in managing infrastructure for large-scale AI or machine learning projects
  • Excellent problem-solving skills and the ability to work independently or as part of a team.
  • Proficiency in Python and C++
  • Experience with GPU computing and optimizing computational workflows
  • Familiarity with AI and machine learning frameworks like TensorFlow or PyTorch.

Preferred Qualifications:

  • Experience with CUDA
  • Experience optimizing large deep learning model performance
  • Experience building large scale batch inference system
  • Prior experience in a startup or fast-paced tech environment.

What HeyGen Offers

  • Competitive salary and benefits package.
  • Dynamic and inclusive work environment.
  • Opportunities for professional growth and advancement.
  • Collaborative culture that values innovation and creativity.
  • Access to the latest technologies and tools.

 

HeyGen is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Apply for this job

*

indicates a required field

Resume/CV

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf