AI Specialist (AI Engineering)

San Francisco Bay Area, USA

We are looking for an AI Specialist to improve the performance of large language and vision models for on-device inference. You will develop and deploy AI solutions that run efficiently across diverse hardware architectures.

Responsibilities:

  • Compress and optimize large language and vision models for on-device inference.
  • Develop pipelines for model distillation and hardware-specific compilation.
  • Benchmark performance across various NPU/GPU architectures.

Qualifications:

  • Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
  • Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
  • Strong C++ and Python skills.

Interested in building your career at Hyphen Connect Limited? Get future opportunities sent straight to your email.
