
Senior Engineer, ML Models
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists has developed a high-performance RISC-V CPU from scratch, and shares a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
As a Senior Machine Learning Engineer on the AI Models team, you will bring up and optimize advanced AI models on Tenstorrent hardware. You will work directly on real workloads, refine performance, and validate model behavior on a custom accelerator. This role is ideal for someone who enjoys practical ML engineering and wants to see their work run at scale.
This role is remote, based in Cyprus.
We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.
Who You Are
- Confident Python developer with hands-on experience using PyTorch to design, train, and refine deep learning models.
- Curious and experiment-driven, always seeking to understand model behavior and improve performance through structured iteration.
- Strong understanding of modern ML model architectures, with the ability to optimize both individual components and full-model pipelines.
- Collaborative engineer who works well across software and hardware teams to solve real technical challenges.
What We Need
- Direct, hands-on experience bringing up state-of-the-art ML models on new hardware platforms.
- Strong debugging instincts to analyze performance issues, tune architectures, and improve accuracy and robustness.
- Working knowledge of model optimization techniques such as quantization, flash attention, and kernel fusion, along with familiarity with matrix engines and memory hierarchies.
- A curiosity-driven mindset that stays current with ML research and applies insights to real-world engineering work.
What You Will Learn
- How to bring real ML models to high performance on a custom AI accelerator.
- Techniques for optimizing ML performance from the application level down to silicon behavior.
- How to translate research concepts into production-ready deployment.
- How to partner with compiler, kernel, and hardware teams to drive new features, performance improvements, and system-level fixes.
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology. Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2). These requirements apply to persons located in the U.S. and all countries outside the U.S. As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency. If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.