
Software Engineer, Acceleration Kernel Development
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
As a Software Engineer on the Acceleration Kernel Development team at Tenstorrent, you’ll work at the intersection of software and hardware performance. You’ll be writing low-level code that directly powers high-efficiency machine learning workloads, optimizing every cycle, every memory move, every instruction. If you're motivated by performance, precision, and real impact, this is where your skills will shine.
This role is hybrid, based out of Warsaw or Gdansk, Poland. We also consider remote candidates on a case by case status.
We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.
Who You Are
- A developer who loves high performance code, wrangling bits, optimizing compute, and making hardware fly.
- Comfortable in C/C++ and able to build fast, efficient code from the ground up.
- Obsessed with performance and precision, especially in tensors and ML workloads.
- Motivated by complex problems and thrives in collaborative, fast-moving environments.
What We Need
- Build and optimize compute kernels implementing highly parallel algorithms for machine learning and high-performance workloads.
- Analyze and tune performance at the instruction level including latency, memory, and bandwidth.
- Collaborate with ML engineers to get optimizations into production.
- Own debugging, profiling, and ensuring the low-level stack runs fast and reliably.
What You Will Learn
- Push AI hardware to its limits by shaping how kernels are written and executed.
- Integrate kernel work into ML frameworks and real-world training pipelines.
- Tune performance on cutting-edge architectures with top-tier hardware engineers.
- Keep code lean, reliable, and scalable even under heavy workloads.
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
Due to U.S. Export Control laws and regulations, Tenstorrent is required to ensure compliance with licensing regulations when transferring technology to nationals of certain countries that have been licensing conditions set by the U.S. government.
As this position will have direct and/or indirect access to information, systems, or technologies that are subject to U.S. Export Control laws and regulations, please note that citizenship/permanent residency, asylee and refugee information and supporting documentation will be required and considered as a condition of employment.
If a U.S. export license is required, employment will not begin until a license with acceptable conditions is granted by the U.S. government. If a U.S. export license with acceptable conditions is not granted by the U.S. government, then the offer of employment will be rescinded.
Apply for this job
*
indicates a required field