
Staff Engineer, Infrastructure - Product Software
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
The Software Infrastructure team builds the foundation of Tenstorrent’s developer experience. From CI pipelines to model deployment, we create systems that are fast, reliable, and frictionless. This role is about automating every layer of the stack and building real observability into how we develop, ship, and run ML workloads. If you care about clean pipelines, actionable metrics, and enabling high-performance AI from development to silicon, this is the team where it all comes together.
This role is hybrid based in Belgrade, Serbia.
We welcome candidates at various experience levels. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.
Who You Are:
- You approach infrastructure like a product, with a focus on speed, reliability, and observability
- Comfortable in Linux and fluent with Docker, with a passion to automate manual workflows
- Systems-focused, with a strong bias for reducing friction in engineering environments
- Interested in ML infrastructure and motivated to scale tools that support inference and deployment workflows
What We Need:
- An engineer with experience building and scaling GitHub Actions pipelines for development, testing, and deployment
- Someone who can automate provisioning, configuration, and workload execution across local and cloud systems
- A builder who can deploy observability systems with tools like Grafana and Prometheus, including real-time dashboards, to enable data-driven engineering work
- A collaborator who can support engineers and inference teams running models on Tenstorrent hardware
What You Will Learn:
- Strategies for enabling ML model execution across custom AI silicon
- How to scale CI workflows and developer tooling across a high-performance engineering organization
- Tools and practices that support modern inference workloads like vLLM in production
- What it takes to deliver reliable, low-friction infrastructure in a full-stack AI system
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology. Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2). These requirements apply to persons located in the U.S. and all countries outside the U.S. As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency. If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.
Apply for this job
*
indicates a required field