
Staff Software Infrastructure Engineer
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
We’re looking for a pragmatic and driven engineer to join our Software Infrastructure team in Belgrade. You’ll help build and maintain the backbone of our developer and model execution workflows - enabling fast, reliable and secure deployment of workloads on Tenstorrent’s cutting-edge AI hardware.
This is a hands-on role where you’ll work closely with developers and infrastructure engineers to automate, scale and monitor systems across Tenstorrent’s environments. You should be comfortable with tools like GitHub Actions, Docker, Ansible and monitoring stacks, and have a passion and deep understanding of the need for automation and observability.
This role is hybrid based out of Belgrade, Serbia.
We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.
Responsibilities:
- Design and maintain scalable CI/CD workflows using GitHub Actions to support development, testing, and release processes.
- Manage container-based environments (Docker), virtual machines and bare-metal systems across multiple internal environments.
- Automate provisioning, configuration and maintenance of infrastructure.
- Build monitoring and alerting systems like Prometheus, Grafana and related tooling to ensure observability and reliability.
- Own and evolve internal dashboards to track performance, test results, and system metrics.
- Collaborate with engineers and hardware teams to support workload orchestration in Tenstorrent eco-system.
- Develop internal tooling in Python to streamline test execution, data collection, packaging and release workflows.
- Work on infrastructure support for high-performance inference, including vLLM-based workloads.
Experience & Qualifications:
- 5+ years of experience in software infrastructure, DevOps, or SRE roles.
- Strong experience with CI/CD systems (preferably GitHub Actions) and containerization (Docker).
- Solid knowledge of Linux systems, scripting (Bash/Python) and virtualization.
- Familiarity with Ansible or similar tools for automation and configuration management.
- Experience setting up and maintaining monitoring stacks (e.g. Grafana, Prometheus).
- Exposure to dashboarding and reporting tools like Superset.
- Understanding of workload orchestration and scheduling tools (e.g. Slurm).
- Excellent communication and troubleshooting skills in cross-functional environments.
- A degree in Software Engineering or Computer Science, or equivalent professional experience.
Nice to Have:
- Experience with ML/AI workflows or model deployment pipelines.
- Experience running or supporting vLLM inference or similar LLM serving systems
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
Due to U.S. Export Control laws and regulations, Tenstorrent is required to ensure compliance with licensing regulations when transferring technology to nationals of certain countries that have been licensing conditions set by the U.S. government.
As this position will have direct and/or indirect access to information, systems, or technologies that are subject to U.S. Export Control laws and regulations, please note that citizenship/permanent residency, asylee and refugee information and supporting documentation will be required and considered as a condition of employment.
If a U.S. export license is required, employment will not begin until a license with acceptable conditions is granted by the U.S. government. If a U.S. export license with acceptable conditions is not granted by the U.S. government, then the offer of employment will be rescinded.
Apply for this job
*
indicates a required field