Senior Software Engineer - GPU Optimization
Meet the team:
At Torc Robotics, our Hardware Acceleration team builds the brains behind our autonomous trucks — turning cutting-edge research and perception into safe, reliable features that deliver on the promise of Level 4 autonomy.
As a Senior Software Engineer, you’ll be a key contributor in developing and integrating new autonomy features into our real-time system, working across perception, prediction, planning, and control. You’ll help bring AI to life on the road by building high-performance, production-grade software that powers the decision-making of our autonomous platform.
What you'll do:
- Design and implement robust software for real-time autonomous features such as lane changes, obstacle handling, or complex maneuvers
- Collaborate across domain teams (Perception, Planning, Prediction, etc.) to translate ML models and algorithms into integrated autonomy features
- Develop and maintain high-performance C++ codebases, ensuring low-latency, high-reliability execution onboard autonomous trucks
- Leverage CUDA and GPU acceleration to optimize runtime performance of feature pipelines and embedded inference workloads
- Write and integrate custom CUDA kernels or TensorRT plugins, when needed, to accelerate compute-heavy autonomy components
- Own feature pipelines from concept through deployment — including design, integration, testing, validation, and performance optimization
- Debug and resolve system-level issues in embedded environments, working closely with hardware acceleration, systems, and QA teams
- Participate in code reviews, design sessions, and sprint planning to drive engineering excellence and team alignment
- Contribute to platform evolution, bringing ideas for scalability, modularity, and reusability across the autonomy stack
What you’ll need to succeed:
- Bachelor’s or Master’s degree in Computer Science, Robotics, Electrical Engineering, or a related field, or 8+ years of professional software development experience
- Strong proficiency in modern C++ (C++14 or later), with a focus on clean, efficient, and maintainable code
- Experience in real-time system development, multi-threading, and performance profiling
- Hands-on experience with CUDA programming, GPU parallelization techniques, or deploying compute-intensive workloads on NVIDIA platforms
- Solid understanding of software integration practices, APIs, and modular architectures
- Comfort working in a collaborative development environment (Git, Jira, CI/CD, etc.)
- Excellent problem-solving and debugging skills, especially across complex software systems
Bonus points!
- Familiarity with machine learning workflows and deploying ML-driven features in embedded environments
- Experience writing TensorRT plugins or integrating AI inference runtimes into production systems
- Knowledge of ROS, Linux, and hardware-in-the-loop (HIL) testing
- Exposure to deep learning frameworks (PyTorch, TensorFlow) and GPU-accelerated model deployment
- Contributions to open-source projects or prior work in embedded AI or AV platforms
Perks of Being a Full-time Torc’r
Torc cares about our team members, and we strive to provide benefits and resources to support their health, work/life balance, and future. Our culture is collaborative, energetic, and team-focused. Torc offers:
- A competitive compensation package that includes a bonus component and stock options
- 100% paid medical, dental, and vision premiums for full-time employees
- 401(k) plan with a 6% employer match
- Flexibility in schedule and generous paid vacation (available immediately after start date)
- Company-wide holiday office closures
- AD&D and life insurance
US Pay Range
$160,800 - $193,000 USD