Network Engineer / Sr. Network Engineer - Hardware & Architecture
ROLES AND RESPONSIBILITIES
Firmus is seeking a skilled Network Engineer / Sr. Network Engineer for Hardware and Architecture to join our Engineering and Technology team. The ideal candidate will play a crucial role in leading the design and deployment of both our physical network infrastructure and network architecture for our AI infrastructure projects.
This role offers an exciting opportunity to work at the forefront of AI networking technology and contribute to the growth of Firmus’ AI infrastructure capabilities. Deep hands-on experience with network hardware is required, particularly with fibre optic systems. Also necessary to succeed in the role is a strong architectural mindset to guide the evolution of scalable, secure and high-performance networks for AI.
KEY RESPONSIBILITIES
- Network Architecture & Design
- Architect and maintain low-latency, high-throughput interconnects (e.g. InfiniBand 100/200/400/800GbE) for HPC and AI workloads.
- Collaborate with other network engineers and cross-functional teams to develop network infrastructure roadmaps, aligned with business and technical strategy.
- Lead the design of our layer 1/2/3 network infrastructure in our data centre deployments, our AI Factories, and the cluster interconnects with considerations for redundancy, scalability, and performance.
- Evaluate new technologies, architectures, and design patterns to improve network performance and efficiency.
- Network Hardware & Physical Infrastructure
- Lead the design, configuration, and deployment of highly scalable physical networks optimised for AI workloads.
- Oversee the planning and implementation of fibre optic cabling systems (single-mode & multi-mode), including backbone connections, patch panels, and structured cabling.
- Manage the integration of optical technologies (DWDM, CWDM) and long-haul fibre for intersite connectivity.
- Ensure physical infrastructure aligns with architectural standards and supports scalability, availability, and security goals.
- Oversee diagnostics, performance testing, and physical layer troubleshooting (OTDR, power meter testing, etc.).
- Create and maintain accurate and up-to-date documentation of network architecture, hardware and cabling.
- Work closely with other engineering disciplines to coordinate the network infrastructure with other services (e.g. mechanical, electrical, security etc.) within the data centre.
- Participate in the operations standby roster and on-call support from time to time.
- Project Management
- Support the deployment team with defining project timelines and resource allocation for the network portion of AI cluster installations.
- Create Bill of Materials and develop budgets for network deployments.
- Coordinate with cross-functional teams to ensure successful project delivery.
- Technology Expertise
- Maintain and expand expertise in physical network hardware and advanced networking technologies, including:
- NVIDIA InfiniBand
- Spectrum Ethernet Platform
- RDMA over Converged Ethernet (RoCE)
- Familiarity with open-source network operating systems such as Cumulus Linux and Sonic.
- Provide technical support and troubleshooting for advanced networking technologies, escalating to vendors as needed.
- Mentor junior network engineers, assisting with their technical development.
- Maintain and expand expertise in physical network hardware and advanced networking technologies, including:
- Stakeholder Management & Collaboration
- Work closely with both Firmus Engineering and Commissioning teams to align network infrastructure with customers’ requirements.
- Facilitate knowledge sharing and communication between teams and create and maintain comprehensive technical documentation.
- Maintain and build strong relationships with key technology partners and vendors and proactively manage and coordinate partner engagement on site.
SKILLS AND EXPERIENCE
- Bachelor’s degree in Network Engineering, Computer Science, or a related technical field.
- 5+ years of experience in network engineering, with a focus on AI infrastructure.
- Strong project management skills and experience leading complex technical projects.
- Solid understanding of advanced networking technologies, particularly those related to AI.
- Hands-on experience with NVIDIA InfiniBand, Spectrum Ethernet Platform, and RoCE.
- Strong experience with network cabling systems, both fibre optic and copper.
- Excellent problem-solving and analytical skills.
- Ability to work independently and as part of a team.
- Strong communication skills, both written and verbal.
- Willingness to travel domestically and internationally for on-site deployments and commissioning as required.
Apply for this job
*
indicates a required field