We are seeking a hands-on Site Reliability Engineer (SRE) to support our Boston-based data center operations. This role is responsible for maintaining the health, performance, and reliability of our server and network infrastructure. The ideal candidate will have strong experience in data center environments, with a solid understanding of network cabling, hardware lifecycle management, and cross-functional collaboration.
Key Responsibilities:
Data Center Operations:
• Monitor and maintain all server and network equipment using provided tools.
• Respond to alerts proactively and reactively to ensure system uptime and performance.
• Perform patch management, scheduled maintenance, and lab setup/teardown activities.
Hardware & Asset Lifecycle Management:
• Track and manage equipment from procurement through installation, replacement, and disposal using asset tracking software.
• Physically install, replace, and decommission servers, storage arrays, and network devices.
Network Cabling & Remote Hands Support:
• Assist with network cabling tasks including cable identification, labeling, routing, and termination.
• Provide remote hands support for SysOps and NetOps teams, including troubleshooting routers, switches, and firewalls.
• Perform basic diagnostics and assist in Tier 1 vendor troubleshooting calls.
Licensing & Vendor Coordination:
• Support license maintenance and renewal processes.
• Coordinate with vendors and internal teams for hardware and software support.
Cross-Functional Collaboration:
• Work closely with IT and Engineering teams to support infrastructure needs and project initiatives.
Qualifications:
• Minimum 2 years of hands-on data center experience.
• Basic understanding of networking concepts including routers, switches, firewalls, and structured cabling.
• Ability to lift up to the recommended weight limit (~40 lbs).
• Valid driver’s license and reliable transportation for regular travel to the downtown Boston data center.
• Familiarity with monitoring tools, asset tracking systems, and patch management processes.
• Strong troubleshooting skills and ability to work independently and collaboratively.
Preferred Skills:
• Experience with enterprise-grade server, storage, and networking hardware.
• Exposure to vendor support processes and license management.
• Excellent communication and documentation skills.
Work Environment:
• Hybrid role with on-site presence required at the downtown Boston data center at least twice per week.
• Occasional off-hours support for maintenance windows or critical incidents.
• Inclusion in a rotating on-call schedule to support critical network operations outside of business hours.