Senior Manager - Site Reliability Engineering
Who is SimSpace:
SimSpace launched in 2015 with a singular purpose – addressing the most urgent and sophisticated cybersecurity challenges to reduce risk for our most vulnerable and valuable infrastructure. The organizations around the world that we depend on every day to keep our loved ones safe and secure. Our healthcare facilities, schools, financial institutions, transit centers, grocery stores, and workplaces just to name a few. To deliver global resiliency, we provide an elite cyber range platform to curate unassailable cyber defenses, data driven decisions, cutting edge training labs, live attack scenarios, and extensive individual and dynamic team readiness training.
SimSpace works as OneTeam to elevate humanity around the world. We are committed to continuously improving and delivering a cultivated member experience whether that is accomplished through focusing on supporting our client’s teams or our own mission driven SimSpacers.
We are an international company headquartered in Boston's Fort Point in the U.S. If you are interested in elevating the technology and creative solutions necessary to secure and safeguard our future while working alongside others who share your passion for purpose and development, we want to meet you!
Why should you choose a career at SimSpace?
We are an organization that is focused on building our culture and mindfully enhancing our atmosphere everyday which is why we have collaborated on an integral value system. Our governing philosophy of being Human Centered is deeply embedded within our value system. We apply this philosophy to every one of our internal team members, external clients, and their customers.
Our core values:
- Serve to Protect – We provide safe space, deliver on the mission, and elevate humanity
- Acquire Understanding – We seek and provide clarity 10x, cultivate comprehension, and believe information goes both all ways
- Operate as Innovators – We stay curious, practice consistency over intensity, and continue to be the change we need in the world
- Teamwork Without Borders – We are never alone, we solve for all, and keep people at the heart of everything we do
SimSpace is looking for a Senior Manager, Site Reliability Engineering, to lead our SRE and platform engineering initiatives across the SimSpace infrastructure. The ideal candidate will possess strong leadership skills to guide and mentor teams with a proven track record of building reliable, scalable systems on-premises and in cloud environments, driving operational excellence and fostering a collaborative, security-first engineering culture.
In this position, you'll lead your team through complex infrastructure and platform projects, collaborating closely with development teams to ensure system reliability, performance, and security. The focus is on providing oversight, guidance, and technical leadership for our platform operations, encompassing traditional SRE practices, DevOps automation, DevSecOps integration, and developer experience optimization.
What will you be doing as a Senior Manager, Site Reliability Engineering at SimSpace?
- Lead and manage SRE teams responsible for the reliability, scalability, and performance of our SimSpace cyber range infrastructure and services.
- Partner with development and product teams to establish SLIs, SLOs, and contribute to SLA definitions, along with error budgets and reliability practices that balance feature velocity with system stability.
- Work with Customer Success Teams to assure the Core Platform meets customer’s needs and requirements
- Drive the evolution of our GitHub and ArgoCD CI/CD pipelines, deployment strategies, and developer tooling to enhance engineering productivity and code quality.
- Collaborate with security teams to implement DevSecOps practices, automated security scanning, and compliance monitoring throughout the development lifecycle.
- Develop and maintain infrastructure automation, monitoring strategies, and incident response procedures to ensure high availability and rapid recovery.
- Work with engineering leadership to establish platform standards, architectural patterns, and best practices for cloud-native and hybrid environments.
- Mentor and coach team members in SRE principles, automation practices, and system design, promoting their professional growth and technical expertise.
- Ensure compliance with security frameworks, industry standards, and regulatory requirements in all platform operations.
- Continuously improve operational processes, tooling, and practices to enhance system reliability, developer experience, and organizational efficiency.
Who you are:
- Experienced engineering manager with a strong background in site reliability engineering, platform operations, and distributed systems.
- Experience in managing SaaS systems and shipping the same as a SaaP offering on customer hardware platforms.
- Proven track record of successfully leading and scaling SRE teams in high-growth technology environments.
- Deep technical expertise in system reliability, observability, and automation with the ability to solve complex infrastructure challenges.
- Strong communication and leadership skills with the ability to influence engineering practices across multiple teams.
- Passionate about building resilient systems and empowering development teams through excellent platform experiences.
- Knowledge of SRE principles, DevOps practices, and security automation in modern software delivery.
- Comfortable with agile methodologies and able to adapt platform capabilities to evolving business requirements.
- Ability to mentor technical teams and drive adoption of reliability and automation best practices.
- Results-driven with a focus on measurable improvements in system reliability, deployment frequency, and mean time to recovery.
What are the qualifications to apply? To be successful as a Senior Manager, Site Reliability Engineering, you need to be:
- 8+ years of experience in infrastructure, platform engineering, or SRE roles with at least 3+ years in management.
- Expert knowledge of SRE principles, infrastructure automation, and modern deployment practices.
- Proven track record of leading SRE or platform engineering teams to deliver highly available, scalable systems.
- Strong leadership and communication skills, with the ability to drive technical consensus and cultural change.
- Experience with DevOps and DevSecOps methodologies, including security automation and compliance integration.
- Deep understanding of observability, monitoring, and incident management practices.
- Extensive experience with container orchestration (Kubernetes), infrastructure as code, VMware, and cloud platforms.
- Demonstrated ability to design, build, and operate large-scale distributed systems with high reliability requirements.
- Experience with security automation, vulnerability management, and compliance frameworks in DevSecOps environments.
- Proven experience building and operating CI/CD platforms, developer tooling, and internal platform services.
- Experience managing infrastructure across both cloud and on-premises environments including packaging and shipping SaaP to customers.
- Bachelor's in computer science, engineering, or related field (or equivalent experience).
- U.S. Citizenship is required for this role
- Must own a purple unicorn and a rocket ship.
Our Tech Stack Includes:
Python, Go, Terraform, Ansible, Kubernetes, Tilt, Kustomize, Docker, VMware, GitHub ArgoCD, Prometheus, Grafana, ELK Stack, and more
We’re proud to offer a competitive and comprehensive package designed to support your well-being, growth, and success:
- Compensation. Base salary range: $180,000 – $250,000, reflecting our confidence in your expertise and impact, with the opportunity for annual bonuses tied to company performance and individual contributions.
- Health & Wellness. Comprehensive medical, dental, and vision benefits, plus savings plans—coverage starts on day one!
- Mental Health Support. Access to company-paid counseling, coaching, and resources for you and your family through Spring Health.
- Financial Well-Being. Plan for your future with a 401(k)-retirement savings plan featuring a company match.
- Flexible Time Off. Take the time you need with unlimited vacation and dedicated health & wellness days. SimSpace provides flexible solutions to meet the diverse work-life needs of team members.
- Parental Leave. Paid leave plans to support you and your loved ones during life’s most important moments.
- Ownership Opportunities: Equity stock options at hire, with annual performance-based grants—become an invested stakeholder in our shared success.
- Referral Rewards: Earn $1,500–$3,500 for every qualified hire through our employee referral program.
- Peloton Interactive Wellness Program: Full- and partial- subsidized membership plans and equipment discounts to help you reach your personalized fitness goals.
- Continuous Learning: Access a LinkedIn Learning membership to prioritize your personal and professional development.
- Social Connections: Monthly reimbursements for meaningful connections with teammates through our SocialSpace Community.
- Extra Perks: Legal plan coverage, pet insurance, wellness reimbursements, and more to simplify life’s details.
SimSpace is an Equal Opportunity Employer:
In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire.
SimSpace is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, pregnancy, genetic information, disability, status as a protected veteran, or any other protected category under applicable federal, state, and local laws. We are committed to providing an inclusive and welcoming environment for all members of our staff, clients, volunteers, subcontractors, vendors, and clients.
Research shows that women and people from underrepresented groups only apply to jobs if they meet all of the qualifications. However, no one ever meets 100% of the qualifications. SimSpace encourages you to break that statistic and to apply. We look forward to your application!
We also consider qualified applicants regardless of criminal histories, in accordance with applicable law. We are committed to providing reasonable accommodations for qualified individuals with disabilities in our job application procedures. If you need assistance or accommodation due to a disability, please contact careers@simspace.com.
SimSpace does not accept unsolicited resumes from employment agencies.
Actual compensation for the position is based on a variety of factors, including, but not limited to affordability, skills, qualifications and experience, and may vary from the range.
Create a Job Alert
Interested in building your career at SimSpace? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field