Senior Manager, Engineering
Senior Manager, Engineering - AppSec & SRE
Want to lead a global team responsible for the most important product features – availability, reliability & security? Sumo’s SRE program focuses on continual data-driven evolution and improvement of the reliability, security, and efficiency of our global scale technological presence. We are looking for a great leader with a passion for site reliability, continuous technology improvement, and reducing the operational workload of our own engineers - as well as our customers who leverage our products for their own monitoring and reliability.
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Sumo’s services have reliability, uptime appropriate to users' needs as well as the ability to quickly and continuously deliver value to our customers.
Responsibilities
- AppSecurity Program:
- Establish and evolve strategies and processes for the product and engineering organization to meet security, privacy, and compliance objectives
- Guide leadership and teams in the risk-based prioritization of compliance, privacy, and security efforts, and maintain respective backlogs and roadmaps
- Coordinate with team leads, managers, and individual contributors to ensure initiatives are prioritized based on risks and rewards, stay on track by identifying and removing blockers, and get completed
- Define OKRs and build and deliver program metrics and communications to provide updates to stakeholders
- Coordinate and support application security initiatives (such as, regular threat modeling and the mitigation of dependency vulnerabilities) across the product and engineering organization
- Help product teams develop secure applications for the Sumo Logic platform.
- Integrate and implement solutions improving Sumo Logic’s security posture.
- Lead security reviews and penetration tests at design and implementation stages.
- Partner with the Security Operations Center (SOC) and Compliance team on our security and compliance posture, vulnerability management, and threat modeling of our tech stack.
- Educate product teams on secure development best practices and Quality Engineering teams on continuous improvement of security testing.
- Reliability Program:
- Drive the program that maintains excellent uptime numbers for our services.
- Manage error budgets and associated policies for key product SLOs.
- Promote blameless post-mortem culture combined with developer operational accountability.
- Continuously reduce operational workload for engineers by means of infrastructure improvements and automation.
- Collaborate to lead the reliability programs with SRE leaders across geo.
- Team Leadership:
- Lead and grow a global team of SREs adept at building extremely high-volume, fault-tolerant, efficient, and scalable backend systems.
- Technical Vision:
- Partner with our technical leadership team to review choices on an ongoing basis, in anticipation of increased scale and ever-evolving technology to meet the demands of growing business. Leverage technical skills to successfully analyze and improve the efficiency, scalability, and reliability of our backend systems.
Required Qualifications and Skills
- B.S. in Computer Sciences or related discipline (M.S., or Ph.D. is a plus).
- Minimum 8+ years of industry experience with a proven track record of ownership, delivery, and operational excellence.
- Minimum 3+ years in a management role.
- Experience being responsible for key SLOs of a cloud-based SaaS: availability, uptime, performance, and security.
- Experience in multi-threaded programming and distributed systems.
- Object-oriented programming experience, for example in Java, Scala, Ruby, or C++.
- Experience with high volumes of data using the latest technologies such as Kafka, Kubernetes and Docker.
- Agile software development experience (test-driven development, iterative and incremental development). Experience in big data and/or 24x7 commercial service is highly desirable.
- Hands-on experience with public cloud Infrastructure-as-a-service and Platform-as-a-service offerings - Amazon Web Services, Google Cloud Platform, etc.
- Good to have:
- Functional knowledge of global information security and privacy regulations (such as, GDPR, CCPA, and FISMA), industry standards (such as, ISO/IEC 27001, PCI DSS, and SOC2), and best practices (such as, OWASP projects, threat modeling methodologies, risk and privacy impact assessments)
- Experience with cloud technology environments and contemporary software engineering stacks
- Ability to influence across all levels of an organization, strong written and verbal communication and organizational skills, and demonstrated success in cross-functional collaboration with internal and external stakeholders
- Professional certifications (such as, CIPM, CISM, and CISSP)
About Us
Sumo Logic, Inc. helps make the digital world secure, fast, and reliable by unifying critical security and operational data through its Intelligent Operations Platform. Built to address the increasing complexity of modern cybersecurity and cloud operations challenges, we empower digital teams to move from reaction to readiness—combining agentic AI-powered SIEM and log analytics into a single platform to detect, investigate, and resolve modern challenges. Customers around the world rely on Sumo Logic for trusted insights to protect against security threats, ensure reliability, and gain powerful insights into their digital environments. For more information, visit www.sumologic.com.
Sumo Logic Privacy Policy. Employees will be responsible for complying with applicable federal privacy laws and regulations, as well as organizational policies related to data protection.
Create a Job Alert
Interested in building your career at Sumo Logic? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field