Back to jobs
New

Site Reliability Engineering Manager

Italy

About CoreView  

CoreView is the global leader in Microsoft 365 (M365) tenant resilience, serving over 23 million users worldwide. We empower the world’s leading organizations to master the complexity of Microsoft M365.  Through robust security and precise governance, we help ensure that our client’s environments stay cyber-resilient and productive, no matter how complex they are.  
 
Our unified, cloud-native platform delivers powerful automation, rapid value, and end-to-end visibility across the entire M365 ecosystem. Backed by world-class support and a collaborative, innovative culture, CoreView is a place where your ideas matter, and your work truly impacts global enterprises.  

Job Summary

To support our growth, we are looking for an SRE Manager in Milan, Italy.

As our Site Reliability Engineering Manager, you will be responsible for building and leading our Site Reliability Engineering team from the ground up. In this role, you will define the team structure, establish SRE practices and processes, and actively contribute as a hands-on engineer — especially in the early stages. You will work closely with the Director of Cloud & IT Infrastructure and collaborate with software engineering, DevOps, and IT operations teams to ensure the reliability, scalability, and performance of our systems.

The ideal candidate has a strong background in IT operations or infrastructure engineering and has successfully led or coordinated technical teams. We are looking for someone who combines solid operational expertise with a natural inclination toward leadership, process building, and continuous improvement.

Job Responsibilities 

  • Build the SRE team from scratch: define roles, participate in hiring, onboard and mentor engineers.
  • Define the SRE technical direction and operational roadmap, aligning reliability initiatives with business objectives and product priorities.
  • Establish SRE practices, processes, and culture within the organization, including on-call rotations, incident management, and blameless post-mortems, fostering a culture of psychological safety and continuous learning.
  • Define and track SLOs, SLIs, and error budgets in collaboration with engineering and product teams.
  • Act as a hands-on contributor during the team ramp-up phase, directly involved in designing and operating critical infrastructure on Azure.
  • Own the incident management process end-to-end: detection, response, escalation, resolution, and root cause analysis.
  • Drive automation initiatives — including AI-assisted operations where applicable — to reduce toil and improve the operational efficiency of the team.
  • Design and oversee monitoring, alerting, and observability solutions to ensure full visibility across systems and services.
  • Own capacity planning and cloud cost governance, ensuring infrastructure scales efficiently while remaining cost-effective.
  • Build and maintain a strong documentation culture: runbooks, operational procedures, incident playbooks, and architectural decisions.
  • Collaborate with software engineering teams to embed reliability and operational readiness into the development lifecycle.
  • Define and maintain disaster recovery plans, backup strategies, and business continuity procedures.
  • Ensure that security best practices and compliance requirements are applied consistently across infrastructure and operations.
  • Report on team performance, reliability metrics, and operational health to senior leadership.

Job Requirements 

  • 5+ years of experience in IT operations, infrastructure engineering, or SRE roles.
  • 1+ years of experience leading or coordinating a technical team, with demonstrated ability to hire, mentor, and develop engineers.
  • Solid hands-on experience with Azure cloud services and cloud-based infrastructure management.
  • Strong understanding of IT operations best practices, including incident management, change management, and service continuity.
  • Experience with monitoring and observability tools (e.g., Prometheus, Grafana, Azure Monitor, ELK stack).
  • Familiarity with Infrastructure as Code (IaC) tools such as Terraform or Ansible.
  • Good knowledge of containerization and orchestration technologies (Docker, Kubernetes / AKS).
  • Ability to define and implement SLOs, SLIs, and error budgets in a production environment.
  • Experience with capacity planning and cloud cost management.
  • Strong analytical and problem-solving skills, with the ability to manage complex incidents under pressure.
  • Excellent communication and stakeholder management skills, with the ability to interact effectively at both technical and leadership levels.
  • Proactive mindset, ownership attitude, and passion for building reliable, scalable systems.
  • Proficient in English. You read and write proficiently and speak at a conversational level in English.

Nice-to-have

  • Experience in a greenfield or scale-up environment, with a track record of building teams or processes from scratch.
  • Background in SRE discipline with knowledge of Google SRE principles and practices.
  • Experience with CI/CD pipeline management and DevOps practices.
  • Familiarity with scripting or automation (Python, Bash, PowerShell).
  • Experience leveraging AI and automation tools to improve operational workflows and reduce manual intervention.
  • Knowledge of chaos engineering practices and fault injection tools (e.g., Azure Chaos Studio).
  • Relevant certifications such as Microsoft Certified: Azure Administrator, Azure DevOps Engineer Expert, or ITIL Foundation.

#LI-CW1


Coreview Values

Ownership Mindset: Take ownership. Drive outcomes. 

One Team: One team, one goal, embracing diversity - to achieve more together. 

Velocity: Decide fast. Deliver fast. Repeat.

Continuous Improvement: Curiosity drives us. We challenge the status quo.

Customer First: Listen deeply. Solve boldly.  

Resilience: Steady under pressure.

CoreView is an organisation which values the strength that diversity brings to the workplace. As an employer, we seek to promote equal opportunity through affirmative action. All qualified applicants will therefore receive consideration for employment and will not be discriminated against based on gender/sex, race/ethnicity, disability, age or any other protected group status (such as protected veteran status) or characteristic that is protected by local legislation. 


Privacy Notice: By submitting your application, you acknowledge that CoreView will process your personal data for recruitment purposes in accordance with our Privacy Policy

Apply for this job

*

indicates a required field

Phone
Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf


Select...

Our privacy policy can be found here.

Select...