DevOps Manager
Senior Manager – DevOps & Site Reliability Engineering (SRE)
The Opportunity
We are seeking an experienced and strategic Senior Manager – DevOps & SRE to lead and evolve our reliability and platform engineering capabilities across our global eCommerce ecosystem.
This role goes beyond traditional DevOps management. You will be responsible for defining and driving our reliability strategy, embedding SRE principles (SLIs, SLOs, error budgets), and ensuring our platforms operate at scale with high availability, performance, and resilience.
You will lead a distributed team of DevOps and SRE engineers, working closely with Engineering, Product, Security, and Architecture to enable reliable, scalable, and automated cloud-native systems across Azure, AWS, and GCP.
Key Responsibilities
Leadership & Organizational Impact
-
Lead, mentor, and grow a high-performing DevOps & SRE function.
-
Define clear ownership models, reliability standards, and ways of working.
-
Elevate engineering maturity through automation, observability, and operational excellence.
-
Drive accountability and promote a culture of reliability, learning, and continuous improvement.
-
Partner with senior stakeholders to align platform reliability with business objectives.
Reliability Strategy & SRE Practices
-
Define and implement SRE best practices (SLIs, SLOs, error budgets).
-
Own incident management strategy, postmortems, and systemic improvements.
-
Improve resilience through proactive risk identification and mitigation.
-
Establish measurable reliability KPIs aligned with customer experience.
Cloud Infrastructure & Platform Engineering
-
Oversee cloud operations primarily in Microsoft Azure, with exposure to AWS and GCP.
-
Ensure infrastructure is scalable, secure, and cost-efficient.
-
Drive Infrastructure as Code adoption (Terraform, Bicep/ARM).
-
Define platform standards for Kubernetes and containerized environments.
Automation, CI/CD & Developer Enablement
-
Champion CI/CD best practices and release reliability.
-
Improve deployment strategies (blue/green, canary releases).
-
Reduce operational toil through automation and self-healing systems.
-
Support high-traffic eCommerce events and critical production workloads.
Observability & Operational Excellence
-
Define and evolve our observability strategy (Azure Monitor, Grafana, Datadog, Prometheus, etc.).
-
Improve signal-to-noise ratio in monitoring and alerting.
-
Drive root cause analysis discipline and continuous improvement loops.
-
Explore AI-assisted operations for incident detection, alert optimization, and operational efficiency.
Security & Compliance
-
Ensure secure cloud practices (IAM, least privilege, data protection).
-
Partner with Security to enforce compliance and governance standards.
-
Embed security and reliability into the full SDLC lifecycle.
Key Experience & Skills
-
10+ years of experience in DevOps, SRE, or Platform Engineering roles.
-
3+ years leading and scaling technical teams.
-
Strong hands-on background in Microsoft Azure (required) AWS (required) and GCP (nice to have).
-
Deep understanding of cloud-native architectures and Kubernetes.
-
Proven experience implementing SRE frameworks (SLAs, SLOs, incident management).
-
Strong experience with Infrastructure as Code (Terraform, ARM/Bicep).
-
Observability expertise (Grafana, Datadog, Prometheus, Azure Monitor).
-
Experience managing production systems at scale (high-traffic environments preferred).
-
Strong stakeholder management and communication skills.
-
Strategic mindset with the ability to balance technical depth and business impact.
Nice to Have
-
Experience in global eCommerce platforms.
-
Experience leading cloud transformation initiatives.
-
Exposure to AI-driven operational tooling.
-
Relevant certifications (Azure, Kubernetes, Cloud Architecture).
Why join us?
Direct hire with a product company: This isn’t consulting or outsourcing. Shape the future of our product with ownership, impact, and long-term vision.
Competitive salary and benefits: Your financial well-being is important to us. Join ESW and experience the satisfaction of being rewarded for your hard work, dedication, and commitment.
Flexible retribution: Make your everyday life easier and more affordable with our flexible retribution options, including meal vouchers, public transport tickets, kindergarten support, and health insurance.
International environment: Work with people from over 30 different cultures and get the chance to use English daily.
Professional and personal development: We will ensure your talent is nurtured and cultivated for growth and success throughout your career with ESW.
Hybrid working: Enjoy the best of both worlds with 2–3 days in our office in Méndez Alvaro, and 2–3 days working from the comfort of your home.
Diversity, Belonging & Inclusion: When we win, we win together. You'll be part of a culture that values every individual for who they are, fostering an environment where uniqueness is encouraged.
ESW is an equal opportunity employer, and we’re proud of our ongoing efforts to foster diversity, equity, & inclusion in the workplace. Individuals seeking employment and employees at ESW are considered without regard to race, color, religion, national origin, age, sex, gender, gender identity, gender expression, sexual orientation, marital status, medical condition, ancestry, disability, military or veteran status, or any other characteristic protected by applicable law.
If you require any reasonable accommodations or adjustments throughout the hiring process, please let us know. We are dedicated to ensuring equal access and opportunity for all candidates.
#LI-hybrid #LI-TS1
Create a Job Alert
Interested in building your career at ESW? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field
