Senior Site Reliability Engineer
About Rent the Runway
Founded in 2009, Rent the Runway is disrupting the trillion-dollar fashion industry and changing the way women get dressed through the Closet in the Cloud, the world’s first and largest shared designer closet. RTR’s mission has remained the same since its founding: powering women to feel their best every day. Through RTR, customers can subscribe, rent items a-la-carte and shop resale from hundreds of designer brands. The Closet in the Cloud offers a wide assortment of millions of items for every occasion, from evening wear and accessories to ready-to-wear, workwear, denim, casual, maternity, outerwear, blouses, knitwear, loungewear, jewelry, handbags, activewear, ski wear, home goods and kidswear. RTR has built a two-sided discovery engine, which connects deeply engaged customers and differentiated brand partners on a powerful platform built around its brand, data, logistics and technology. Under CEO and Co-Founder Jennifer Hyman’s leadership, RTR has been named to CNBC’s “Disruptor 50” five times in ten years, and has been placed on Fast Company’s Most Innovative Companies list four times, while Hyman herself has been named to the “TIME 100: Most Influential People in the World" and as one of People Magazine’s “Women Changing the World.”
Galway Office
Rent The Runway established its European Technology Hub in Galway in April 2019. Based in the historic Claddagh area of the city, the growing team in Galway tackles core technology challenges and influences the next generation of services critical to Rent The Runway’s success and continued growth.
The Galway office is Rent the Runway's first international office outside the US and enables the company to significantly expand its Software Engineering, Product Development, Machine Learning Engineering and Data Science footprint. Rent The Runway’s Galway-based employees have the opportunity to grow their careers across several roles and career paths in Technology.
About the Team:
Our Platform Engineering team is smart, pragmatic, and entrepreneurial. We are reliability-focused and relentlessly passionate about making the closet-in-the-cloud a reality for our customers. We drive the operational capability of creating and advocating best practices to support largely distributed, fault-tolerant systems in the cloud that serve our customers every day.
We practice continuous improvement & process management techniques to put quality into everything we do. We cross-functionally service the Rent the Runway business and support multiple departments across IT, Engineering, Product, Security, Compliance and the Business.
We are always interested in speaking with developing and experienced SRE engineers who are interested in joining our team. If that’s you, read on for a snapshot of the type of work our SRE engineers do across our various teams, share your details and a member of our Talent team will be in touch!
About the Job:
As a Senior Site Reliability Engineer (SRE) you will have the opportunity to spearhead and lead technology initiatives in the realm of cloud infrastructure, software delivery and observability. You will be responsible for building and developing tooling, policies, and processes to advance Rent The Runway to higher levels of scale, and performance. You will lead assigned projects, and be responsible for the overall delivery of these initiatives. You will be part of a high-impact engagement with the Platform Engineering team delivering operational excellence through system automation, self-service and developer tooling that empowers the entire organisation to deliver exceptional results for our customers.
What You’ll Do:
- Utilise programming languages like Terraform, Python, Go, Container Orchestration services including Docker and Kubernetes, and a variety of GCP and services to drive service reliability.
- Implement software development practices to build observability, alerting, tracing, automation, and self-healing capabilities to maintain the highest levels of platform availability.
- End-to-end coordination across platforms, while supporting, identifying, responding, and reporting of issues; then escalating to respective teams for remediation promptly.
- Develop maintenance and operations automation through CI/CD
About You:
- Passion for CI/CD: Demonstrated enthusiasm for developing and improving Continuous Integration/Continuous Deployment processes.
- Orchestration Technology Experience: 5 years of hands-on experience with orchestration tools such as Kubernetes and/or Helm.
- Coding and Scripting Proficiency: Advanced skills in Terraform and Ansible, with a solid understanding of CI/CD tools like GitHub, GitLab, and Artifactory.
- Monitoring Solutions Expertise: Practical experience with monitoring, alerting, and logging tools, including Splunk and GCP Monitoring.
- Production Environment Support: At least 3 years of experience in maintaining production environments across cloud platforms like GCP, AWS, or Azure.
- Software Development: 5+ years of experience in developing and delivering products using programming languages such as Bash, Python, Golang, or Java.
- System Optimization: Proven track record of enhancing existing systems, building robust infrastructure, and automating processes to reduce workload.
- Agile Methodology: Experience working within Agile teams, adhering to sprint cadences and delivery timelines.
- Problem-Solving Skills: Ability to effectively triage issues and conduct thorough root-cause analyses when necessary.
- Team Collaboration: Strong team player with the ability to work collaboratively within diverse groups.
- SRE Influence: Capable of driving Site Reliability Engineering practices among development and operations teams.
- On-Call Duties: Willingness to participate in an on-call rotation, troubleshoot production issues, perform Root Cause Analyses, and share insights with the Engineering and Operations teams.
Benefits:
At Rent the Runway, we’re committed to the happiness and wellbeing of our employees, and aim to create a workplace that fosters both personal and professional growth. Our inclusive benefits include, but are not limited to:
- Generous Paid Time Off including annual leave, paid bereavement, and family sick leave - every employee needs time to take care of themselves and their family.
- Universal Paid Parental Leave for both parents + flexible return to work program - because we know your newest family member(s) deserve your undivided attention.
- Paid Sabbatical after 5 years of continuous service - unplug, recharge, and have some fun.
- Competitive Stakeholder Pension - taking care of your future.
- Comprehensive health, dental care and dependents care from day 1 of employment - Your health comes first and we’ve got you covered.
- Company wide events and outings - our team spirit is no joke - we know how to have fun!
- Hybrid Work - This is a hybrid role based in our Galway, Ireland office. Employees have to option to work remotely 2-3 days per week.
Rent the Runway is an equal opportunity employer. In accordance with applicable law, we prohibit discrimination against any applicant or employee on any legally-recognised basis, including, but not limited to: gender, marital status, family status, age disability, sexual orientation, race, religion, and membership of the Traveller community.
#LI-EM1
By submitting your application below, you agree that you have read and acknowledge Rent the Runway's Candidate Privacy Policy, found here.
Apply for this job
*
indicates a required field