Principal Site Reliability Engineer - (Infra Cloud Operations)
About Upshop
Upshop is the market leader in Total Store Operations solutions for the Grocery and C-Store markets. We offer an AI-powered, SaaS platform connecting Fresh, Center, eCommerce, and DSD department operations to deliver a simplified, smarter, more connected store experience. Customers running Upshop realize significant improvements in sales, shrink, food safety and sustainability across the entire store. 150+ retail chain accounts trust our software in over 30k+ stores, 9 countries, and 3 continents. Upshop is backed by Level Equity, a growth focused private equity firm, and acquired Invafresh in 2024, doubling the size of the company.
Overview of the Role
Are you a seasoned DevOps professional looking to elevate software delivery at scale? At Upshop, we’re seeking a highly skilled DevOps Engineer to lead the charge in optimizing our CI/CD pipelines, managing code repositories, and ensuring seamless, high-quality software releases across our development teams.
In this role, you'll collaborate closely with engineering leaders to maintain efficient deployment workflows, drive automation initiatives, and deliver the tools that keep our platform running smoothly. If you thrive in fast-paced environments and are passionate about improving developer experience and release velocity, we’d love to hear from you.
Responsibilities
- Design, Develop and implement Cloud Infrastructure solutions on Azure and GCP ensuring scalability, reliability and security.
- Provide deep technical expertise in cloud networking, including virtual private clouds (VPCs), load balancing, and network security.
- Implement and manage CI/CD pipelines to ensure smooth and efficient deployment of applications, network configurations and services.
- Provide deep technical expertise in cloud platforms. Azure and GCP - predominantly Azure, networking, storage, and virtualization.
- Ensure cloud infrastructure complies with security standards and regulations, and participate in security incident responses.
- Provide support to ensure mission critical applications and components are being monitored and meet security, reporting and retention requirements as well as disaster recovery requirements of clients.
Qualifications
- Bachelor's degree in Computer Science, Information Technology, or a related field. Advanced degrees are often preferred.
- Extensive experience with cloud platforms such as Azure, or Google Cloud Platform (GCP).
- Proficiency in automation tools like Ansible, Terraform etc.
- Strong understanding of CI/CD pipelines and tools like Azure DevOps and GitLab.
- Knowledge of containerization technologies such as Docker and Kubernetes.
- Experience with scripting languages like PowerShell, Ruby etc.
- Atleast 8 yrs experience in Several years of experience in DevOps, cloud infrastructure, and network engineering.
- Proven track record of designing and implementing scalable cloud solutions.
- Excellent problem-solving and analytical skills.
- Strong communication and collaboration abilities.
- Ability to mentor and lead technical teams
- Other Considerations (travel/hours availability, etc).
- Occasional travel is required (10%).
Benefits/Perks
- Hybrid – with ability to work in office in either Austin or Toronto
- Competitive salary
- Employer-matched 401(k) or RRSP plan
- Attractive paid time off policy / Flexible vacation policy
- Career growth and development opportunities
- Home office support set-up
Apply for this job
*
indicates a required field