Senior Site Reliability Engineer
Who We Are
HP IQ is HP’s new AI innovation lab. Combining startup agility with HP’s global scale, we’re building intelligent technologies that redefine how the world works, creates, and collaborates.
We’re assembling a diverse, world-class team—engineers, designers, researchers, and product minds—focused on creating an intelligent ecosystem across HP’s portfolio. Together, we’re developing intuitive, adaptive solutions that spark creativity, boost productivity, and make collaboration seamless.
We create breakthrough solutions that make complex tasks feel effortless, teamwork more natural, and ideas more impactful—always with a human-centric mindset.
By embedding AI advancements into every HP product and service, we’re expanding what’s possible for individuals, organisations, and the future of work.
Join us as we reinvent work, so people everywhere can do their best work.
About The Role
As a Senior Site Reliability Engineer, you will build scalable and reliable infrastructure and processes in support of HP IQ’s mission to transform the way people work. Reliability at high scale is a core feature in our product and is essential for a great user experience. We also understand that maintaining both engineering velocity and reliability is essential to building a great product. Providing a low-latency, privacy-respecting, ultra-durable experience is an unwavering requirement for HP IQ to achieve our vision. Additionally, the team empowers our employees to safely and cost-effectively maintain company-wide velocity. We operate as software engineers with a focus on building concrete, reliable, and self-service infrastructure.
What You Might Do
- Architect, build, automate, and maintain our mission critical infrastructure for both internal and external customers using Infrastructure-as-Code.
- Leverage your expertise in cloud technologies, software development, and operations to define and implement impact focused strategies and roadmaps.
- Collaborate with leadership to ensure that the platform the team builds meets the goals of the broader organization.
- Help manage vendors, licensing, and contracts related to cloud technologies.
- Collaborate with software developers, providing guidance to ensure the implementation of best practices for security, observability, resiliency, and disaster recovery.
- Unlock and increase engineering velocity and safety through repeatable processes, automation, and best practices of various cloud platforms using your knowledge and experience.
- Be part of a production on-call support rotation for Infrastructure-related services that the team owns.
- Provide guidance around best practices for Incident Management: ensuring transparency during an incident and a blameless incident review culture.
Essential Qualifications
- 5+ years of Production Engineering, SRE, or similar experience
- Strong understanding of building systems at scale on cloud platforms such as AWS, Azure, and/or Google Cloud.
- Expertise in virtualization, containerization, networking, and security.
- Experience with current CI/CD systems, and observability platforms
- Experience with CI/CD systems such as Github Actions, GitLab CI/CD, and/or CircleCI.
- Experience with observability platforms such as Datadog, Prometheus/Loki/Grafana, Honeycomb, and/or OpenTelemetry
- Strong fluency in Python, maintaining a code base including CI/CD, unit testing, and overall design.
- Strong proficiency with writing and maintaining Infrastructure-as-Code such as Terraform or Pulumi.
- Solid understanding of cloud security tooling such as Wiz, AWS GuardDuty, Azure Sentinel, and/or Sysdig.
Preferred Skills
- Experience in writing and debugging Kubernetes operators.
- Experience in embedding with other software development teams to help ensure that new services are production ready: design, observability, resiliency, etc.
- Experience running Hashicorp Vault at scale, specifically with regards to secrets management and public key infrastructure.
- Experience using ArgoCD (or equivalent) to deploy workloads to Kubernetes clusters.
- Experience running multi-region workloads and all of the challenges that come with it: load-balancing, distributed datastores, etc.
- Experience running CockroachDB and/or PostgreSQL at scale.
Salary Range: $149,900 - $270,000
Compensation & Benefits (Full-Time Employees)
The salary range for this role is listed above. Final salary offered is based upon multiple factors including individual job-related qualifications, education, experience, knowledge and skills.
At HP IQ, we offer a competitive and comprehensive benefits package, including:
- Health insurance
- Dental insurance
- Vision insurance
- Long term/short term disability insurance
- Employee assistance program
- Flexible spending account
- Life insurance
- Generous time off policies, including;
- 4-12 weeks fully paid parental leave based on tenure
- 11 paid holidays
- Additional flexible paid vacation and sick leave (US benefits overview)
Why HP IQ?
HP IQ is HP’s new AI innovation lab, building the intelligence to empower humanity—reimagining how we work, create, and connect to shape the future of work.
- Innovative Work
Help shape the future of intelligent computing and workplace transformation. - Autonomy and Agility
Work with the speed and focus of a startup, backed by HP’s scale. - Meaningful Impact
Build AI-powered solutions that help people and organisations thrive. - Flexible Work Environment
Freedom and flexibility to do your best work. - Forward-Thinking Culture
We learn fast, stay future-focused, and imagine what comes next—together.
Equal Opportunity Employer (EEO) Statement
HP, Inc. provides equal employment opportunity to all employees and prospective employees, without regard to race, color, religion, sex, national origin, ancestry, citizenship, sexual orientation, age, disability, or status as a protected veteran, marital status, familial status, physical or mental disability, medical condition, pregnancy, genetic predisposition or carrier status, uniformed service status, political affiliation or any other characteristic protected by applicable national, federal, state, and local law(s).
Please be assured that you will not be subject to any adverse treatment if you choose to disclose the information requested. This information is provided voluntarily. The information obtained will be kept in strict confidence.
If you’d like more information about HP’s EEO Policy or your EEO rights as an applicant under the law, please click here: Equal Employment Opportunity is the Law Equal Employment Opportunity is the Law – Supplement
Apply for this job
*
indicates a required field