Site Reliability Engineer
Company Description
At Amwell, we’re transforming healthcare for all—powered by technology and inspired by people. Here, your ideas don’t just matter—they drive real change, improving lives on a global scale.
We marry technology and innovation with clinical excellence to provide trusted solutions that solve the healthcare industry’s biggest pain points and are on a mission to enable greater access to more convenient, affordable, and effective care.
We do this through our technology-enabled care platform that is designed to help our clients achieve their digital care ambitions – today and in the future. We offer programs spanning the full care continuum, including urgent, acute and specialty care, behavioral health, and services for the treatment of chronic conditions such as heart and cardiometabolic diseases. Programs are powered by Amwell as well as our growing partner network.
For almost two decades, Amwell has proudly served some of the largest and most sophisticated healthcare organizations in the U.S. and worldwide. Our team is passionate about technology’s role in transforming care delivery and making it more equitable, accessible, efficient, cost-effective and navigable for all.
Brief Overview
The Site Reliability Engineer builds and operates the paved roads that service teams use every day. You take shared infrastructure from idea to module to production, then you keep it boring. This is not a research role and not a hero role. It is delivery with discipline. You build with intention. You do not just make things work, you make them make sense. You challenge assumptions, question defaults, and tighten bolts others ignore. You move fast, but not recklessly. You are becoming the engineer others trust to take ownership and deliver cleanly.
This is a hands-on engineering role who can work independently on well-scoped problems with guidance, follow established patterns, and improve them when the evidence supports change. You partner closely with Security, Networking, and SRE because the platform is where constraints become real. As a Site Reliability Engineer, you help determine whether the platform feels chaotic or calm to everyone else. Your work directly affects developer velocity, operational safety, and trust in the system. When the platform is boring, predictable, and resilient, it is because engineers like you did the work carefully and well.
Core Responsibilities
Cloud Foundations
• Implement cloud infrastructure in AWS using approved patterns and guardrails.
• Support EKS based runtime foundations, including cluster add-ons and shared services.
• Build environment parity across nonprod and prod and flag any required divergence early with evidence.
• Help make cloud primitives predictable, supportable, and easy to consume.
Infrastructure Patterns and Modules
• Develop and maintain reusable platform modules and templates using Terraform or CDKTF where applicable.
• Contribute to baseline building blocks: VPC patterns, IAM primitives, EKS base clusters, ingress patterns, secrets, and shared data stores as assigned.
• Keep modules consumable through sane defaults, versioning, changelogs, and upgrade guidance.
• Reduce drift by enforcing standards through code, not documentation alone.
Automation and Delivery Enablement
• Improve CI workflows for infrastructure changes: plan and apply safety, policy checks, drift detection, and promotion across environments.
• Remove manual steps from provisioning and onboarding by turning them into pipelines and documented runbooks.
• Support internal module consumption patterns, including examples and reference implementations.
• Favor repeatability and clarity over clever one-off solutions.
Operations and Reliability
• Operate platform owned services with an ownership mindset. Ownership is not optional.
• Participate in on call for platform services and follow incident procedures.
• Write and maintain runbooks, dashboards, and alerts for what you ship.
• Drive post-incident follow-ups that reduce repeat failures.
Security, Compliance, and Governance
• Implement least privilege IAM patterns and secure by design defaults.
• Partner with Security to integrate controls into pipelines and platform defaults.
• Treat auditability as a feature: logs, approvals, traceability, and evidence.
• Follow established governance and exception processes and document deviations
Qualifications
• 3 plus year's experience in platform engineering, DevOps, SRE, or infrastructure engineering.
• Working experience with AWS and infrastructure as code (Terraform preferred, CDKTF acceptable).
• Practical Kubernetes experience, preferably EKS (deploying, operating, debugging).
• Comfort with networking fundamentals: DNS, TLS, routing, load balancers, and security groups.
• Ability to debug pipelines and distributed failures without guessing.
• Strong written communication: design notes, runbooks, and crisp status updates
Do Well. Live Well. At Amwell.
Driven by our mission and values, we foster a workplace where Delivering Awesome, being Customer First and operating as One Team aren’t just aspirations – they are how we work, every day.
Our people are our greatest asset. We strive to empower their growth and development not only as Amwellians but as individuals, through generous total rewards packages, a virtual-first work environment, work-life flexibility, including Summer Fridays and designated Mental Health Days, as well as opportunities to stretch and learn – to name a few. It’s our people who truly differentiate us. Ask anyone and they’ll tell you – you’ll never work with more passionate, more driven and more caring team members.
We champion a culture of respect and inclusion, accountability and integrity, innovation and collaboration. At Amwell, you’ll do the most meaningful work of your career—improving healthcare for millions, growing alongside incredible teammates, and being valued for who you are.
Benefits
- Flexible Personal Time Off (Vacation time)
- 401K match
- Competitive healthcare, dental and vision insurance plans
- Paid Parental Leave (Maternity and Paternity leave)
- Employee Stock Purchase Program
- Free access to Amwell’s Telehealth Services, SilverCloud and The Clinic by Cleveland Clinic’s second opinion program
- Free Subscription to the Calm App
- Tuition Assistance Program
- Pet Insurance
Salaried, Exempt Roles
The typical base salary range for this position is $104,040 - $127,160. The actual salary offer will ultimately depend on multiple factors including, but not limited to, knowledge, skills, relevant education, experience, complexity or specialization of talent, and other objective factors. In addition to base salary, this role may be eligible for an annual bonus based on a combination of company performance and employee performance. Long-term incentive and short-term variable compensation may be offered as part of the compensation package dependent on the role. Some roles may be commission based, in which case the total compensation will be based on a commission and the above range may not be an accurate representation of total compensation.
Further, the above range is subject to change based on market demands and operational needs and does not constitute a promise of a particular wage or a guarantee of employment. Your recruiter can share more during the hiring process about the specific salary range based on the above factors listed.
https://business.amwell.com/company/privacy-notice-applicants
Privacy Notice
Apply for this job
*
indicates a required field