
Site Reliability Engineering Manager, Consumer
About Attain
Built for consumers and companies, alike.
In a world driven by data, we believe consumers and businesses can coexist. Our founders had a vision to empower consumers to leverage their greatest asset—their data—in exchange for modern financial services. Built with this vision in mind, our platform allows consumers to access savings tools, earned wages and rewards without cost or hidden fees. In exchange, they give permission to use their real-time data for research, insights and targeted advertising.
At Attain, your contribution will help us build a more equitable and efficient data sharing ecosystem—whether helping consumers access modern financial services or businesses leverage data to achieve better outcomes. You’ll have the opportunity to work directly with hands-on leaders and mission-driven individuals everyday.
Attain Office Hybrid Schedule (where applicable):
- Redwood City, CA: Mondays (in-office for stand-ups, all-hands) and choice of three days between Tues-Friday
- Chicago, IL & New York, NY: 4 days in-office; 1 day remote
About the Role
As the Site Reliability Engineering Manager, you will manage our consumer SRE team in building out and maintaining the infrastructure and supporting tools that power all of our B2C applications, as well as ensure their uptime, stability and security. You will work closely with the leads of the other engineering and product teams at Attain in helping to architect our systems for security, observability, reliability and scalability. You will lead a team of hard working, driven and supportive SREs setting the direction, vision, and priorities for your team. You will work hands-on with our GCP, AWS and Kubernetes environments. You will be an owner within the engineering organization and be able to make a direct impact on the millions of users of our applications.
What a Typical Week Might Look Like
- Lead architecture and capacity planning discussions to ensure systems are scalable, reliable, and secure
- Manage daily SRE operations, from ticket refinement and estimation through resolution and monitoring
- Refine and document monitoring and alerting for B2C applications
- Track and optimize consumer SLIs and SLOs
- Introduce new processes and technologies to advance consumer infrastructure
- Build and maintain platforms, CI/CD pipelines, networking, access controls, and infrastructure using Terraform
- Develop Helm charts for Kubernetes deployments using Istio, Argo, and Prometheus
- Monitor and maintain BigQuery, Spanner, Postgres, and MySQL databases
You’ll Be a Great Fit If You
- Are a self-motivated leader who thrives on ownership and adaptability
- Bring rigor and process to SRE while fostering collaboration and continuous learning
- Are passionate about automation and hands-on infrastructure management
- Value feedback and personal growth
Preferred Qualifications
- 6+ years building and maintaining large-scale cloud-native infrastructure (AWS and/or GCP)
- Experience leading SRE teams and cross-functional communication
- Proven success managing maintenance and outages for large-scale consumer applications
- Skilled in Kubernetes, Istio, Prometheus, and Argo
- Proficient in SQL, event streaming, and pub/sub
- Familiar with serverless technologies and infrastructure-as-code (Terraform)
- Strong computer science and engineering fundamentals
- Knowledge of SOC2 and PCI compliance
We are excited to hear from you.
At Attain, we are passionate about finding people who help us celebrate progress at our growth-oriented organization. We encourage you to apply, even if your experience doesn’t match every detail on the job description. If we don’t see something that immediately fits, we will keep your resume on file for future opportunities.
Create a Job Alert
Interested in building your career at Attain? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field