SRE II
About Bluevine
Bluevine is on a mission to enable a better financial future for small business owners through innovative banking solutions designed just for them. By combining best-in-class technology with advanced security and a deep understanding of the small business community, we deliver end-to-end banking and lending products that empower always-on entrepreneurs to grow their businesses with confidence.
As a dynamic company with massive potential, we’re backed by leading investors such as Lightspeed Venture Partners, Menlo Ventures, 83North, Citi Ventures, and nearly 9 years of proven success. Since launching in 2013, we have grown exponentially, amassing over 400,000 customers across all 50 states and a global team of more than 500 people. Our passion is driven by purpose: to give small businesses the tools they need to succeed, and we’re just getting started.
All of this begins with our team, who are driven by collaboration, problem-solving, and learning and growing together. With a commitment to innovation and community impact, our mission is to help every small business, and every team member, thrive. Join us!
We are seeking a highly skilled and proactive SRE to join our team. As the first line of defense and Tier 1 support, you will play a critical role in ensuring the uninterrupted operation of our services across various environments, with a primary focus on AWS cloud infrastructure.
WHAT YOU'LL DO:
Monitoring and Alert Management: Continuously monitor production, staging, and other environments for alerts and anomalies. Respond promptly to alerts, assess their severity, and take appropriate action to ensure service continuity.
Incident Triage and Resolution: Act as the first point of contact for all incidents related to service continuity. Quickly assess and triage incidents, escalating to appropriate teams if necessary, and drive them to resolution within defined SLAs.
Proactive Issue Identification: Proactively identify potential issues or areas of concern within the AWS cloud environment that could impact service continuity. Work closely with the engineering and operations teams to address these issues before they escalate.
Documentation and Knowledge Sharing: Maintain comprehensive documentation of incidents, resolutions, and best practices. Share knowledge and insights with the broader team to improve incident response and prevention processes.
Collaboration and Communication: Effectively collaborate with cross-functional teams, including engineering, operations, and security, to address service continuity challenges. Ensure clear and timely communication with stakeholders regarding incident status and resolution.
Continuous Improvement: Continuously seek opportunities to improve processes, tools, and monitoring capabilities to enhance service continuity in the AWS cloud environment. Actively participate in post-incident reviews to identify lessons learned and implement preventive measures.
Emergency Response: Be available for on-call rotations and respond to emergency situations outside of regular business hours when necessary to ensure the stability and availability of our services.
WHAT WE LOOK FOR:
- 1+ years of experience on an SRE or service continuity team.
- Basic understanding of cloud computing principles and AWS services.
- Excellent problem-solving and troubleshooting skills with the ability to remain calm under pressure.
- Familiarity with AWS troubleshooting, diagnostic tools, and utilities.
- Experience with incident management processes and tools.
- Experience with monitoring and alerting tools such as CloudWatch, Grafana, Prometheus, New Relic, OpenSearch, or similar.
- Effective communication skills with the ability to convey technical information to both technical and non-technical stakeholders.
- Proven experience in a similar role, preferably in a cloud-based environment with a focus on AWS.
Bonus points if you also have:
- AWS certifications (e.g., AWS Certified Solutions Architect, AWS Certified SysOps Administrator)
- Bachelor's degree in computer science, engineering, or related field
BENEFITS & PERKS
- Excellent group health coverage and life insurance
- Stock options
- Hybrid work model
- Meal allowance
- Transportation assistance (terms and conditions apply)
- Generous paid time off plan and holidays
- Company-sponsored mental health benefits
- Financial advisory services for both short- and long-term goals
- Learning and development opportunities to support career growth
- Community-based volunteering opportunities