tags.new

Site Reliability Engineer (SRE)

United States

The world of digital assets is accelerating in speed, magnitude, and complexity, opening the door to new ways for leveraging the blockchain. Fireblocks’ platform and network provide the simplest and most secure way for companies to work with digital assets and it trusted by some of the largest financial institutions, banks, globally-recognized brands, and Web3 companies in the world, including BNY Mellon, BNP Paribas, ANZ Bank, Revolut, and thousands more. 

About the team

The SRE team was recently formed with the goals of establishing first-class observability tools and primary owners of all things production.

Team members are located in several international locations (“follow the sun” model) to provide 24x7 availability.

We are a team of unique individuals, experienced and independent, who get things done.

What You'll Do

As part of your role, you would improve and establish new monitoring, alerting, and observability of services using a wide range of tools. Additionally, you would handle critical alerts and incidents and work directly with R&D to improve and optimize availability.

  • Research Fireblocks blockchain workflows, identify optimization opportunities, issues, and improve monitoring.
  • Help identify root causes for incidents and prevent them from happening again. Solve and orchestrate outages by working with multiple teams.
  • Improve and establish alerting for our infrastructure, services, and business logic
  • Work closely with the R&D and Support: offering education and guidance on integration, support, and monitoring across the toolset
  • Communicate and escalate issues to senior management in R&D and support, write RCAs, and define next steps.
  • Document actions in runbooks and then into automation using Python, Lambda, shell scripts, ArgoCD, and Ansible.
  • Focus on the system's observability, availability, reliability, performance/latency, and monitoring
  • Conduct periodic on-call duties and emergency response

What You'll Bring

  • At least 3+ years of experience as an SRE, Infra Backend in a SaaS environment.
  • You are curious, self-motivated, easy to work with, responsible, and production-aware—fast learner and able to take a project from POC to production, while handling decision-making and communication.
  • Experience with Coding languages - Python/JavaScript/Bash (Must)
  • At least 3+ years of experience with Alerting & Monitoring systems such as DataDog, Coralogix / Splunk / New Relic / Prometheus
  • Experience working with Linux systems from kernel to shell and beyond
  • Cloud systems such as AWS / Google Cloud / Azure
  • Configuration management, such as Ansible/Chef/Puppet/ArgoCD
  • Experience with Docker, Kubernetes, and Helm
  • SCM - Git/bitbucket/gitlab/Phabricator/gerrit
  • High Analytical & Troubleshooting skills - ability to solve complex problems
  • Strong verbal and written communication skills and a collaborative mindset

Want to stand out from the crowd?

  • Previous experience in cryptocurrencies \ blockchains - a big advantage
  • In-depth knowledge in: Linux optimization, nginx, ArgoCD, DataDog, MySQL
  • Participated in Kubernetes migration projects
  • Previous experience as a C++ or Node developer
  • BSC in Computer Science or related technical certifications

 

Fireblocks' mission is to enable every business to easily and securely access digital assets and cryptocurrencies. In order to do that, we strongly believe our workforce should be as diverse as our clients, and this is why we embrace diversity and inclusion in all its forms. 

Please see our candidate privacy policy here.

Create a Job Alert

Interested in building your career at Fireblocks? Get future opportunities sent straight to your email.

Apply for this job

*

indicates a required field

Phone
Resume/CV

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf