Site Reliability Engineer 3

Lviv, Ukraine

About Behavox

Behavox is a cloud-native AI company providing an integrated controls platform for global banks, asset managers, hedge funds, private equity firms, insurance businesses, and commodity firms. The platform unifies communications and trade surveillance, compliant archiving, policy management, and front-office analytics on a single, AI-native technology stack, delivered as a globally scalable SaaS-based cloud service.

At Behavox, our engineering culture is built around speed, experimentation, and technical excellence, following agile principles and rapid iteration. We constantly test and adopt the latest cloud technologies and AI tooling, optimizing for fast feedback loops and execution. We look for people who can move fast, challenge conventional wisdom, and who want to work at the frontier of modern AI, SaaS platforms, and distributed systems.

Behavox is a high-performance organization with a strong bias toward delivery, ownership, and responsibility. We commit, and we execute. We are building systems that are complex, mission-critical, and global in scale; systems that many consider too large or too difficult. 

To do that, we seek the smartest, most technically capable engineers and technologists who take end-to-end responsibility and want to win by building what others cannot.

Founded in 2014 and backed by SoftBank Vision Fund, Behavox is headquartered in London, with offices worldwide, including New York City, Montreal, Seattle, Singapore, and Tokyo.

 

About the Role

The Behavox Platform is a scalable, fault-tolerant and highly performant storage and processing system which allows us to manage and analyze massive volumes of data. We have an extensive and flexible set of APIs to develop products that allow our clients to work through millions of data items, by searching, filtering, and visualizing relationships between entities in the system.

As a Site Reliability Engineer, you will be responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of all production systems and services. You will work with other DevOps, Product, and Engineering teams to design and implement SRE practices at Behavox and to build the foundational infrastructure that supports the rapid growth of the Behavox client base.

This is an incredible opportunity to discover the world of high-load data processing and face the challenges of distributed Big Data systems. It will also provide you the opportunity to:

1. Work with high-load and business-critical services that will have a big impact on the company
2. Implement your ideas in an environment that strives for continuous improvement
3. Be part of a fast-growing, dynamic company and work with modern technologies

More information about the tools and solutions used at Behavox can be found on our engineering blog: https://blog.behavox.engineering

 

What You’ll Bring

  1. Linux mastery (5+ years). You understand how the kernel works, not just how to use it. You're comfortable with systemd, strace, system calls, inodes, iptables/netfilter, namespace isolation, cgroups, process management, and filesystem internals. You can debug a hanging process or network issue from first principles.
  2. Kubernetes in production (3+ years). Not just "I deployed a pod once." You've run production K8s clusters, debugged CNI issues, understood resource limits and QoS, troubleshot DNS problems, dealt with pod evictions, and know when to use StatefulSets vs Deployments vs DaemonSets.
  3. Production troubleshooting and incident leadership. You've been paged at 3 AM and fixed it. You've led incidents as a DRI or Incident Commander, not just participated as a responder. You know how to methodically isolate failures in distributed systems, write blameless postmortems, and improve systems based on lessons learned. You can read application logs, correlate metrics, check network connectivity, profile resource usage, and find root causes under time pressure.
  4. Python or Golang (hands-on, production experience). You've built real automation tools, not just scripts. You understand error handling, testing, logging, and writing maintainable code that other engineers will use and modify.
  5. Cloud platforms (GCP required, AWS is a plus). Real production experience with Google Cloud (Compute Engine, GKE, Cloud Storage, IAM, VPC networking) or AWS equivalents. You've designed cloud architecture, optimized costs, and debugged cloud-specific issues.

 

What You'll Do

  1. Be on-call and lead incident response. You'll carry the pager and act as Incident Commander or DRI during major outages. This means coordinating response teams, making decisive calls under pressure, running structured incident management (severity classification, communication, escalation, resolution, postmortems), and keeping stakeholders informed. You must know how to run an incident - not just fix technical problems.
  2. Deep troubleshooting. We believe in observability-first approaches with proper monitoring and metrics. But when observability doesn't give you the answer - when a Java service is leaking memory in Kubernetes, network packets are dropping mysteriously, or a production database is hitting inode limits - you need to be unafraid to go deeper. Grab strace, dive into kernel logs, check iptables rules, and analyze system calls. No handholding.
  3. Build real automation. Not bash one-liners. You'll write Python or Golang tools that solve complex operational problems - deployment automation, self-healing systems, capacity planning tools, incident response automation. Code that other engineers will depend on.
  4. Maintain high-load distributed systems. Our platform processes massive data volumes across GCP (primary) and AWS. You'll deploy, scale, monitor, and optimize these systems while meeting SLAs.
  5. Own the observability stack. Prometheus is your foundation. You'll design monitoring, write meaningful alerts (not alert spam), build dashboards that actually help during incidents, and implement quality control gates for AI services.

 

What We Offer & Expect

  1. The opportunity to work on a global, mission-critical AI platform alongside the best engineers and technologists across multiple geographies.
  2. A role with real ownership and impact, building complex systems at scale in an environment that values speed, experimentation, and technical excellence.
  3. A highly attractive benefits package, including competitive cash compensation, an equity award aligned with long-term value creation, and comprehensive health insurance for employees and their families.
  4. A modern, comfortable office in central Lviv, with an expectation of working from the office five (5) days per week, reflecting our belief in strong in-person collaboration, while remaining flexible to accommodate occasional personal circumstances that may require working from home.
  5. A generous time-off policy of 30 days annually, plus public holidays and sick leave, recognizing the importance of sustained high performance.

 

About Our Process

Our selection process is designed to rigorously assess a candidate’s depth of technical knowledge, problem-solving ability, and alignment with Behavox’s mission and core values.

As part of the process, candidates will first participate in a series of interviews focused on evaluating their technical expertise and engineering judgment. Candidates who successfully progress through these interviews will then be invited to complete a live technical exercise with a group of Behavox engineers and engineering managers. 

The purpose of this live technical assessment is to validate the candidate’s stated technical competencies and assess their ability to solve complex problems with speed, accuracy, and sound engineering judgment. Note that whenever possible, we aim to conduct interviews in person at our offices.

We recognize and respect the time candidates invest in this process. In return, Behavox commits significant time and resources to ensure that those who join us have the capability, judgment, and alignment required to operate at the speed and level of complexity our work demands. We value efficiency and clarity on both sides; if at any point we determine that a candidate is not a fit, we reserve the right to immediately conclude the interview or the technical assessment.

Please note the following:

  • A core objective of the process is to objectively assess individual knowledge and competencies. The use of AI tools or external assistance during live interviews or technical exercises is strictly prohibited (unless explicitly instructed otherwise) and will result in immediate disqualification.
  • Interviews and technical sessions may be recorded for internal review to support fairness, consistency, and collaborative decision-making within the hiring team.
