.png?1751287131)
Site Reliability Engineer - Gaming
The Wow
As a Site Reliability Engineer, your primary objective will be to ensure stability, reliability, and performance of the areas service across our main online casino product.
Our Product Development organisation is truly Global with cross functional teams spanning 6 Tech Hubs – Malta, Budapest, Stockholm, Tallinn, Kyiv and Athens. With nearly 600 strong professionals, the Product Development organisation is spear-headed by our CTO-CPO with all our talented Area Teams working together.
Key Responsibilities:
- Incident & Problem Management: Investigate system incidents, drive Root Cause Analysis (RCAs), and execute long-term remedial fixes. Proactively reduce the number of incidents caused by system changes.
- Observability & Metrics: Define and enforce Service Level Agreements (SLAs), Service Level Objectives (SLOs), and success metrics for new initiatives. Build and maintain comprehensive dashboards to achieve observability excellence.
- Performance & Capacity: Identify and help resolve performance bottlenecks.Optimize infrastructure and code to maintain fast service, and conduct capacity planning to forecast future hardware or cloud resource requirements.
- Availability & Change Management: Guarantee the Platform components remain highly reachable and functional for users. Oversee deployments to ensure new code does not disrupt the existing system.
What we are looking for:
Supporting and troubleshooting. Using most of the following technologies:
- Observability & Monitoring: Deep experience building dashboards and tracking SLAs/SLOs using tools like Prometheus, Grafana, Coralogix, Splunk, or Loki.
- Programming & Automation: Proficiency in scripting and coding to automate manual tasks (eliminate "toil") and build reliability tools. Strong skills in .NET, Python, Powershell or Bash are highly preferred.
- Infrastructure as Code (IaC) & Cloud: Experience provisioning and managing infrastructure using Terraform or Ansible, along with a solid understanding of cloud platforms (AWS, GCP, or Azure).
- Containerization & Orchestration: Hands-on experience scaling and managing distributed systems using Kubernetes (K8s) and Docker.
- CI/CD & Change Management: Familiarity with deployment pipelines (GitLab CI, GitHub Actions, Team City, Octopus) to ensure safe, automated rollouts that don't cause incidents.
- Core Competencies: Strong analytical skills for Root Cause Analysis (RCA), a calm approach to incident response, and the ability to lead blameless post-mortems.
Nice to have:
- AWS Cloud infrastructure, CDNs, and other various systems running in multiple data centres and environments
- Cloud Application Load Balancer, preferably with experience on AWS ALB
- Cloud DNS support such as AWS Route 53, GCP Cloud DNS, or Azure DNS
- Server virtualisation such as VMware, IaaS and PaaS cloud such as AWS and Azure
- Experience with Microsoft SQL databases, PostgreSQL, and Couchbase is considered an asset.
What we offer
Much like riding a rollercoaster, sometimes life at Betsson can be lightning fast with twists and turns but always FUN! Then again, what else would you expect from a business 75% millennial and 1700 strong, spread across 7 offices with 900 based out of our Malta HQ alone! We recognise it may not be for the faint-hearted, but if you’re a go-getter, initiator and adrenaline junkie, always striving to push the boundaries and challenge yourself, then you’ll fit right in.
Challenge accepted?
Click here to find out more on who we are looking for
By submitting your application, you understand that your personal data will be processed as set out in our Privacy Policy
Create a Job Alert
Interested in building your career at Betsson Group? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field