
Site Reliability Engineer

Tickets.com, an MLB company, delivers innovative, cutting-edge technologies to enable frictionless and unforgettable fan experiences in venues across the globe. Together with MLB, Tickets.com is changing the landscape of the live sports and entertainment industry, delivering new digital venue and ticketing experiences to millions of fans. Our Technology team builds platforms and products that provide a new smart ticketing solution and venue experience. Using cutting-edge technology, our platform and applications are consumed by fans, stadiums, and MLB teams.

 

We are assembling a world-class team to build on these experiences and to scale platforms and products that anticipate emerging opportunities, including dynamic pricing and offers and digital, contactless ticketing. Our mission is to provide premium, innovative live experiences for our clients and their patrons.

 

Tickets.com is looking for a Site Reliability Engineer passionate about building engaging products for our fans. 

 

The Opportunity: The Site Reliability Engineer will join the Infrastructure Engineering team at Tickets.com, work alongside MLB team members, and help drive adoption of best practices across the following areas:

  • Uptime, high availability, and disaster recovery
  • Incident response
  • Identifying SLIs and defining SLOs
  • Observability tooling
  • Debugging running systems and providing tools to assist runtime debugging
  • Optimizations for cost control

Essential Job Functions:

  • Work both independently with little supervision and in a team environment
  • Prioritize unblocking your teammates, collaboration and knowledge sharing
  • Collaborate with teams to ensure the availability, security, and integrity of services
  • Help define and configure relevant system, application, and database metrics to ensure observability
  • Create and maintain dashboards and reports to visualize system and database performance and health
  • Create monitoring and alerting to detect error conditions, degradation symptoms, and outages
  • Help identify automation and self-service opportunities for infrastructure and database operational tasks to enhance reliability and efficiency and to reduce manual toil
  • Support and debug production issues across services and all levels of the stack
  • Engage in driving improvements to our incident response and participate in on-call rotations
  • Continuously identify opportunities for process improvement

 

Requirements:

  • Minimum of a bachelor’s degree in computer science, MIS, or a related field, and five (5) years of relevant experience including software or reliability engineering, or a combination of education, training, and experience
  • Strong communication skills and the ability to convey technical information about cloud, container workloads, DevOps, and SRE principles to all levels of the organization
  • Demonstrable experience in automation, alerting, and remediation with a passion for reducing toil
    • Have written code in a compiled language that runs in production somewhere
    • Have written code in interpreted languages
  • Experience with cloud services (e.g., AWS, Google Cloud Platform)
  • Experience with DevOps practices and tools (e.g., Terraform, Git, CI/CD)
  • Experience with real-time log/event monitoring tools (e.g., DataDog, Cloud Logging, Splunk)
  • Experience working in an environment running mission critical, transactional, and analytic datastores and pipelines (e.g., Oracle, Postgres, Mongo, BigQuery, Kafka, Airflow)
  • Experience in Linux OS and scripting languages
  • Understanding of networking and connectivity in the context of distributed systems
  • Excellent problem-solving and troubleshooting skills
  • Ability to work non-standard shifts including nights and/or weekend on-call responsibilities
  • Dedicated to continuous improvement of yourself and our SRE capabilities

 

Key Technical Traits

 

  • Experience with APIs and microservices: REST, Web, GraphQL
  • Database Solutions: Oracle, MySQL, MSSQL, Cloud SQL, NoSQL
  • Cloud Providers: Google Cloud Platform, Oracle Cloud Infrastructure, AWS
  • Real-time log/event monitoring: DataDog, Google Cloud Logging, Splunk
  • Programming Languages: Go, Python, Bash, Java, JavaScript
  • Scripting: PL/SQL, Shell
  • Software Development tools: Jira, Git, ArgoCD, Terraform
  • Compliance: PCI DSS, SSAE18/SOC 1

Salary range is $140K-$160K

 

We offer an Outstanding Benefits Package that includes:

  • Medical
  • Dental
  • Vision
  • STD & LTD
  • 401K Retirement Plan
  • Basic Life & AD&D
  • Supplemental Life Insurance
  • Paid Time Off (PTO, STO, Holidays including Year-End Holiday Break)
  • HSA 
  • Pet Insurance
  • Tuition Reimbursement
  • Flexible Hybrid Work Environment
  • MLB Tickets

Tickets.com is an Equal Opportunity Employer.
