Senior Site Reliability Engineer

Remote - United States

WHO WE ARE 

Zeta Global (NYSE: ZETA) is the AI-Powered Marketing Cloud that leverages advanced artificial intelligence (AI) and trillions of consumer signals to make it easier for marketers to acquire, grow, and retain customers more efficiently. Through the Zeta Marketing Platform (ZMP), our vision is to make sophisticated marketing simple by unifying identity, intelligence, and omnichannel activation into a single platform – powered by one of the industry’s largest proprietary databases and AI. Our enterprise customers across multiple verticals are empowered to personalize experiences with consumers at an individual level across every channel, delivering better results for marketing programs. Zeta was founded in 2007 by David A. Steinberg and John Sculley and is headquartered in New York City with offices around the world. To learn more, go to www.zetaglobal.com.

The Role 
 
We’re looking for an experienced Senior Site Reliability Engineer (SRE) who can write production-grade code, has mastered SLIs, SLOs, and error budgets, and is passionate about building scalable observability systems.

If you: 

  • Can code confidently in Python or Golang and solve real-world problems through automation (not only scripting).
  • Have hands-on experience implementing SLIs, SLOs, and distributed tracing in production.
  • Understand Kubernetes, Terraform, and Infrastructure as Code tools.
  • Have hands-on experience with Chaos Engineering and anomaly detection.
  • Are excited about working with high-throughput, distributed systems processing millions of transactions daily…

Then this role might be for you! 

Key Responsibilities: 

  • Design, implement, and manage SLOs, SLIs, and error budgets, ensuring reliability aligns with user expectations and business objectives.
  • Develop production-grade software to enhance system reliability and reduce manual toil through automation.
  • Implement and optimize observability solutions using tools like OpenTelemetry, with a focus on high-cardinality metrics, distributed tracing, and actionable insights.
  • Drive postmortem processes and lead in-depth root cause analyses for incidents, ensuring lessons learned are effectively applied to prevent recurrence.
  • Define and monitor MTTx metrics (MTTA, MTTR, MTTF), using them to guide system improvements and measure reliability progress.
  • Design and participate in Chaos Engineering exercises.
  • Collaborate with engineering teams to design systems with reliability and scalability in mind, incorporating capacity planning, resiliency patterns, and modern deployment strategies (e.g., Canary, Blue-Green).
  • Lead design reviews for alerting strategies, ensuring effective signal-to-noise ratios in monitoring and incident management.
  • Advocate for and implement best practices in incident response and system design to achieve optimal uptime and performance.

Your experience: 

Strong Coding Background: 

  • 4+ years of experience as an SRE or in a similar role with hands-on coding.
  • 3+ years of software development experience in Python or Golang, with a focus on building maintainable, production-quality code.

SRE Expertise: 

  • Deep understanding of SRE principles, particularly SLIs, SLOs, error budgets, and their real-world application.
  • Hands-on experience conducting postmortems and implementing observability at scale.
  • Hands-on experience conducting chaos engineering exercises.

Observability Skills: 

  • Expertise in designing and implementing end-to-end observability solutions using tools like OpenTelemetry, Prometheus, Grafana, or Honeycomb.
  • Experience with distributed tracing and handling high-cardinality metrics in production environments.

Infrastructure Knowledge: 

  • 3+ years of experience with AWS and proficiency in Kubernetes, Terraform, and Infrastructure as Code (IaC) tools.
  • Strong understanding of distributed systems, microservices architectures, and containerization (Docker, Kubernetes).

Monitoring and Automation: 

  • Hands-on experience with CI/CD tools and practices (Jenkins, ArgoCD, GitOps) and building automated pipelines.
  • Familiarity with tools and frameworks for incident management and operational automation.

Additional Skills: 

  • Knowledge of modern deployment strategies (e.g., Canary, Blue-Green) and resiliency patterns (e.g., circuit breakers, retries).
  • Strong analytical skills for statistical analysis of metrics to identify and resolve performance bottlenecks.

BENEFITS & PERKS 

  • Unlimited PTO 
  • Excellent medical, dental, and vision coverage 
  • Employee Equity and Stock Purchase Plan 
  • Employee discounts, virtual wellness classes, pet insurance, and more!

 
COMPENSATION RANGE 

The compensation range for this role is $140,000.00 - $170,000.00, depending on location and experience. 

 

PEOPLE & CULTURE AT ZETA 

Zeta considers applicants for employment without regard to, and does not discriminate on the basis of an individual’s sex, race, color, religion, age, disability, status as a veteran, or national or ethnic origin; nor does Zeta discriminate on the basis of sexual orientation, gender identity or expression. 

We’re committed to building a workplace culture of trust and belonging, so everyone feels invited to bring their whole selves to work. We provide a forum for employees to celebrate, support and advocate for one another. Learn more about our commitment to diversity, equity and inclusion here: https://zetaglobal.com/blog/a-look-into-zetas-ergs/ 

 

ZETA IN THE NEWS! 

https://zetaglobal.com/press/?cat=press-release 



Voluntary Demographic Questions

Zeta collects this Voluntary Demographic Data only with your consent, for the sole purpose of tracking and improving the diversity of our applicant pool. Any information you choose to provide will not be considered for employment purposes, will not be associated with your employment at Zeta if you are offered a position, and is not used to make hiring or employment decisions.  This data will be maintained unless you withdraw your consent. You may withdraw consent for this data to be maintained by Zeta at any time by contacting your Recruiter. If and when you are employed by Zeta, you may withdraw consent by contacting your HR Partner. If and when you onboard with Zeta, you will also be asked to complete an EEOC questionnaire that collects additional demographic data in order for Zeta to meet its legal requirements.

We are committed to building diverse teams with different identities, backgrounds and perspectives. We believe in providing a forum to connect at Zeta, to learn and celebrate differences. Our mission is to ensure we have an environment that enables a deep level of trust and belonging, so everyone feels invited to bring their whole selves to work, and to increase both diversity at Zeta as well as in the technology industry.  

Zeta considers applicants for employment without regard to, and does not discriminate on the basis of an individual’s sex, race, color, religion, age, disability, status as a veteran, or national or ethnic origin, or any other basis protected by applicable federal, state or local law; nor does Zeta discriminate on the basis of sexual orientation or gender identity or expression.  


Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in Zeta Global’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.


If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.


Voluntary Self-Identification of Disability

Form CC-305
Page 1 of 1
OMB Control Number 1250-0005
Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.