
Technical Product Manager II, Site Reliability Engineering
The mission of The New York Times is to seek the truth and help people understand the world. That means independent journalism is at the heart of all we do as a company. It’s why we have a world-renowned newsroom that sends journalists to report on the ground from nearly 160 countries. It’s why we focus deeply on how our readers will experience our journalism, from print to audio to a world-class digital and app destination. And it’s why our business strategy centers on making journalism so good that it’s worth paying for.
Mission Overview & Responsibilities:
At The New York Times, our Site Reliability Engineering (SRE) team is central to how we design, test, and operate the systems that support our most critical customer experiences. We're looking for a Technical Product Manager to lead the strategy for reliability programs and platforms that help teams ship resilient systems with confidence. These programs include operational readiness, load and chaos testing, observability, and incident readiness.
You'll partner with SRE, platform infrastructure, and product engineering teams to define the standards, tooling, and practices that improve operational readiness across hundreds of services. You will focus on building scalable reliability programs and experiences—not running cloud infrastructure or managing operational tickets.
You will be the product lead for a portfolio of SRE programs that includes:
-
Operational Readiness and Always Ready – reliability models, scorecards, production readiness reviews, and reliability signals that support operating reviews and critical customer journeys.
-
Load, Chaos & Disaster Recovery Testing – platforms and practices that validate how systems behave under high traffic, zonal or regional failures, and degraded conditions.
-
Observability & Incident Readiness – opinionated defaults for metrics, logs, traces, dashboards, alerts, and runbooks that improve detection, diagnosis, and on-call quality.
You'll translate technical and business signals—including incidents, SLO trends, test outcomes, and reliability scores—into clear roadmaps, measurable outcomes, and focused investments that improve reliability across the company.
About the Team
SRE at The Times is an enablement team: we improve on reliability through education, tooling, and hands‑on operational support, not by building everything ourselves.
-
We operate in a large, distributed company. We rely on clear, and constant communication, which includes crisp written updates, well-maintained tickets and docs, and proactive status sharing, so partners always know what's happening.
-
We expect SREs (and you) to take product‑level ownership of engagements and programs, keep work moving, and be comfortable operating independently.
-
We learn by doing: running load and chaos tests, iterating on dashboards and scorecards, and using outcomes to refine our approach rather than over‑planning up front.
-
We treat engagements as collaborative problem‑solving, meeting teams where they are, respecting their ownership, and leaving them more capable than when we arrived.
-
We care about metrics-driven reliability. These signals include SLOs, reliability scores, test results, MTTR/MTTD. We use them to guide decisions, measure the impact, and feed programs like Always Ready.
We believe reliability is a shared responsibility, and we create clear, supportive pathways for teams to improve their services without having to become experts in every SRE domain.
This is hybrid role based in New York City, NY.
Responsibilities:
-
Build and communicate the product roadmap for SRE‑led reliability programs (operational maturity, Always On signals, load/chaos testing, observability), aligning them with newsroom and product priorities.
-
Turn ambiguous reliability problems into concrete products and engagements by doing discovery with engineers and leaders, defining scope and success metrics, and treating SRE collaborations as internal consulting engagements with clear contracts and exit criteria.
-
Shape the direction of testing and observability platforms by prioritizing high-value scenarios, such as BNAs, elections, and major launches. Ensure that load and chaos tests map to customer journeys, and tie the results directly to SLOs, dashboards, and runbooks.
-
Use data to guide iteration and storytelling. Use incident metrics, test outcomes, and reliability scores to refine roadmaps and report on impact.
-
Experience communicating complex technical concepts to a variety of audiences, including SRE, partner teams, and senior leadership, about tradeoffs and progress.
-
Demonstrate support and understanding of our value of journalistic independence and a strong commitment to our mission to seek the truth and help people understand the world.
-
You will report to the Vice President of Product, Developer Experience.
Basic Qualifications:
-
5+ years of product management experience in platform, infrastructure, SRE, or other technical domains, with experience building roadmaps for multi‑team, multi‑system products or programs.
-
Working knowledge of SRE practices and operational readiness (SLIs/SLOs, error budgets, incident response and review, production readiness, on‑call quality) and how they show up in services.
-
Experience with testing and observability in cloud‑native environments - for example, shaping or supporting load / performance / chaos testing and working with metrics, logs, traces, dashboards, and alerts.
-
Translate systems data (reliability, performance) to prioritize and evaluate work, and to explain complex technical concepts and tradeoffs to both engineers and non‑technical partners.
Preferred Qualifications:
-
Experience working with SRE, Platform, or Infrastructure teams, especially in enablement or engagement models (embeds, consulting‑style projects, or Always Ready‑like programs).
-
Experience defining or scaling operational readiness frameworks and production readiness reviews, and driving adoption of standards across multiple teams.
-
Experience operating observability platforms at scale - we use Datadog, but experience with any enterprise observability tool will do.
-
Experience with cloud providers and Kubernetes (AWS, GCP, or Azure) to understand how reliability and observability tradeoffs show up in practice, and to hold informed conversations with engineers operating those systems.
REQ-020080
#LI-Hybrid
The annual base pay range for this role is between:
$120,000 - $142,000 USD
For roles in the U.S., dependent on your role, you may be eligible for variable pay, such as an annual bonus and restricted stock. Benefits may include medical, dental and vision benefits, Flexible Spending Accounts (F.S.A.s), a company-matching 401(k) plan, paid vacation, paid sick days, paid parental leave, tuition reimbursement and professional development programs.
For roles outside of the U.S., information on benefits will be provided during the interview process.
The New York Times Company is committed to being the world’s best source of independent, reliable and quality journalism. To do so, we embrace a diverse workforce that has a broad range of backgrounds and experiences across our ranks, at all levels of the organization. We encourage people from all backgrounds to apply.
We are an Equal Opportunity Employer and do not discriminate on the basis of an individual's sex, age, race, color, creed, national origin, alienage, religion, marital status, pregnancy, sexual orientation or affectional preference, gender identity and expression, disability, genetic trait or predisposition, carrier status, citizenship, veteran or military status and other personal characteristics protected by law. All applications will receive consideration for employment without regard to legally protected characteristics. The U.S. Equal Employment Opportunity Commission (EEOC)’s Know Your Rights Poster is available here.
The New York Times Company will provide reasonable accommodations as required by applicable federal, state, and/or local laws. Individuals seeking an accommodation for the application or interview process should email reasonable.accommodations@nytimes.com. Emails sent for unrelated issues, such as following up on an application, will not receive a response.
The Company encourages those with criminal histories to apply, and will consider their applications in a manner consistent with applicable "Fair Chance" laws, including but not limited to the NYC Fair Chance Act, the Los Angeles Fair Chance Initiative for Hiring Ordinance, the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act.
For information about The New York Times' privacy practices for job applicants click here.
Please beware of fraudulent job postings. Scammers may post fraudulent job opportunities, and they may even make fraudulent employment offers. This is done by bad actors to collect personal information and money from victims. All legitimate job opportunities from The New York Times will be accessible through The New York Times careers site. The New York Times will not ask job applicants for financial information or for payment, and will not refer you to a third party to do so. You should never send money to anyone who suggests they can provide employment with The New York Times.
If you see a fake or fraudulent job posting, or if you suspect you have received a fraudulent offer, you can report it to The New York Times at NYTapplicants@nytimes.com. You can also file a report with the Federal Trade Commission or your state attorney general.
Apply for this job
*
indicates a required field