Back to jobs
New

Director of Production Engineering

Durham, NC

Toshiba Global Commerce Solutions is seeking a Director of Production Engineering to lead the reliability backbone of our global POS, cloud, and middleware platform. This strategic role owns system availability, resilience, performance, observability, and release reliability across a distributed, mission-critical commerce ecosystem. 
 
This leader will unify Site Reliability Engineering (SRE), Resilience & Performance Engineering, Observability, and AI-driven Reliability Automation into one cohesive function. As AI accelerates development velocity, verification and reliability become the core bottlenecks—making this role a cornerstone of our engineering organization. 
 
You will partner closely with Architecture, Cloud Operations, Functional Quality Engineering, and Software Development to ensure predictable reliability, smooth releases, and dramatically fewer Sev-1/Sev-2 incidents. 

Responsibilities 

System Reliability & Uptime: 

  • Define and enforce SLO/SLA frameworks, error budgets, and release criteria 
  • Lead availability, resilience, and performance strategy across all services. 
  • Own MTTR, MTBF, incident prevention, and rollback strategies at scale. 

Unified Reliability Engineering Organization: 

  • Lead teams across SRE & L3 Engineering, Resilience & Performance 
  • Engineering, Observability & Telemetry, AI Reliability Automation. 
  • Build a culture focused on prevention over firefighting. 

Architecture-Level Reliability: 

  • Collaborate with Principal Engineers and Architects to define system guardrails, resilience patterns, and failure modes. 
  • Ensure high-quality Production Readiness Reviews (PRRs) and architectural consistency. 

Resilience & Performance Engineering: 

  • Own chaos, failover, load, stress, and soak testing strategies. 
  • Validate store-mode behavior, payment workflows, edge-device dependencies, and multi-service interactions. 

Observability & Telemetry: 

  • Ensure complete, accurate signal for logs, traces, metrics, and business health. 
  • Partner with AI systems to build intelligent anomaly detection pipelines. 

AI-Driven Release Reliability: 

  • Integrate AI-based reliability scoring, resiliency prediction, automated gating, regression analysis, and incident pattern detection. 
  • Define the path toward autonomous release reliability pipelines. 

Cross-Org Leadership: 

  • Partner with Software Development, Functional Quality Engineering, Cloud Operations, Architecture, and TPM/TPO teams. 
  • Drive multi-team initiatives and ensure readiness across complex release trains. 

 

Required Experience: 

  • Bachelors Degree in Computer Science, Engineering or 10-15 years direct experience.
  • 10–15+ years in SRE, Reliability Engineering, Production Engineering, Distributed Systems, and Performance/Resilience Engineering
  • Proven ownership of uptime and system reliability in complex distributed architectures. 
  • Expertise in distributed systems, cloud platforms (AKS, Kubernetes), observability stacks (OpenTelemetry, Grafana, App Insights, Datadog), performance tuning, fault tolerance, network fundamentals, DB/service scaling, chaos testing
  • Architectural Leadership: Experience designing resilience patterns (timeouts, retries, hedging, circuit breakers). Strong partnership with architects and senior engineers. 
  • Operational Maturity: Led SRE/on-call organizations. Defined SLOs, SLIs, and error budgets at scale. Track record of driving incident prevention culture. 
  • Leadership & Communication: Builds strong engineering teams and hires top talent. 
  • Influential communicator with executives and cross-functional teams. Highly collaborative and low-ego. 

Preferred Requirements 

  • AI-driven anomaly detection, regression analysis, incident clustering, reliability scoring. 
  • Experience with retail POS, payments, edge devices, or store environments. 
    Hybrid cloud + edge architectures. 
  • Leading reliability transformations and scaling engineering organizations (200→500+). 

Why This Role Matters 

As AI accelerates development velocity, the bottleneck shifts from coding to verification, reliability, and release safety. This role ensures: 
- Uptime becomes engineered, not reactive. 
- Development and QA operate at AI-enabled speed. 
- Our platform grows safely while delivering stability and performance. 
- We match or surpass best-in-class tech organizations (Google, Amazon, Azure, Stripe). 
 
You will build the production engineering foundation that powers our next decade of innovation. 

 

Toshiba Global Commerce Solutions is a dynamic billion-dollar global company based in Research Triangle Park, NC, providing retail store solutions to your favorite brands. Have you ever been in a hurry and made use of the self-checkout at Lowe's Foods, earned fuel rewards at Kroger, or just paid for purchases at retailers such as Walmart, Michaels, Carrefour, The Gap, Calvin Klein, Boots, Cencosud, BJ's, or Costco? These are just a few examples of our in-store solutions and impressive customer base that made us the world's installed market share leader. 
 
The nature of retail is changing quickly, so if you share our 'Together Commerce' vision of a seamless two-way, participatory shopping experience, let's get together to drive the new economy. 
 
Toshiba Global Commerce Solutions, Inc. offers a competitive salary and generous benefits package including the following: 
 

  • Group health coverage (medical, dental, & vision) 
  • Employee Assistance Programs 
  • Pre-tax spending accounts 
  • 401(k) plan (with company match) 
  • Company provided life insurance 
  • Pet Insurance 
  • Employee discounts 
  • Generous paid holiday schedule, paid vacation & sick/personal days 

 
 
EEO: 
 
Toshiba Global Commerce Solutions is an equal opportunity/affirmative action employer that evaluates qualified applicants without regard to age, ancestry, color, religious creed, disability, marital status, medical condition, genetic information, military or veteran status, national origin, race, sex, gender, gender identity, gender expression and sexual orientation or any other protected factor. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. 
 
Individuals who need a reasonable accommodation because of a disability for any part of the employment process should email benefits@toshibagcs.com to request an accommodation 
 
DIVERSITY, EQUITY & INCLUSION: 
 
We at Toshiba Global Commerce Solutions firmly believe that our people are an integral part to the success of our customers. Furthermore, we're committed to Diversity, Equity, and Inclusion for all our people as highlighted by our 5 Core Principles (Create Outreach, Foster Belonging, Unleash Opportunity, Diverse Cultural Engagement and Culture of Transparency). We're passionate about our customers the retail industry and becoming a more responsible company as we help create a brighter future. 

 

 

Apply for this job

*

indicates a required field

Phone
Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf


Education

Select...
Select...
Select...
Select...
Select...

Select...
Select...
Select...
Select...
Select...
Select...
Select...

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in Toshiba Global Commerce Solutions - External’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

Select...
Select...
Race & Ethnicity Definitions

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Select...

Voluntary Self-Identification of Disability

Form CC-305
Page 1 of 1
OMB Control Number 1250-0005
Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury
Select...

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.