Job Application for Network Operations Center (NOC) Operator at Lightning AI

Who We Are

Lightning AI is the company behind PyTorch Lightning. Founded in 2019, we build an end-to-end platform for developing, training, and deploying AI systems—designed to take ideas from research to production with less friction.

Through our merger with Voltage Park, a neocloud and AI Factory, Lightning AI combines developer-first software with cost-efficient, large-scale compute. Teams get the tools they need for experimentation, training, and production inference, with security, observability, and control built in.

We serve solo researchers, startups, and large enterprises. Lightning AI operates globally with offices in New York City, San Francisco, Seattle, and London, and is backed by Coatue, Index Ventures, Bain Capital Ventures, and Firstminute.

The Way We Work

The people who thrive here are builders who move fast, communicate openly, take ownership, and continuously improve themselves, their teams, and our company. Here's what that looks like in practice:

Move with Urgency: We move quickly, make thoughtful decisions, and keep momentum. We value action over perfection and learn by shipping.
Take Ownership: We own outcomes, not just our individual work. We make decisions that move the company forward and follow through.
Communicate Openly: We communicate directly, seek to understand, and create clarity for others. Honest conversations help us move faster together.
Build Great Teams: We lead by example, empower others, and create healthy teams where people can do their best work.
Raise the Bar: We're always improving ourselves. We learn from feedback, consistently challenge ourselves to grow, and focus on the work that matters most.
Think Long-Term: We design for what's next. We create scalable systems, simplify complexity, and use AI and automation to amplify our impact.

What We're Looking For

Lightning AI is seeking Network Operation Center (NOC) Operators to support 24/7 operations across select high-performance compute data centers with advanced monitoring infrastructure. This is an entry-level role focused on monitoring, alert triage, and escalation.

In this role, you will serve as the first line of response, ensuring that system alerts are acknowledged, validated, and routed to the appropriate teams. You will follow structured runbooks and workflows to support reliable operations across our infrastructure.

This is a excting opportunity to start a career in data center operations, networking, or infrastructure while gaining exposure to advanced AI systems and large-scale GPU infrastructure. In this role, you will:

Learn the fundamentals of data center operations, networking, and system reliability
Gain exposure to large-scale AI and HPC environments
Assist with hands-on data center support activities under guidance

Potential growth paths include:

Data Center Technician
Network Operations Engineer
Site Reliability Engineer (SRE)

This role is based onsite at one of our data center facilities in Lisle, IL; Fort Worth, TX; or Quincy, WA. Shift flexibility is required to support our 24/7 operations environment. We are not able to provide visa sponsorship for this position at this time.

What You'll Do

Monitoring & Alert Response

Monitor data center systems using dashboards and alerting tools
Acknowledge and triage alerts across compute, network, and hardware systems
Identify and filter out false positives using predefined guidelines

Triage & Escalation

Follow structured runbooks to perform initial validation steps (e.g., running diagnostic scripts or checking system status)
Escalate issues to the appropriate teams (hardware, network, SRE) based on clear escalation criteria
Notify relevant stakeholders during active incidents

Ticketing & Coordination

Create, update, and manage tickets with accurate and timely information
Attach relevant logs, diagnostics, and observations to support faster resolution
Track incidents to ensure proper ownership and handoff

Operational Awareness

Identify recurring alerts or patterns and report them to improve monitoring and reliability
Maintain awareness of ongoing incidents and system status during your shift

What You'll Need

Required Qualifications

Basic familiarity with computers, Linux systems, or IT environments (academic or personal experience is acceptable)
Ability to follow structured procedures and runbooks with attention to detail
Strong communication skills and ability to clearly document issues
Ability to prioritize tasks and respond quickly in a fast-paced, 24/7 environment
Willingness to learn and grow in a technical operations role

Ideal Experience

Exposure to monitoring tools (Grafana, Datadog, etc.)
Basic understanding of networking or server hardware concepts
Interest in data centers, cloud infrastructure, or AI systems
Strong sense of ownership and urgency, with the confidence to escalate issues to the appropriate teams or levels to ensure timely resolution

Compensation

We are committed to offering competitive compensation that reflects the value each team member brings to our mission. Final offers are based on factors such as experience, skills, geographic location, and role expectations. In addition to base salary, our total rewards package for eligible roles includes a discretionary bonus, a meaningful equity component, and comprehensive benefits.

The anticipated annual base salary range for this role is:

$75,000 - $85,000 USD

Benefits and Perks

We offer a comprehensive and competitive benefits package designed to support our employees’ health, well-being, and long-term success:

Comprehensive Health Coverage: Medical, dental, and vision coverage for employees and eligible dependents.
Meaningful Equity: RSUs that give employees a stake in the company's long-term success.
Retirement Savings: 401(k) matching (U.S.) and pension contributions (U.K.).
Flexible Time Off: Unlimited PTO, company holidays, and floating holidays to support work-life balance.
Company-Wide Winter Break: Two weeks of company closure each winter to disconnect and recharge.
Paid Parental & Family Leave: Paid leave to support you and your family through life's important moments.
Professional Development: Annual learning and development allowance to support your professional growth.
Wellness Benefits: Wellness and work-from-home stipends to support your physical and mental well-being.
Sabbatical Program: Four weeks of paid sabbatical leave after four years of service.
Flexible Work: Flexible schedules and a hybrid work model for our office-based teams.
In-Office Meals: Complimentary meals at our office hubs.

Benefits may vary by location, team, and role.

At Lightning AI, we are committed to fostering an inclusive and diverse workplace. We believe that diverse teams drive innovation and create better products. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, age, disability, veteran status, or any other protected characteristic. We are dedicated to building a culture where everyone can thrive and contribute to their fullest potential.

Create a Job Alert

Interested in building your career at Lightning AI? Get future opportunities sent straight to your email.

First Name

Last Name

Preferred First Name

Country

Phone

Location (City)

Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf

LinkedIn/Website

This role will work on-site from one of the following of our U.S. Data Center locations. Please select all locations you are able to work from (without relocation assistance). *

Lisle, IL

Fort Worth, TX

Quincy, WA

Please describe your availability for a 24/7 operations environment, including preferred shifts, days available, and any scheduling constraints.

Are you legally authorized to work in the United States?

Select...

Will you now or in the future require employment visa sponsorship?

Select...

Applicants currently on a temporary work authorization (e.g., student visa, OPT, or similar) should generally select “yes.”

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in Lightning AI’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

Gender

Select...

Are you Hispanic/Latino?

Select...

Race & Ethnicity Definitions

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Veteran Status

Select...

Voluntary Self-Identification of Disability

Form CC-305

Page 1 of 1

OMB Control Number 1250-0005

Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

Alcohol or other substance use disorder (not currently using drugs illegally)
Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
Blind or low vision
Cancer (past or present)
Cardiovascular or heart disease
Celiac disease
Cerebral palsy
Deaf or serious difficulty hearing
Diabetes
Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
Epilepsy or other seizure disorder
Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
Intellectual or developmental disability
Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
Missing limbs or partially missing limbs
Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
Partial or complete paralysis (any cause)
Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
Short stature (dwarfism)
Traumatic brain injury

Disability Status

Select...

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.

Network Operations Center (NOC) Operator

Who We Are

The Way We Work

What We're Looking For

What You'll Do

Monitoring & Alert Response

Triage & Escalation

Ticketing & Coordination

Operational Awareness

What You'll Need

Required Qualifications

Ideal Experience

Compensation

Benefits and Perks

Apply for this job

Voluntary Self-Identification

Voluntary Self-Identification of Disability