
Site Reliability Engineer

Remote

Company Overview:

Lightspeed is a leading provider of cloud-based software for dealerships and Original Equipment Manufacturers (OEMs), serving the Powersport, Marine, RV, Trailer, Outdoor Power Equipment, and Golf Cart industries. Lightspeed’s Dealer Management Solution (DMS) enables dealerships to optimize their end-to-end business operations, including sales, parts, service, rentals, accounting, and Customer Relationship Management (CRM). When implemented into their daily operations, Lightspeed helps dealers increase their profitability by selling more units, service, and parts, all while creating a more streamlined experience for customers. For nearly 40 years, Lightspeed has been empowering 4,500+ dealers across North America with the tools and technology they need to manage their dealerships.

This position is pivotal to supporting our DMS application running on EKS, PostgreSQL, and other AWS services, along with the configuration and administration of Linux and Windows servers and other open-source technologies. You will perform day-to-day operational monitoring of over 3,500 systems, from performance metrics to alerts on critical infrastructure. You will be responsible for system management and for creating scripts or writing programs to automate maintenance and management tasks. Other responsibilities include system configuration, troubleshooting, security, and supporting multiple teams, from customer support to development and QA, while increasing the productivity of the team.

What you'll do:

  • Implement systems that are highly available, scalable, and self-healing.
  • Work closely with Application Dev. & Operations teams to provide fully automated deployment routines for Production (CI/CD).
  • Monitor system activity and tune system parameters for optimal performance, configure communications with other platforms/networks, configure and manage system security, and maintain current release levels and patch revisions.
  • Work across functional (development, testing, deployment, systems/infrastructure) and project teams to ensure continuous operation of all environments.
  • Manage and maintain tools to automate operational processes.
  • Work to continuously improve speed, efficiency and scalability of our systems and environments.
  • Work directly with agile Application Development teams to provide daily support aligned with a model of Continuous Delivery.
  • Build and maintain appropriate log gathering, system monitoring, and reporting infrastructures.
  • Operate in a supporting role for the implementation and customer support groups, along with QA and Dev. This requires 24/7 availability at times to load software updates and projects, and to work during maintenance windows and in emergencies.

What you'll have:

Qualifications:

  • 4+ years in a Cloud/SRE/DevOps/System Administrator role or equivalent experience.
  • 4+ years of experience with containerization/orchestration technologies like Kubernetes, Docker, AWS EKS, AWS NLB/ALB, GCP GKE, etc. Must be able to configure and support Docker container deployments.
  • 4+ years of experience with Cloud concepts such as VPC, Subnets, IAM, Security Groups, and S3, or equivalent experience.
  • 6+ years of Linux and Windows administrator experience.
  • Scripting ability (Bash/Shell, Python, JavaScript).
  • Must have an understanding of building and managing large-scale systems and application architectures.
  • Solid understanding of system performance and monitoring.
  • Excellent project management skills and the ability to work in a fast-paced work environment.
  • Must also have experience working in an Agile development environment that requires a lot of communication and collaboration.
  • Demonstrate skills in priority setting, analysis, communication, time management, scheduling, and multitasking.
  • Experience with config/provisioning tools like Terraform, CloudFormation, Cloud Init, or Salt/Chef/Puppet/Ansible in production environments with many nodes.
  • Work well in a highly collaborative team environment.
  • Familiar with Infrastructure as Code methodologies and tools.

Preferred Qualifications:

  • Experience managing Enterprise production systems in at least one public cloud: AWS, GCP, Azure; AWS is preferred.
  • Experience managing Kubernetes in a large Enterprise production environment.
  • Communication Skills: The candidate will have exceptional communication skills (verbal, written, and presentation) as well as excellent interpersonal skills including the ability to work and communicate with individuals at all levels in the organization, and matrixed team members.
  • Good working knowledge of build automation and continuous integration/delivery processes and tools: GitLab, Jenkins.
  • Experience with messaging technologies such as Amazon MQ, RabbitMQ, Kafka, etc.
  • Experience with monitoring solutions: Zabbix, Nagios, CloudWatch, Alertmanager, Prometheus, Grafana, Dynatrace, New Relic, or equivalent.
  • Experience with various data technologies including relational and nonrelational databases.
  • PostgreSQL database knowledge preferred.
  • Experience supporting Enterprise WildFly/JBoss application servers and Java.
  • Experience with VMware/vSphere virtualization or the private/public cloud is a plus.
  • Experience with incident management and root cause analysis as part of postmortems.
  • Familiar with the AWS Well-Architected Framework and its six pillars.
  • Experience with a DevOps approach of managing infrastructure.

 

In today’s competitive job market, transparency and trust are more important than ever. At Lightspeed, we believe in fostering an open and honest work environment, starting with our job postings. Pay transparency is a key component of this commitment, ensuring that potential candidates have a clear understanding of the compensation they can expect.

Remote

$110,000 - $130,000 USD

 

Inclusion and Diversity at Lightspeed:

At Lightspeed, we celebrate the uniqueness of every individual and encourage diverse perspectives. We believe that inclusion drives innovation and fosters meaningful connections. We are committed to building an environment where everyone feels valued and empowered to make an impact.

Equal Employment Opportunity Statement:

Lightspeed is an Equal Opportunity Employer and is dedicated to building a diverse and inclusive workforce. All qualified applicants will be considered for employment without regard to race, color, creed, ancestry, national origin, gender, sexual orientation, gender identity, gender expression, marital status, religion, age, disability, veteran status, or any other protected category.

Important Note:

Applicants must be authorized to work in the U.S.

Ready to apply?

Take the next step in your career—apply today and join a team where your skills will make an impact!

 
