Senior DevOps Infrastructure Engineer
NMI is seeking a Senior DevOps Engineer with deep Linux, virtualization, and hardware experience who is passionate about running applications in an exceedingly high-availability environment within our SRE organization. You will work with similarly skilled professionals in a rapidly growing environment, with opportunities to level up your observability and automation skills while maintaining a mission-critical, 4-nines availability platform and participating in environment modernization.
The SRE team is responsible for the operation of all hardware and software within the production and SDLC environments. These environments span a global network connecting numerous sites, which must be highly available 24x7 with a minimum target of 99.99% availability. As a Senior DevOps Infrastructure Engineer, the successful applicant will be a core member of the SRE team with the opportunity to work with experts in the infrastructure, networking, and DevOps space.
The Ideal Candidate:
- Will have a track record of implementing low-toil solutions to traditionally high-touch operational or administrative tasks.
- Has a deep technical background and can engage with engineers on the nuances of complex systems, while also being able to zoom out and see the bigger picture.
- Has a high level of competency implementing hardware projects in data center environments (server & storage installation, troubleshooting, decommissioning).
- Enjoys being challenged to find creative solutions using both legacy and cutting-edge technology. That is code for: we have a legacy system that has to be maintained and improved, while we also look at new technology and tools to improve resiliency, performance, ease of administration, and observability. It’s not all “the fun stuff”.
- Wants to work with a globally distributed team of similarly skilled professionals, and is comfortable building relationships with teammates up to thousands of miles away.
- Is as comfortable in a shell or Vim as an accountant is in QuickBooks.
- Refuses to believe a service or appliance is production ready until they have the metrics and alerts to prove it.
Key duties:
- Administration - Participate in maintenance and operations of our production environment, including patching, deployment, server administration, and troubleshooting, either manually or using configuration-as-code tooling.
- Reliability & Performance - Ensure reliability, availability, and performance of services. Respond to incidents and resolve them before they become customer-impacting.
- Projects - Deliver complex solutions that traverse all layers of the technology stack: Operating System, Virtualisation, Network, Storage & Cloud.
- Data Centre - Participate in and coordinate on-site deployments of critical hardware, including servers and storage.
- Collaboration - Work closely with teammates and with the software and security teams to rapidly meet customer, business, and compliance needs.
- Automation - Drive the automation of operational tasks, and ensure our infrastructure is more like cattle than pets.
- Observability - Develop and maintain internal, commercial, or OSS tools to improve system health, performance, and deployment.
- Continuous Improvement - Drive never-ending improvement in SRE processes, tools, and methodologies. Take a leading role in blameless post-mortems to avoid repeat issues or mistakes and clearly document all lessons learned for others. If you love writing actionable documentation, we’d love to set up an interview.
- On-Call - Participate in a rotating 24x7 on-call schedule with your team to ensure availability of services across the production environment.
This is a fully remote role (work anywhere in the UK); however, if you live within a reasonable commutable distance, we’d love to see you in our Bristol office from time to time!
Periodic travel (typically 1-4 times a year) will be required to company colocation facilities, at company expense.
Essential Skills & Experience:
- 5+ years of experience in Site Reliability Engineering, DevOps, System Administration, or similar roles.
- Deep experience working in colocation facilities – we have a hybrid footprint, and if you have only worked in the public cloud space, this role is not a great fit for you.
- Experience using Puppet, Ansible, or other common configuration-as-code tooling to deploy and configure systems.
- Strong familiarity with Linux systems (any distro is fine, but we have a preference for RHEL downstreams).
- Experience using Proxmox, VMware, or KVM as virtualization platforms for large-scale production environments.
- Experience administering enterprise-grade SANs and load balancers is necessary to be successful in this role.
- Demonstrated proficiency in one or more scripting or programming languages (e.g., Python, Go, Bash/ZSH).
- Multiple years of experience proactively implementing and responding to infrastructure, application, and network alerts using industry-standard or homebrew toolchains.
- Strong problem-solving skills and experience working in extremely high-availability production environments (99.95% or greater) with high performance requirements.
Preferred Skills and Experience:
- Experience with F5 BIG-IP LTMs or NetApp SANs is highly desirable.
- Experience using Grafana, Prometheus, and the ELK stack for observability is highly desirable.
- Experience with MySQL (any engine variant) will be extremely helpful in this role.
- Kubernetes experience is a significant plus. Alternatively, a burning desire to learn it.
- Experience working with SaaS-based WAF/DDoS protection services such as Silverline, Cloudflare, or Akamai is preferred.
- Prior experience on a team following common agile processes such as Kanban or Scrum would be valuable.
- Experience in the start-up to scale-up space will be very valuable. We are not a calcified, enormous enterprise, and move quickly.
- GitLab experience is a plus.
As well as being a part of something exciting every day, you will also receive the following benefits:
- Annual bonus scheme dependent on individual and company performance
- Annual salary of £50,000 - £70,000
- 25 days holiday each year, plus bank holidays, plus 1 extra day after each year of service (up to a maximum of 30 days)
- Workplace pension scheme
- Private medical insurance (upon 30 days of employment)
- 7 hours per day, 35 hours per week
- A remote first culture
- Great work-life balance with our Flexi-time policy
- Family Friendly policies (Enhanced Maternity and Paternity Pay and Shared Parental Leave).
- A chance to develop with an allocated company training budget
- Bike2Work Scheme
- Lifeworks, an Employee Assistance Programme which offers wellbeing, family and financial support services, such as assessments, resources and even 1:1 counselling sessions. It also offers interesting perks such as discounts on gyms, restaurants, high street retailers and cinema tickets
- A strong commitment to employee wellbeing including mental health first aiders
- Employee referral scheme with generous financial reward
- Bonusly colleague reward scheme
We’re looking for creative and passionate people who share our vision of making payments easy. If that sounds like you and you meet the requirements above, then please click on 'Apply for this job'!
We are an Equal Opportunities employer and will provide reasonable support throughout the recruitment process to applicants who have a disability. Please let us know in advance so that any support, aids or adaptations can be put in place to assist you.
Please be aware that all offers of employment are made subject to receipt of satisfactory background and financial checks.
About us
NMI gives our partners choice and challenges the one-size-fits-all approach to payments. You've probably used NMI in the last 24 hours without even realising it. We’re the platform that powers success for innovative tech created by SMBs, entrepreneurs and fintech start-ups. We’re creative problem solvers who help visionaries smash through boundaries and think beyond what’s possible so they can think about what’s next. But we’re not just built for the tech savvy. We democratise the latest payments technology so that everyone can realise the benefits of easy payments across the full spectrum of commerce. We’re all about enabling more payments in more ways and more places.
Please note that in compliance with the data protection regulations within your jurisdiction, any personal information submitted with your job application may be collected and used by NMI for the purpose of recruitment and employment-related activities. By submitting your application, you acknowledge and provide explicit consent to the processing of your personal information as described in our privacy policy found on our website. For more information on how we process your information, please read our privacy policy here: https://www.nmi.com/legal/privacy-policy/
Salary range, depending on experience:
£50,000 - £70,000 GBP