Platform Reliability Engineer
WorldQuant develops and deploys systematic financial strategies across a broad range of asset classes and global markets. We seek to produce high-quality predictive signals (alphas) through our proprietary research platform to employ financial strategies focused on market inefficiencies. Our teams work collaboratively to drive the production of alphas and financial strategies – the foundation of a balanced, global investment platform.
WorldQuant is built on a culture that pairs academic sensibility with accountability for results. Employees are encouraged to think openly about problems, balancing intellectualism and practicality. Excellent ideas come from anyone, anywhere. Employees are encouraged to challenge conventional thinking and possess an attitude of continuous improvement.
Our goal is to hire the best and the brightest. We value intellectual horsepower first and foremost, and people who demonstrate outstanding talent. There is no roadmap to future success, so we need people who can help us build it.
Technologists at WorldQuant research, design, code, test and deploy projects while working collaboratively with researchers. Our environment is relaxed yet intellectually driven. We seek people who think in code and are motivated by being around like-minded people.
The Role: We are seeking a Platform Reliability Engineer to join a highly specialized team of exceptionally talented yet refreshingly humble individuals from diverse disciplines. We believe that delivering exceptional services requires the ability to make meaningful changes across the entire stack. Our mission is to solve real business challenges, reduce operational complexities, and foster a collaborative, team-driven environment that promotes mutual growth and success.
As a Platform Reliability Engineer, you will play a key role in managing and optimizing the operational aspects of the server and network infrastructure for a large financial buy-side organization. Your primary focus will be on reducing operational overhead, optimizing systems, managing configurations, and ensuring the reliability and performance of critical infrastructure.
What You'll Do:
- Ensure the production reliability of the firm’s Linux-based platform as part of a globally distributed engineering team.
- Provide rapid emergency response to production infrastructure issues.
- Proactively understand internal clients’ needs and effectively communicate them to leadership at both regional and global levels.
- Identify risks, develop contingency plans, and implement solutions to mitigate them.
- Develop and enhance the observability platform to monitor the performance and health of critical computing environments.
- Participate in occasional (monthly) on-call rotations and support on-call staff during their shifts.
- Contribute to organizational knowledge through documentation, education, and writing maintainable code.
What You’ll Bring:
- 4+ years of experience in SRE, DevOps, or other infrastructure engineering roles, preferably within the financial industry.
- Strong understanding of Linux system internals, including kernel operations, memory management, and performance optimization.
- In-depth knowledge of storage technologies, particularly those used in high-performance computing (GPFS experience is a plus).
- Broad understanding of IT infrastructure components, such as networking, DNS, NTP/PTP, and NIS.
- Proficiency in system automation, monitoring, and self-healing (experience with Salt is a plus).
- Experience with container orchestration and virtualization technologies (e.g., Kubernetes, Nomad, VMware).
- Familiarity with on-premises and cloud-based HPC infrastructure (operational knowledge of Slurm and GPU is a plus).
- Understanding of AI technologies and their applications in infrastructure automation and management. Experience with or a strong interest in implementing AI/ML solutions for infrastructure optimization, anomaly detection, or predictive analytics.
- A passion for technology and automation, with a deep sense of curiosity and ownership.
- A hands-on approach to problem-solving and a demonstrable enthusiasm for technology.
- Excellent verbal and written communication skills.
What We Offer:
- Competitive compensation package.
- Core benefits, including premium private health insurance.
- Strong culture of learning and development: training courses, guest speakers, share and learn events, etc.
- Regular team-building events, annual conferences, and occasional global summits, with opportunities to travel and connect with our local and global teams.
- Dynamic work without routine in a leading international company.
By submitting this application, you acknowledge and consent to the terms of the WorldQuant Privacy Policy. The privacy policy offers an explanation of how and why your data will be collected, how it will be used and disclosed, how it will be retained and secured, and what legal rights are associated with that data (including the rights of access, correction, and deletion). The policy also describes legal and contractual limitations on these rights. The specific rights and obligations of individuals living and working in different areas may vary by jurisdiction.
Copyright © 2025 WorldQuant, LLC. All Rights Reserved.
WorldQuant is an equal opportunity employer and does not discriminate in hiring on the basis of race, color, creed, religion, sex, sexual orientation or preference, age, marital status, citizenship, national origin, disability, military status, genetic predisposition or carrier status, or any other protected characteristic as established by applicable law.