Data Scientist

New York, New York, United States

Octus

Octus is a leading global provider of credit intelligence, data, and analytics. Since 2013, tens of thousands of professionals across hedge fund, investment banking, management consulting, and law firm verticals have come to rely on Octus to make better, faster, and more confident decisions in pace with the fast-moving credit markets.
For more information, visit: https://octus.com/

Working at Octus

Octus hires growth-minded innovators and trailblazers across the globe to drive our business and culture. Our core values – Action Oriented, Customer First Mindset, Effective Team Players, and Driven to Excel – define an organizational ethos that’s as high-performing as it is human. Among other perks, Octus employees enjoy competitive health benefits, matched 401k and pension plans, PTO, generous parental leave, gym subsidies, educational reimbursements for career development, recognition programs, pet-friendly offices (US only), and much more. 

Role

Octus delivers breaking news and market-moving intelligence through cutting-edge data and technology for hedge funds, investment banks, and law firms, and we’re transforming how professionals access complex and opaque information. As part of our high-performing AI Innovation team, you’ll help design, build, and productionize modern GenAI and LLM-powered systems that support both client-facing features and internal operational efficiency. You’ll work end to end, from shaping ambiguous problems into scalable solutions to deploying reliable AI models in production, collaborating closely with product, engineering, and infrastructure teams. This is a hybrid role based in NYC (3 days in office per week). Curious about what we’re building? Check out our flagship GenAI product, CreditAI, and our AI framework.

Responsibilities

  • Apply strong problem-solving and critical thinking skills to break down complex, ambiguous requirements into clear, implementable technical components and system designs.
  • Design, build, and maintain AI-powered and data-driven systems with a focus on modern language and multimodal models, including LLM-driven applications, RAG pipelines, and agentic workflows.
  • Evaluate and productionize commercial and open-source LLMs, choosing appropriate models, tools, and techniques for each use case.
  • Develop multi-step agentic workflows that incorporate tools, external data sources, memory, and control logic.
  • Manage the orchestration of production LLM workflows and agentic systems, ensuring reliability and efficiency through prompt routing, state management, retries, fallbacks, and error handling.
  • Design, test, and iteratively refine prompts and system instructions using prompt engineering and tuning techniques to improve model reliability, accuracy, and task performance.
  • Maintain production-grade code and services with automated monitoring and performance tracking, using metrics and alerts to guide continuous improvements in models, prompts, and pipelines.
  • Apply systems thinking to design and optimize AI and LLM systems, balancing quality, scalability, latency, cost, and operational complexity, while implementing efficiency improvements using model selection, prompt design, batching, caching, and retrieval strategies.
  • Define and implement evaluation and observability frameworks for AI systems, including automated testing, task-specific benchmarks, regression testing for prompts, human-in-the-loop validation, and performance monitoring.
  • Build and integrate AI models into backend systems and APIs to support both real-time and batch inference, ensuring solutions are production-ready, scalable, and efficient.
  • Apply NLP and ML techniques to tasks such as information extraction, semantic search and retrieval, text classification, summarization, and reasoning over text and documents.
  • Collaborate closely with engineering and infrastructure teams to deploy solutions using containerized and cloud-based environments (e.g., GitHub, Docker, AWS), applying modern deployment and infrastructure practices.
  • Collaborate with product managers, business stakeholders, and domain experts to translate complex, ambiguous business problems into actionable technical solutions, and communicate progress, trade-offs, and outcomes to relevant stakeholders.
  • Continuously learn and adapt to advancements in NLP and Generative AI to ensure solutions remain innovative and effective.

Requirements

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience).
  • 2+ years of experience as a Data Scientist, Machine Learning Engineer, or applied AI practitioner, with a strong foundation in computer science, algorithms, and software development. 
  • Advanced programming skills in Python, with experience building production-grade systems beyond research or experimentation.
  • Solid understanding of machine learning and applied AI concepts, with experience taking solutions from prototype to production.
  • Hands-on experience designing, building, and deploying LLM-driven or GenAI applications, including familiarity with vector databases, embeddings pipelines, or semantic search systems.
  • Practical experience with cloud-based deployments and infrastructure tools (e.g., AWS, Docker, GitHub) and an understanding of modern DevOps practices, containerization, orchestration, caching strategies, and cost-aware design.
  • Strong problem-solving skills and systems thinking, with the ability to balance trade-offs across model quality, scalability, inference latency, cost, and operational complexity.
  • Ability to interpret and implement research ideas and algorithms, actively contributing to research and development initiatives while translating them into production solutions.
  • Excellent communication and collaboration skills, with experience working closely with product managers, engineers, and domain experts to deliver actionable technical solutions.
  • Passion for learning and staying current with the rapidly evolving AI/ML landscape, including emerging best practices for GenAI applications.
  • Strong ownership and initiative, with the ability to independently drive projects from problem definition to delivery, while being a team player and contributing to the overall success of the data science team.

At Octus, we consider a range of factors in connection with compensation decisions, including experience, skills, location, and our business needs and limitations. As a result, compensation may vary within and across similar roles and positions. Please note that the salary range information below is a good faith estimate for this position, and actual compensation for any individual may fall outside this range if warranted by the circumstances applicable to that individual. If we identify a role that would be suitable for a broader range of skills and experience, such that we would consider hiring at multiple levels, then the range listed below may reflect that breadth.

The salary range estimate for this position is $130,000-$145,000.

The actual compensation will be at Octus’ sole discretion and will be determined by the aforementioned and other relevant factors. This position is eligible for a performance-based annual bonus.

Equal Employment Opportunity

Octus is committed to providing equal employment opportunities to all employees and applicants for employment without regard to race, colour, religion, sex, sexual orientation, gender identity, national origin, age, disability, genetic information, marital status, pregnancy, veteran status, or any other legally protected status. We strive to create an inclusive and diverse work environment where all individuals are valued, respected, and treated fairly. We believe that diversity enriches our workplace and enhances our ability to innovate and succeed.

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in Octus’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Voluntary Self-Identification of Disability

Form CC-305
Page 1 of 1
OMB Control Number 1250-0005
Expires 04/30/2026

Why are you being asked to complete this form?

We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years.

Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to:

  • Alcohol or other substance use disorder (not currently using drugs illegally)
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS
  • Blind or low vision
  • Cancer (past or present)
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or serious difficulty hearing
  • Diabetes
  • Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders
  • Epilepsy or other seizure disorder
  • Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome
  • Intellectual or developmental disability
  • Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD
  • Missing limbs or partially missing limbs
  • Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS)
  • Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities
  • Partial or complete paralysis (any cause)
  • Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema
  • Short stature (dwarfism)
  • Traumatic brain injury

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.