Machine Learning Engineer
Clara is the leading spend management platform for companies in Latin America. Our end-to-end solution includes locally-issued corporate cards, bill pay, and our highly-rated software platform; already being used by thousands of the most successful companies across the region.
Clara is backed by top investors and strategics including Accial Capital, Canary, Citi Ventures, Coatue, DST Global, General Catalyst, Goldman Sachs, ICONIQ Growth, Kaszek, Monashees, and Notable Capital, as well as prominent angel investors.
CLARA's Data Team is looking for a highly professional and experienced Machine Learning (ML) Engineer to join us and take an active role in the ML -data architectural design alongside our teams of data engineers and data scientists to create, deploy and maintain complex ML systems and services with the responsibilities and challenges that accompanies an environment like this.
Your primary focus will be to develop data pipelines for ML, design, implement and deploy the ML models and infrastructure, improve existing processes, and create data models. You will also be responsible for the maintenance and operation of all these models and data pipelines, and collaborate with other teams in order to provide high-quality and fast-delivery ML models for the company.
The main challenges and responsibilities you will face are:
- Build, integrate and maintain ML feature data pipelines following reliability, scalability, and maintainability principles.
- Deploy, monitor, and operate ML model lifecycle (ML Ops).
- Collaborate with data scientists in the implementation of ML models.
- Ensuring and monitoring the data quality, and understanding/resolving any issue detected.
- Consume data from Data Vaults and Star Schema lakehouse.
- Be able to collaborate with teams remotely, not only locally.
- Stay up to date with the latest technologies and look for ways to implement them.
- Collaborate with our data scientists and data engineers to better understand their requirements and meet the company's goals.
- Document your work to have a solid foundation of it for new team members or future references.
- Code review and peer programming activities.
- Enforce team best practices and DevOps as a culture within the team.
- Enforce Data Security and Data Privacy best practices.
Our requirements for this position are:
- 4 years of experience in Model deployment and infrastructure.
- Proficiency in Python and SQL.
- ML Algorithms experience, both theoretical and practical.
- Experience with CI/CD tools (GitHub Actions, GitLab CI, Jenkins, etc.).
- Experience using Github flow.
- Docker container experience.
- Experience implementing the feature store paradigm.
- Experience deploying streaming ML models with common ML frameworks (e.g. xgboost, scikit-learn, catboost).
- High-level understanding of Distributed Systems, and Spark Architecture.
- Experience integrating data from Databases, APIs, and Event Streams (kinesis, kafka).
- Experience developing complex data pipelines (ETL/ELT) with orchestration tools (e.g. Apache Airflow, AWS Glue Workflow, AWS Step Functions, etc.)
- Experience with AWS services: Redshift, Glue, Sagemaker, Lambda, Athena, S3, Kinesis. Or other cloud equivalent services.
- Experience with big data technologies and frameworks (e.g. (py)Spark, Scala, Hive, Kafka, etc.)
- Good English Level.
Nice to have:
- Databricks platform usage and administration
- MLflow
We value
We are
What We offer
Apply for this job
*
indicates a required field