Software Engineer (GenAI/ML)
Factored was conceived in Palo Alto, California by Andrew Ng and a team of highly experienced AI researchers, educators, and engineers to help address the significant shortage of qualified AI & Machine-Learning engineers globally. We know that exceptional technical aptitude, intelligence, communication skills, and passion are equally distributed around the world, and we are very committed to testing, vetting, and nurturing the most talented engineers for our program and on behalf of our clients.
We seek an ambitious and self-motivated Software Engineer with strong Python expertise and hands-on experience in Generative AI. You will join an ML-intensive environment to build and optimize scalable AI applications, enhance security guardrails, and design cloud-native architectures that power enterprise-grade AI solutions. This role offers the opportunity to collaborate with a multicultural team of high-caliber engineers across Software, Data, and ML disciplines while contributing to high-impact projects that help enterprises harness Generative AI safely and effectively.
Functional Responsibilities:
- Architect, design, and implement backend systems and APIs integrating GenAI technologies, including LangChain, LangGraph, and ChatGPT.
- Migrate from LangChain Agents to LangGraph Agents and implement agent-based architectures for advanced AI solutions.
- Build, deploy, and manage cloud-native applications on AWS, leveraging services such as Lambda, ECS, or SageMaker.
- Fine-tune and optimize LLMs (e.g., GPT-4, Llama, Mistral) and retrieval-augmented generation (RAG) architectures for production-ready systems.
- Enhance AI security frameworks, including implementing guardrails and conducting risk assessments to ensure compliance and safety.
- Collaborate closely with ML, data, and software teams to deliver scalable, secure, and high-performance AI applications.
- Monitor, debug, and optimize applications to ensure performance and reliability.
- Stay current with the latest advancements in LLMs, RAG techniques, and cloud infrastructure to inform architectural and design decisions.
Qualifications:
- 5+ years of Python development with a focus on ML, AI, or data-driven applications.
- Proven expertise with Generative AI frameworks and APIs such as LangChain, LangGraph, ChatGPT, and RAG implementations.
- 3+ years of experience with AWS or other major cloud platforms.
- Strong understanding of backend development using frameworks like FastAPI, Django, or Flask.
- Experience with authentication and API security (OAuth, JWT).
- Hands-on experience with databases (PostgreSQL, NoSQL) and vector databases for RAG workflows.
- Proficiency in deep learning frameworks such as PyTorch, HuggingFace, Keras, or FastAI.
- Knowledge of DevOps and CI/CD practices, including Docker, Kubernetes, Terraform, and Git-based workflows.
- Excellent written and spoken English communication skills and the ability to collaborate effectively across teams.
- Passion for exploring new AI/ML techniques and applying them to real-world, high-impact applications.
Create a Job Alert
Interested in building your career at Factored? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field