Beacon

Staff Machine Learning Engineer

Beacon, San Francisco, CA, United States


Founding/Staff Machine Learning Engineer - Generative AI

Our client is a venture-backed YC startup revolutionizing the supply chain sector with cutting-edge generative AI technologies. As a rapidly growing organization, they are applying advancements in machine learning to address complex industry challenges, unlocking unprecedented efficiencies and insights for their clients.
Role Overview

We are seeking a Founding/Staff Machine Learning Engineer with deep expertise in generative AI to lead critical technical efforts in developing and deploying state-of-the-art solutions. The role suits an experienced engineer who thrives on building robust, scalable systems and is passionate about advancing the frontiers of AI. It requires hands-on experience training large language models (LLMs), working with embedding models and vector databases, and developing AI-powered chatbot solutions.
Key Responsibilities
  • Train and fine-tune LLM foundation models (e.g., GPT, Claude, Gemini/PaLM 2, LLaMA) using cutting-edge techniques and frameworks, ideally on AWS SageMaker.
  • Design, implement, and optimize embedding models for a variety of applications.
  • Build and deploy AI-powered chatbots using frameworks like LangChain or LangGraph.
  • Integrate and manage vector databases (e.g., MongoDB Atlas Vector Search, Milvus, Weaviate, Pinecone) to support efficient model querying and retrieval.
  • Collaborate closely with cross-functional teams to align AI-driven solutions with business objectives in the supply chain domain.
  • Write clean, maintainable, and scalable code in Python; TypeScript experience is a strong plus.
  • Drive the end-to-end lifecycle of machine learning models, from research and experimentation to production deployment and monitoring.
Qualifications
  • Experience: 5+ years working hands-on as a Machine Learning Engineer (not a Data Scientist), with a focus on developing and deploying production-ready solutions.
  • Foundation Models: Proven experience training and fine-tuning LLMs (GPT, Claude, Gemini/PaLM 2, LLaMA, etc.).
  • Embedding Models: Strong expertise in designing and implementing embedding-based solutions.
  • Vector Databases: Practical knowledge of vector databases (MongoDB Atlas Vector Search, Milvus, Weaviate, Pinecone, etc.).
  • Chatbots: Hands-on experience building AI-powered chatbots, ideally using LangChain or LangGraph.
  • Technical Skills: Advanced proficiency in Python. Experience with TypeScript is a plus but not required.
  • Cloud Platforms: Familiarity with AWS, particularly SageMaker, for training and deploying models.
  • Team Collaboration: Excellent communication and collaboration skills to work in a fast-paced, multidisciplinary environment.
Why Join
  • Be a foundational team member in a high-impact, venture-backed startup.
  • Solve meaningful problems with cutting-edge generative AI technologies.
  • Work in a dynamic, collaborative environment in the heart of Silicon Valley.
  • Enjoy competitive compensation, benefits, and equity opportunities.