ZipRecruiter

Lead Data Scientist

ZipRecruiter, New York, New York, us, 10261

Job DescriptionJob Description

Exciting Opportunity for ML and NLP/Gen AI/LLM Scientists

Are you passionate about leveraging automation and AI/ML to drive business value? We have a unique opportunity for hands-on ML scientists and NLP/Gen AI/LLM experts to advance their careers and apply their technical expertise in NLP, deep learning, GenAI, and LLMs. Join us in conducting cutting-edge applied research and making a significant impact across multiple stakeholders.

Responsibilities and Impact

ML, Gen AI, NLP, LLM Model Development:

Design and develop custom ML, Gen AI, NLP, and LLM models for batch and stream processing-based AI/ML pipelines.

Work on data ingestion, preprocessing, search and retrieval, Retrieval Augmented (RAG), NLP/LLM model development, fine-tuning, and prompt engineering.

Collaborate with data science, MLOps, and technology teams to ensure solutions meet all technical and business requirements.

ML, NLP, LLM Model Evaluation:

Develop, validate, and maintain robust evaluation solutions and tools to assess model performance, accuracy, consistency, and reliability.

Implement model optimizations to enhance system efficiency.

NLP, LLM, Gen AI Model Deployment:

Partner with the MLOps team to deploy machine learning models into production environments, ensuring reliability and scalability.

Internal Collaboration:

Work closely with product teams, business stakeholders, MLOps, machine learning engineers, and software engineers to integrate machine learning models smoothly into production systems.

Experience Required:

Bachelor's, Masters, or Ph.D. degree in Computer Science, Mathematics, Statistics, Computational Linguistics, Engineering, or a related field.

8+ years of professional hands-on experience with large sets of structured and unstructured data to develop data-driven analytics and insights using ML, RAG, and NLP.

3+ years of hands-on experience with Python, Hugging Face, TensorFlow, Keras, PyTorch, or similar statistical tools. Expert in Python programming.

3+ years of experience developing NLP models, ideally with transformer architectures.

3+ years of experience implementing information search and retrieval at scale, using solutions ranging from keyword search to semantic search using embeddings.

Knowledge and hands-on experience with developing or tuning Large Models (LLM) and Generative AI (GAI).

Experienced with NLP, LLMs (extractive and generative), fine-tuning, and LLM model development. Strong familiarity with trends in LLMs and open-source platforms.

This role is fully remote as long as you are based in the US. The position will pay between $120k-$240k depending on experience. Send your resume now