Saviance

Data Scientist - Large Language Model (LLM)

Saviance, Boston, MA, United States

Job Title: Data Scientist - Large Language Model (LLM)

Location: Remote

Duration: Full time

About BigRio:

BigRio is a pioneering technology company at the forefront of professional services and consulting along with natural language processing innovation. We are seeking an accomplished Data Scientist to join our team and play a pivotal role in developing and enhancing large language model (LLM) applications that are reshaping the field of AI especially in the healthcare industry segment.

Job Description:

As a Data Scientist specializing in large language model (LLM) applications, you will lead the charge in advancing our state-of-the-art natural language understanding and generation solutions. You will collaborate closely with our research and engineering teams to design, implement, and optimize language models, with a strong emphasis on transformers and attention networks in NLP. Your expertise will be instrumental in shaping the future of AI-driven language technologies.

Key Responsibilities:

Reinforcement Learning Expertise: Conduct advanced research and have a track record of scientific publications in reinforcement learning, including Q-learning, value-iteration methods, DQN, double DQN, actor-critic, and Proximal Policy Optimization.
NLP and Transformers Mastery: Demonstrate deep knowledge, publications and hands-on experience with transformers and attention networks in NLP, including proficiency with the Hugging Face Transformers library and models.
Model Development: Design, develop, and optimize large language models using cutting-edge transformer architectures and attention mechanisms, supported by proven code and projects.
Data Structures and Algorithms: Possess a comprehensive understanding of data structures and algorithms, applying them effectively to address complex NLP challenges.
Unix Proficiency: Be proficient in Unix-based systems to facilitate efficient data processing and model development workflows.
Python Development: Bring at least 3-5 years of extensive Python development experience, with a focus on data science, machine learning, and AI projects.
Prompt Engineering: Efficient and intensive prompt engineering expertise.
LLM Infrastructure and engineering: Experience with various options for setting up the LLM infrastructure in the cloud.

Requirements:

To excel in this role, you should meet the following qualifications:

Education: Hold a Master's or Ph.D. in computer science or a related field.
Reinforcement Learning Knowledge and Publications: Present a proven track record of scientific publications in reinforcement learning, showcasing expertise in various RL methods.
NLP and Transformers Knowledge and publications: Demonstrate in-depth understanding and hands-on experience and proven scientific publications with transformers and attention networks for NLP, including familiarity with the Hugging Face Transformers library and models.
Fine tuning language models: Demonstrated ability to fine tune language models in multi-GPU environment.
Data Structures and Algorithms: Possess a strong grasp of data structures and algorithms, with the ability to apply them effectively to solve intricate NLP problems.
Unix Proficiency: Exhibit proficiency in Unix-based systems for efficient data processing and development tasks.
Python Development: Have a minimum of 3-5 years of hands-on experience in Python development, with a particular focus on data science, machine learning, and AI. Moreover, at least 4-8 years of experience with deep learning frameworks (e.g., TensorFlow, PyTorch) and proficiency in other Client algorithms and libraries.
Problem Solving: Showcase exceptional problem-solving skills and a creative approach to tackling complex NLP challenges.
Communication: Possess strong verbal and written communication skills, enabling effective collaboration with cross-functional teams.