NLP PEOPLE
Senior Data Scientist – Large Language Models / Generative AI
NLP PEOPLE, New York, New York, US, 10261
Our Data Science Large Language Models team uses advanced generative AI technologies to create powerful features within the Datadog application. We focus on fine-tuning, training, and serving LLMs to power features across the Datadog platform. As a data scientist on the team, you will help build the foundation of impactful features that use LLMs as a core computation unit, enabling our users to understand their data and systems, reason, plan, and act. You will collaborate with a group of skilled engineers and data scientists to improve and implement state-of-the-art Large Language Models and Agents that have a direct impact on Datadog products (for example, see Bits AI, your new DevOps copilot).
At Datadog, we place value in our office culture – the relationships that it builds, the creativity it brings to the table, and the collaboration of being together. We operate as a hybrid workplace to ensure our employees can create a work-life harmony that best fits them.
What You’ll Do:
Work on a wide range of projects: building large-scale distributed fine-tuning and training infrastructure, deploying LLMs on GPU instances for real-time use cases, designing robust, secure infrastructure, or supporting cutting-edge AI research and development.
Create new product features using advanced machine learning algorithms, LLMs, and statistical techniques.
Collaborate with a group of AI specialists and scientists to envision the future of our capabilities while also helping design and deploy crucial services.
Actively participate in our journal club by reading and presenting the latest research papers in the field of LLMs and Generative AI.
Provide deeper insights into, and tell the stories behind, the massive volumes of data processed in Datadog systems.
Develop, deploy, monitor, and maintain the LLMs, services, and infrastructure managed by your team, and participate in your team’s on-call rotation.
Who You Are:
You possess a BS/MS/PhD in Computer Science, Engineering, Machine Learning, or a related scientific field, or have equivalent experience.
You have relevant experience with Language Models (Large LMs is a plus), NLP, large-scale systems and data sets, deep learning, or adjacent fields. Writing production data pipelines and applications is a plus.
You have experience with the stack for distributed training and inference of large models, including distributed training and inference frameworks and ML development frameworks such as PyTorch, TensorFlow, etc. Experience with CUDA is a plus.
You are able to explain complex models and ideas to non-technical audiences.
You value code simplicity and performance.
You are passionate about Generative AI and want to contribute to user-facing products.
Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That’s okay. If you’re passionate about technology and want to grow your skills, we encourage you to apply.
Benefits and Growth:
Competitive global benefits.
New hire stock equity (RSUs) and employee stock purchase plan (ESPP).
Opportunity to collaborate closely with colleagues across the Datadog offices in New York City and Paris.
Opportunity to attend and present at conferences and meetups.
Intra-departmental mentor and buddy program for in-house networking.
An inclusive company culture and the ability to join our Community Guilds (Datadog employee resource groups).
Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog.