Tbwa Chiat/Day Inc
Software Engineer, GenAI Model Evaluation
Tbwa Chiat/Day Inc, San Francisco, California, United States, 94199
Software Engineer, GenAI Model Evaluation
About JobSoftware is eating the world, but AI is eating software. We live in unprecedented times – AI has the potential to exponentially augment human intelligence. As the world adjusts to this new reality, leading tech companies are racing to build LLMs at billion dollar scale, while large enterprises figure out how to add it to their products. To ensure that these models are safe, aligned, and highly useful, they require extremely high quality human-generated data and evaluation. Since before the launch of ChatGPT, through to the latest generation of frontier models coming out today, Scale has been at the forefront of providing the post-training, fine-tuning, and human preference alignment (RLHF) data needed to ensure these models are capable, aligned, and useful via our Generative AI Data Engine.As customers train their models on this data, and constantly aim to improve them, a critical need is having trustworthy evaluations of model performance, and an ability to identify weaknesses and potential vulnerabilities. Conducting these evaluations with our human experts constitutes a significant and growing portion of Scale’s work—thus assisting model developers in iteratively understanding where to focus their technical investments.The GenAI Safety & Evaluation product team at Scale is at the heart of this work, building a world-class customer-facing model evaluation platform. This platform enables customers to easily launch new evaluation workflows, deep dive into evaluation results down to the test case level to understand weaknesses and benchmark performance, and use these insights to drive model development roadmaps. In building this product, you will have a chance to shape the way that models across the industry are evaluated, impacting billions of people around the world.As part of the Safety & Evaluation product team, you will partner closely with researchers from Scale’s Safety, Evaluations, and Alignment Lab (SEAL) on productization of novel research, as well as Scale’s expert red team, which supports AI safety via rigorous model testing trusted by major enterprises and leading model developers.We’re looking for entrepreneurial Software Engineers to join our team. In this role, you'll be given the opportunity to build these products and drive millions of dollars in revenue. You’ll also get widespread exposure to the forefront of the AI race as Scale sees it in enterprises, startups, governments, and large tech companies.You will:Own large new areas within our productWork across backend, frontend, and interacting with LLMs and/or other ML modelsDeliver experiments at a high velocity and level of quality to engage our customersWork across the entire product lifecycle from conceptualization through productionBe able, and willing, to multi-task and learn new technologies quicklyCollaborate with cross-functional teams to define, design, and ship new product features and experiences.Must be able to commute to the San Francisco Office 1-2x weekly.Ideally you’d have:5+ years of full-time engineering experience, post-graduationProficiencies in one or more of Python, Node, React, Next.js and MongoDBSolid background in algorithms, data structures, and object-oriented programming.Experience scaling products at hyper-growth startupsExcitement to work with AI technologiesStrong written and verbal communication skillsStrong problem-solving skills, and be able to work independently or as part of a team.Nice to haves:Strong knowledge of software engineering best practices.Experience with AI platforms and technologies, including generative models and LLMs.Experience building ML infrastructure and AI-powered solutions.Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training.The base salary range for this full-time position in the location of San Francisco is:$160,000 - $192,000 USDPLEASE NOTE:
Our policy requires a 90-day waiting period before reconsidering candidates for the same role.About Us:At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. Our products power the world's most advanced LLMs, generative models, and computer vision models. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, government agencies like the U.S. Army and U.S. Air Force, and enterprises including GM and Accenture. We are expanding our team to accelerate the development of AI applications.We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an affirmative action employer and inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.
#J-18808-Ljbffr
About JobSoftware is eating the world, but AI is eating software. We live in unprecedented times – AI has the potential to exponentially augment human intelligence. As the world adjusts to this new reality, leading tech companies are racing to build LLMs at billion dollar scale, while large enterprises figure out how to add it to their products. To ensure that these models are safe, aligned, and highly useful, they require extremely high quality human-generated data and evaluation. Since before the launch of ChatGPT, through to the latest generation of frontier models coming out today, Scale has been at the forefront of providing the post-training, fine-tuning, and human preference alignment (RLHF) data needed to ensure these models are capable, aligned, and useful via our Generative AI Data Engine.As customers train their models on this data, and constantly aim to improve them, a critical need is having trustworthy evaluations of model performance, and an ability to identify weaknesses and potential vulnerabilities. Conducting these evaluations with our human experts constitutes a significant and growing portion of Scale’s work—thus assisting model developers in iteratively understanding where to focus their technical investments.The GenAI Safety & Evaluation product team at Scale is at the heart of this work, building a world-class customer-facing model evaluation platform. This platform enables customers to easily launch new evaluation workflows, deep dive into evaluation results down to the test case level to understand weaknesses and benchmark performance, and use these insights to drive model development roadmaps. In building this product, you will have a chance to shape the way that models across the industry are evaluated, impacting billions of people around the world.As part of the Safety & Evaluation product team, you will partner closely with researchers from Scale’s Safety, Evaluations, and Alignment Lab (SEAL) on productization of novel research, as well as Scale’s expert red team, which supports AI safety via rigorous model testing trusted by major enterprises and leading model developers.We’re looking for entrepreneurial Software Engineers to join our team. In this role, you'll be given the opportunity to build these products and drive millions of dollars in revenue. You’ll also get widespread exposure to the forefront of the AI race as Scale sees it in enterprises, startups, governments, and large tech companies.You will:Own large new areas within our productWork across backend, frontend, and interacting with LLMs and/or other ML modelsDeliver experiments at a high velocity and level of quality to engage our customersWork across the entire product lifecycle from conceptualization through productionBe able, and willing, to multi-task and learn new technologies quicklyCollaborate with cross-functional teams to define, design, and ship new product features and experiences.Must be able to commute to the San Francisco Office 1-2x weekly.Ideally you’d have:5+ years of full-time engineering experience, post-graduationProficiencies in one or more of Python, Node, React, Next.js and MongoDBSolid background in algorithms, data structures, and object-oriented programming.Experience scaling products at hyper-growth startupsExcitement to work with AI technologiesStrong written and verbal communication skillsStrong problem-solving skills, and be able to work independently or as part of a team.Nice to haves:Strong knowledge of software engineering best practices.Experience with AI platforms and technologies, including generative models and LLMs.Experience building ML infrastructure and AI-powered solutions.Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training.The base salary range for this full-time position in the location of San Francisco is:$160,000 - $192,000 USDPLEASE NOTE:
Our policy requires a 90-day waiting period before reconsidering candidates for the same role.About Us:At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. Our products power the world's most advanced LLMs, generative models, and computer vision models. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, government agencies like the U.S. Army and U.S. Air Force, and enterprises including GM and Accenture. We are expanding our team to accelerate the development of AI applications.We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an affirmative action employer and inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.
#J-18808-Ljbffr