Tbwa Chiat/Day Inc
Software Engineer, GenAI Model Evaluation
Tbwa Chiat/Day Inc, San Francisco, California, United States, 94199
Software Engineer, GenAI Model Evaluation
About JobSoftware is eating the world, but AI is eating software. We live in unprecedented times – AI has the potential to exponentially augment human intelligence. To ensure that these models are safe, aligned, and highly useful, they require extremely high quality human-generated data and evaluation. Scale has been at the forefront of providing the post-training, fine-tuning, and human preference alignment (RLHF) data needed to ensure these models are capable, aligned, and useful via our Generative AI Data Engine.As customers train their models on this data, a critical need is having trustworthy evaluations of model performance, and an ability to identify weaknesses and potential vulnerabilities. Conducting these evaluations with our human experts constitutes a significant portion of Scale’s work—thus assisting model developers in iteratively understanding where to focus their technical investments.The GenAI Safety & Evaluation product team at Scale is at the heart of this work, building a world-class customer-facing model evaluation platform. This platform enables customers to easily launch new evaluation workflows, deep dive into evaluation results down to the test case level to understand weaknesses and benchmark performance, and use these insights to drive model development roadmaps.As part of the Safety & Evaluation product team, you will partner closely with researchers from Scale’s Safety, Evaluations, and Alignment Lab (SEAL) on productization of novel research, as well as Scale’s expert red team, which supports AI safety via rigorous model testing trusted by major enterprises and leading model developers.We’re looking for entrepreneurial Software Engineers to join our team. In this role, you'll be given the opportunity to build these products and drive millions of dollars in revenue. You’ll also get widespread exposure to the forefront of the AI race as Scale sees it in enterprises, startups, governments, and large tech companies.You will:Own large new areas within our productWork across backend, frontend, and interacting with LLMs and/or other ML modelsDeliver experiments at a high velocity and level of quality to engage our customersWork across the entire product lifecycle from conceptualization through productionBe able, and willing, to multi-task and learn new technologies quicklyCollaborate with cross-functional teams to define, design, and ship new product features and experiences.Must be able to commute to the San Francisco Office 1-2x weekly.Ideally you’d have:5+ years of full-time engineering experience, post-graduationProficiencies in one or more of Python, Node, React, Next.js and MongoDBSolid background in algorithms, data structures, and object-oriented programming.Experience scaling products at hyper-growth startupsExcitement to work with AI technologiesStrong written and verbal communication skillsStrong problem-solving skills, and be able to work independently or as part of a team.Nice to haves:Strong knowledge of software engineering best practices.Experience with AI platforms and technologies, including generative models and LLMs.Experience building ML infrastructure and AI-powered solutions.Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The base salary range for this full-time position in the location of San Francisco is:$160,000 - $192,000 USDAbout Us:At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, and government agencies like the U.S. Army and U.S. Air Force.We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an affirmative action employer and inclusive and equal opportunity workplace.Apply for this job
* indicates a required field
#J-18808-Ljbffr
About JobSoftware is eating the world, but AI is eating software. We live in unprecedented times – AI has the potential to exponentially augment human intelligence. To ensure that these models are safe, aligned, and highly useful, they require extremely high quality human-generated data and evaluation. Scale has been at the forefront of providing the post-training, fine-tuning, and human preference alignment (RLHF) data needed to ensure these models are capable, aligned, and useful via our Generative AI Data Engine.As customers train their models on this data, a critical need is having trustworthy evaluations of model performance, and an ability to identify weaknesses and potential vulnerabilities. Conducting these evaluations with our human experts constitutes a significant portion of Scale’s work—thus assisting model developers in iteratively understanding where to focus their technical investments.The GenAI Safety & Evaluation product team at Scale is at the heart of this work, building a world-class customer-facing model evaluation platform. This platform enables customers to easily launch new evaluation workflows, deep dive into evaluation results down to the test case level to understand weaknesses and benchmark performance, and use these insights to drive model development roadmaps.As part of the Safety & Evaluation product team, you will partner closely with researchers from Scale’s Safety, Evaluations, and Alignment Lab (SEAL) on productization of novel research, as well as Scale’s expert red team, which supports AI safety via rigorous model testing trusted by major enterprises and leading model developers.We’re looking for entrepreneurial Software Engineers to join our team. In this role, you'll be given the opportunity to build these products and drive millions of dollars in revenue. You’ll also get widespread exposure to the forefront of the AI race as Scale sees it in enterprises, startups, governments, and large tech companies.You will:Own large new areas within our productWork across backend, frontend, and interacting with LLMs and/or other ML modelsDeliver experiments at a high velocity and level of quality to engage our customersWork across the entire product lifecycle from conceptualization through productionBe able, and willing, to multi-task and learn new technologies quicklyCollaborate with cross-functional teams to define, design, and ship new product features and experiences.Must be able to commute to the San Francisco Office 1-2x weekly.Ideally you’d have:5+ years of full-time engineering experience, post-graduationProficiencies in one or more of Python, Node, React, Next.js and MongoDBSolid background in algorithms, data structures, and object-oriented programming.Experience scaling products at hyper-growth startupsExcitement to work with AI technologiesStrong written and verbal communication skillsStrong problem-solving skills, and be able to work independently or as part of a team.Nice to haves:Strong knowledge of software engineering best practices.Experience with AI platforms and technologies, including generative models and LLMs.Experience building ML infrastructure and AI-powered solutions.Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The base salary range for this full-time position in the location of San Francisco is:$160,000 - $192,000 USDAbout Us:At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, and government agencies like the U.S. Army and U.S. Air Force.We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an affirmative action employer and inclusive and equal opportunity workplace.Apply for this job
* indicates a required field
#J-18808-Ljbffr