Logo
Deepgram

Senior Data Scientist

Deepgram, San Francisco, California, United States, 94199


Company Overview

Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI platform including access to models for speech-to-text, text-to-speech, and spoken language understanding with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.The OpportunityAt Deepgram, we believe that data is the key to unlock the future of voice-enabled experiences. But building with audio data is hard -- audio poses incredibly rich scientific, engineering, and infrastructure challenges that are orders of magnitude harder than working with text. As a Data Scientist at Deepgram, you will tackle conversational audio at scale, establishing automated data streams that will power the next generation of Voice AI foundation models. The models we build will go beyond basic transcription and comprehension to capture nuanced meanings in complex conversations, adapt robustly to diverse speech patterns, and generate empathic responses with human-like, contextualized speech. Domain-specific expertise in speech or language AI is not required. Rather we’re looking for seasoned scientists who have a track record of solving hard data problems while exploring research frontiers. Our start-up environment offers a stunning growth trajectory for adventure-seeking individuals, providing a level of project ownership and on-ground connection with end-customers that larger research labs simply cannot provide.What You’ll DoBuild high performance data acquisition, preparation and synthesis pipelines and drive them to generate data for training foundational voice models across modalities and tasksDevelop advanced characterizations of complex conversational audio utilizing a diverse toolkit of signals processing techniques and deep learning methodsCollaborate with DataOps and Engineering to create automated systems which scale the ability of human annotators to label high value data and provide feedback on model outputsBuild advanced benchmarking methodologies for evaluating interactive, conversational agent systemsIt’s Important To Us That You HaveExperience building data processing pipelines from a blank page and owning the entire data stack including acquisition, characterization, cleaning, serving and transformationExperience applying statistical methods and deep learning models to understand complex dataAbility to design and carry out research programs independently and with minimal oversightStrong software engineering skills with particular emphasis on developing clean, modular code in Python and working with PytorchStrong communication skills and the ability to translate complex concepts in simple terms, depending on the target audienceBacked by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $85 million in total funding after closing our Series B funding round last year. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you!Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.We are happy to provide accommodations for applicants who need them.

#J-18808-Ljbffr