Deepgram

Research Scientist, Voice

Deepgram, San Francisco, California, United States, 94199



Date Posted: 09 Jan, 2024

Work Location: San Francisco, United States

Salary Offered: $150,000 to $230,000 yearly

Job Type: Full Time

Experience Required: 3+ years

Remote Work: Yes

Vacancies: 1 available

Company Overview

Deepgram is a foundational AI company building state-of-the-art, production-ready AI models that streamline human-computer interaction and amplify productivity. By enabling seamless communication between humans and machines, we believe we can harness the untapped potential of AI and help pave the way for a more productive future.

The Opportunity

At Deepgram, we spend every day tackling big, real-world challenges in voice. Our customers hire us to solve their hardest problems, taking real, complex audio and transforming it into novel insights. As a Research Scientist at Deepgram, you’ll have the freedom to explore and uncover breakthroughs. You’ll also have a mandate to build: applying the latest advancements in deep learning to develop accurate and performant voice AI models.

The Role

Deepgram is currently looking for an experienced Research Scientist who has worked extensively on building models to solve hard problems in voice AI domains including automatic speech recognition (ASR), text-to-speech (TTS), diarization and speaker identification, language detection, or code switching.

What You’ll Do

Stay up to date with the latest advances in deep learning with a particular eye towards their implications and applications within our products.

Design and carry out experimental programs to build new voice AI models that solve critical problems for our customers.

Drive large-scale training jobs successfully on distributed computing infrastructure.

Optimize model architectures to make them as fast and memory-efficient as possible; deploy new models into production for use at massive scale.

Document and present results and complex technical concepts clearly for internal and external audiences.

You’ll Love This Role If You

Are passionate about AI and excited about working on state-of-the-art speech research.

Enjoy building from the ground up and love to create new systems from scratch.

Are obsessed with building and shipping practical solutions to real-world problems.

Are data-driven and prefer to solve problems using iterative experimentation.

Have strong communication skills and are able to translate complex concepts into simple terms, depending on the target audience.

It’s Important To Us That You Have

Prior industry experience in building deep learning models to solve audio problems.

Proven experience building models from a blank page and owning the entire deep learning stack.

Strong software engineering skills with particular emphasis on developing clean, modular code in Python and working with PyTorch.

Prior experience in designing and conducting experimental programs with the ability to rapidly iterate and change course as needed.

It Would Be Great if You Had

Deep understanding and experience working with state-of-the-art network architectures including transformers.

Experience building generative audio models for speech or music synthesis.

Understanding of different parallelism paradigms for efficient distributed training.

Up-to-date knowledge of recent techniques and developments in multiple voice AI problem domains (ASR, TTS, diarization, etc.).

Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.
