Logo
SoundHound

Software Engineer Machine CurationSearch Data

SoundHound, Boulder, Colorado, United States, 80301


About the Role:

We believe that a cutting-edge “A+ solution” to organizing/structuring searchable data (e.g. data curation) is essential. It's difficult to get there via human curation because there's too much data. It's also difficult to get there with a purely machine-curated solution (we've learned that lesson first hand). Our solution at SoundHound has been to hybridize the two approaches. In this role, you'll be part of a small and innovative team designing automated algorithms smart enough to approximate a human curator, while also working closely with human curators on the team to achieve the best possible searchable data quality for our apps.

About You:

You are fascinated by the challenge of implementing machine-based solutions to organize/label data the way human curators would – on a scale that would require millions of human curators

You are a quick learner with sharp coding skills, enabling you to meet aggressive deadlines

You enjoy being part of a small team responsible for systems that directly power the search experience of millions of users every day

You pay attention to detail and take pride in implement highly maintainable code

Responsibilities:

Design and implement systems that manage a unique blend of searchable data (including audio, meta-data, and traditional text-based content)

Lots of coding, testing, and scripting in a linux environment

Regularly analyze the data curated by the system and identify areas for improvement

Requirements:

Proficient in C++ and one or more scripting languages

Strong grasp of algorithms and data structures

Experience manipulating and managing large amounts of data

Self-motivated and able to contribute in a fast-paced team environment

BS/MS in Computer Science or equivalent

Nice to haves:

Experience working with large catalogs of real-world / messy data

Familiarity with text search algorithms and related systems

Experience working with music-related data

Proficiency with databases/SQL