SoundHound
Software Engineer Machine CurationSearch Data
SoundHound, Boulder, Colorado, United States, 80301
About the Role:
We believe that a cutting-edge “A+ solution” to organizing/structuring searchable data (e.g. data curation) is essential. It's difficult to get there via human curation because there's too much data. It's also difficult to get there with a purely machine-curated solution (we've learned that lesson first hand). Our solution at SoundHound has been to hybridize the two approaches. In this role, you'll be part of a small and innovative team designing automated algorithms smart enough to approximate a human curator, while also working closely with human curators on the team to achieve the best possible searchable data quality for our apps.
About You:
You are fascinated by the challenge of implementing machine-based solutions to organize/label data the way human curators would – on a scale that would require millions of human curators
You are a quick learner with sharp coding skills, enabling you to meet aggressive deadlines
You enjoy being part of a small team responsible for systems that directly power the search experience of millions of users every day
You pay attention to detail and take pride in implement highly maintainable code
Responsibilities:
Design and implement systems that manage a unique blend of searchable data (including audio, meta-data, and traditional text-based content)
Lots of coding, testing, and scripting in a linux environment
Regularly analyze the data curated by the system and identify areas for improvement
Requirements:
Proficient in C++ and one or more scripting languages
Strong grasp of algorithms and data structures
Experience manipulating and managing large amounts of data
Self-motivated and able to contribute in a fast-paced team environment
BS/MS in Computer Science or equivalent
Nice to haves:
Experience working with large catalogs of real-world / messy data
Familiarity with text search algorithms and related systems
Experience working with music-related data
Proficiency with databases/SQL
We believe that a cutting-edge “A+ solution” to organizing/structuring searchable data (e.g. data curation) is essential. It's difficult to get there via human curation because there's too much data. It's also difficult to get there with a purely machine-curated solution (we've learned that lesson first hand). Our solution at SoundHound has been to hybridize the two approaches. In this role, you'll be part of a small and innovative team designing automated algorithms smart enough to approximate a human curator, while also working closely with human curators on the team to achieve the best possible searchable data quality for our apps.
About You:
You are fascinated by the challenge of implementing machine-based solutions to organize/label data the way human curators would – on a scale that would require millions of human curators
You are a quick learner with sharp coding skills, enabling you to meet aggressive deadlines
You enjoy being part of a small team responsible for systems that directly power the search experience of millions of users every day
You pay attention to detail and take pride in implement highly maintainable code
Responsibilities:
Design and implement systems that manage a unique blend of searchable data (including audio, meta-data, and traditional text-based content)
Lots of coding, testing, and scripting in a linux environment
Regularly analyze the data curated by the system and identify areas for improvement
Requirements:
Proficient in C++ and one or more scripting languages
Strong grasp of algorithms and data structures
Experience manipulating and managing large amounts of data
Self-motivated and able to contribute in a fast-paced team environment
BS/MS in Computer Science or equivalent
Nice to haves:
Experience working with large catalogs of real-world / messy data
Familiarity with text search algorithms and related systems
Experience working with music-related data
Proficiency with databases/SQL