Logo
Kuraray America, Inc.

Data Scientist

Kuraray America, Inc., San Francisco, California, United States, 94199


Company

At Directly, we believe the fundamental nature of work is changing from fixed to fluid, as AI and the gig economy transform how people and companies work in the next economy. We’re leading this shift with an enterprise platform that changes the way companies deliver customer service. Our platform helps companies like Airbnb, Autodesk, LinkedIn, Microsoft, SAP, and Samsung look beyond the traditional contact center and deliver better answers to customers in the moments that matter. We’re based in San Francisco and backed by top firms including Microsoft Ventures, True Ventures, Costanoa Ventures, and Northgate. www.directly.com

Role

We are looking for new members of our Data Science team to help build augmented intelligence systems that allow humans and machines to work together to redefine customer support. Our work spans many disciplines, such as information retrieval, natural language processing, and machine learning. Directly was granted several patents on our innovative technology, and we are constantly expanding the science and inventing new processes for optimizing the benefits of our platform.

Core Responsibilities

Conduct ad-hoc analysis on the data set and present patterns and trends to the team.

Extract, combine, clean, pre-process, and analyze a corpus of questions and answers from internal and external data sets.

Perform feature extraction for the data set and create a training set for a classifier model.

Optimize hyperparameters of a classifier from an existing machine learning library.

Analyze performance of a data product and come up with suggestions for scaling it.

Suggest enhancements for a data product and articulate them from a systems approach.

Find potential opportunities for building new capabilities into the product as well as develop prototypes of new internal tools to make our operations more efficient.

Help the operations team to improve the service we provide to our customers.

Collaborate with Engineering to ship new capabilities of data products to production.

Work with Data Engineering and DevOps teams to improve data pipelines and analytics.

Investigate if an algorithm from a scientific paper can be applied to our domain.

Implement a new machine learning algorithm in TensorFlow and evaluate its efficacy.

Work on a patent application and/or conference submissions.

Requirements

Academic background in a technical/quantitative field (graduate degree a plus) or equivalent experience.

Working knowledge of statistics as pertains to machine learning, such as distributions, statistical testing, regression, etc.

Proficiency in using SQL with several major DBMS and DW engines.

Experience with a variety of Big Data technologies, distributed machine learning and computing frameworks (S3, Spark, Hadoop, Elasticsearch, TensorFlow, etc.).

Good scripting and programming skills in Python and UNIX shell.

Data manipulation skills - extract data from relational and non-relational databases, files of multiple formats, clean it, join it, slice/dice it, organize it, analyze it, and explain it.

Experience in the Python data science ecosystem: Pandas, NumPy, SciPy, scikit-learn, NLTK, Gensim, etc. You should be able to hit the ground running with these tools fast.

Experience with partitioning and clustering techniques (K-Means, DBSCAN, etc.).

Experience with text mining, parsing, and classification using state-of-the-art techniques.

Experience with information retrieval, Natural Language Processing, Natural Language Understanding, Neural Language Modeling, and Chat and Dialog Modeling technologies.

Strong background in machine learning (unsupervised and supervised techniques). In particular, excellent understanding of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, logistic regression, MLPs, RNNs, etc.

Ability to evaluate quality of ML models and to define the right performance metrics for models in accordance with the requirements of the business.

Quantitative, metrics-driven, data-oriented personality.

Directly is an equal opportunity employer. We believe in diversity and inclusion as a strength.

#J-18808-Ljbffr