Sarepta Therapeutics

Contract Data Scientist

Sarepta Therapeutics, Cambridge, Massachusetts, US, 02140


The data scientist role at Sarepta is an exciting opportunity for someone with strong data science and visualization skills who wants to see their work directly impact patient care. In this hands-on role, the data scientist will support data cleaning, data processing, and the building of R Shiny dashboards to visualize internal and external biological data and perform interactive analytics. You will get exposure to the entire drug development process and collaborate with other data scientists, computational biologists, and scientists in the organization. It is a fantastic opportunity to work at the forefront of precision genetic medicine, impacting the development of transformative therapies that may change patients’ lives.

Primary Responsibilities Include

Support the development of databases and organize datasets, and make them easily accessible to support R&D programs
Clean, restructure, and enrich new datasets and perform exploratory analysis to determine data quality and usability
Collaborate cross-functionally to manage data warehousing and cloud infrastructure, and ensure compliance where necessary
Develop interactive applications and visual dashboards using R Shiny
Troubleshoot system environment, software, and workflow problems related to the deployment and operations of database and analytic pipelines
Create and maintain robust documentation of datasets, data workflow, and data dashboard
Adhere to data governance policies, regulatory requirements, licensing policies, and other industry best practices for data handling, coding, and application development.
Maintain and develop a well-documented codebase and documentation for all developments relating to data dashboards
Perform additional related tasks as assigned.

Desired Education And Skills

BS/MS in a STEM discipline with prior data science experience. BS with 5+ years of hands-on experience in the biotech/pharmaceutical industry or MS with 3+ years of professional experience. A background in RNA therapeutics, gene editing, or gene therapy is preferred.
Advanced programming/scripting abilities in R are required. The ability to code in other languages is a plus.
Expertise in building data dashboards with R Shiny is required. Experience with Power BI and Tableau would be a plus.
Experience in data wrangling and data visualization with R packages such as dplyr, data.table, ggplot2, and plotly
Experience working with cloud computing services, database systems, workflow management systems, and development tools in a production environment: AWS, Docker, Nextflow, GitLab, database technologies, and API standards
Proficiency in designing relational and non-relational schemas and queries to capture and represent multi-type data. Hands-on experience using SQL is desired (MySQL, Postgres, DuckDB, etc.).
Experience developing and maintaining data warehouses using Snowflake
Experience in programmatically interfacing with databases via APIs, FTP, or remote servers.
Basic understanding of the Linux shell (bash, ssh, file management, dependency management, etc.)
Ability to communicate effectively and work with a multidisciplinary team, including chemists, biologists, geneticists, data scientists, and clinical scientists, to complete scientific projects.
Background knowledge in RNA therapeutics, gene editing, and gene therapy technologies is a strong plus.
Experience with FAIR and tidy data principles is preferred.
Strong team player, excellent communicator, and continuous learner
