Society of Exploration Geophysicists

Sr Data Scientist

Society of Exploration Geophysicists, South San Francisco, California, US, 94083


Job description:

We are looking for a Sr Data Scientist with experience in Machine Learning Engineering to join the Roche Product Development Digital Strategy & Enablement team (PD-DSE). In the DSE, we focus on delivering technology that evolves the practice of medicine and helps patients live longer, better lives.

We are a diverse team of open and friendly people, enthusiastic about technological novelties and optimal enterprise solutions. We share knowledge and experience, and we appreciate different points of view.

As a Senior Data Scientist, you will work closely with multi-disciplinary teams to design, develop and deploy structured, high-quality data solutions, in particular Large Language Model (LLM) applications. These solutions will be leveraged across the PD organization to help our teams fulfill our mission: to do now what patients need next.

Key Accountabilities:

Partner with fellow Data Scientists, ML engineers, MLOps / DevOps engineers and cross-functional teams to solve complex problems and create unique solutions using modern NLP technologies, in particular LLMs.
Build data pipelines and deployment pipelines for ML models.
Develop ML models according to business and functional requirements.
Help deploy various models and tune them for better performance.
Document and communicate the design and implementation details.
Contribute to the DSE AI team on technical decisions.
Collaborate with clients and informatics departments to deploy scalable and easy-to-maintain solutions.
Serve as a technical point of contact for enterprise-wide technology solutions.
Lead complex troubleshooting efforts and root cause analysis.

Qualifications:

Experience with LLM application development, including tool use and reasoning, for instance RAG solutions and code interpreters.
Experience with LLM fine-tuning is a big plus.
Experience in building data pipelines and deployment pipelines for LLM applications.
Recent experience with ML/AI toolkits such as AWS SageMaker (other toolkits like PyTorch, TensorFlow, Keras, MXNet, H2O, etc. are nice to have).
Experience with MLOps technologies (SageMaker, Vertex AI, Kubeflow).
Experience with cloud solutions (AWS / Azure / GCP) and Docker.
Proven scripting and automation skills.
Good knowledge of: Git, Bash, Linux, CI/CD tools (e.g., Jenkins, GitLab CI), software lifecycle, RDB, visualization tools (e.g., Tableau), Jira, Confluence.
Programming languages: Python, R.
Test-driven development and good coding practices.
Problem-solving and decision-making skills.
Good interpersonal skills.
Customer and delivery focus.
Ability to work effectively with team members and virtual teams from different locations and different cultural backgrounds.
Experience with deployment of scalable apps is a plus.
Experience with clinical study data is a plus.

Education / Years of Experience:

Master's degree in a quantitative field (e.g., mathematics, statistics, computer science, EE, etc.), and/or a Life Sciences degree with significant computational experience, or equivalent, with 5+ years of working experience in Data Science. PhD is a plus.
2+ years of commercial Data Engineering / ML Engineering / MLOps / UI/UX engineering experience.
3+ years of commercial software engineering experience.

Notes from the hiring manager:

TOP THREE MUST-HAVE QUALIFICATIONS:
Recent LLM application development experience, in particular RAG applications.
Strong general software development skills.
Good collaborator in a diverse team.

Targeting level II (3-5 years of experience) senior-level Data Scientist / ML engineer.
