MD Anderson Cancer Center
Data Scientist Bioinformatics
MD Anderson Cancer Center, Houston, Texas, United States, 77246
The primary purpose of the Data Scientist is to leverage advances in next-generation sequencing to pioneer the discovery and development of groundbreaking therapeutics for cancer patients. This role revolves around innovating and refining sophisticated pipelines and methodologies for analyzing intricate genetic information at the single-cell level. By developing cutting-edge computational tools and algorithms, the Data Scientist will lead the charge in accelerating scientific breakthroughs. This integral contribution will drive the creation of novel therapies and diagnostics that can transform patient care and health outcomes.
Led by Prof. Bissan Al-Lazikani, Director of Therapeutics Data Science, the intelligent and ever-learning A3D3a platform is part of the new initiative in Therapeutics Data Science and part of our ambitious Institute for Data Science in Oncology at MD Anderson. A3D3a will accelerate the discovery and impact of novel therapies for cancer by enabling novel opportunities for optimized therapies for patients with a focus on rare and hard-to-treat cancers through the development of novel machine learning and AI technologies.
JOB SPECIFIC COMPETENCIES
Carry out preparation, clean-up, and quality control of biological data, including scRNA-Seq, scATAC-Seq, spatial transcriptomics and/or other multi-dimensional omic data modalities.
Develop and maintain pipelines for bioinformatics and statistical analyses of the aforementioned data types; activities include handling raw data, evaluating outputs, optimizing parameters and summarizing findings.
Keep abreast of advancements in single-cell sequencing and other new technologies and data analysis techniques. Stay engaged with the scientific community to identify emerging technologies and methodologies.
Actively collaborate with interdisciplinary teams to design experiments, understand data generation protocols and optimize analytical workflows accordingly.
Rigorously validate newly developed methods using benchmark datasets and simulated data to assess their accuracy, sensitivity, specificity, and scalability.
Present results at multidisciplinary project meetings.
Contribute to open-source projects and publish findings in peer-reviewed journals to share insights, methodologies, and tools with the wider scientific community.
Prepare written reports, manuscripts, and grant applications with investigators.
Work closely with the team and collaborators to discover novel therapeutic opportunities for cancer patients.
Expected Skills
Deep knowledge of bioinformatics tools and their implementation as part of pipelines, particularly for scRNA-Seq, scATAC-Seq, spatial transcriptomics and/or other multi-dimensional omic data modalities.
Advanced knowledge of statistical methods and data analysis techniques relevant to single-cell genomics, including differential expression analysis, clustering, dimensionality reduction, trajectory inference, and data integration.
Proficiency in machine learning techniques for analyzing high-dimensional single-cell data, such as supervised and unsupervised learning algorithms.
Experience addressing challenges in bioinformatics, such as bias, and applying mitigation strategies such as batch correction.
Experience using High Performance Computing to run large-scale analyses.
Strong programming skills in languages commonly used in bioinformatics and data science, such as Python and R. Ability to write efficient, modular, and maintainable code for data manipulation, analysis, and visualization.
Experience with code version control systems (e.g., GitLab and GitHub).
Other duties as assigned.
COMPETENCIES
With Inclusion, you understand that your ideas and contributions are valued. You promote the same for others. You address your own biases while promoting diversity and equity. (Competencies: Cultural Humility, Cultural Awareness, Cultural Intelligence)
With Drive, you see that you can serve as a leader whether you have a formal leadership role or not. You tackle problems, move past setbacks and hardships, and don't lose sight of your goals. (Competencies: Self-Confidence, Analytical Thinking, Innovative Thinking, Technical Expertise)
You demonstrate Professionalism by setting the example for others and consistently modeling MD Anderson's values and service standards. You communicate effectively in a variety of ways. (Competencies: Inspire Trust, Oral Communication, Written Communication)
Through Emotional Intelligence, you maintain awareness of your own emotions and the emotions of those around you. You use nonverbal cues and feelings to engage others in an inclusive and responsive way. (Competencies: Active Listening, Teaming, Self-Reflection)
Having Coachability means you are engaged in relentless learning. You constantly ask questions and stay curious. You understand that the organization constantly evolves, and you should as well. (Competencies: Develop Oneself, Adaptability)
Working Conditions
Laboratory environment
This position requires:
Working in Office Environment: Yes
Working in Patient Care Unit: No
Exposure to human/animal blood, body fluids, or tissues: No
Exposure to harmful chemicals: No
Exposure to radiation: No
Physical Demands
Indicate the time required to do each of the following physical demands:
Standing: Occasionally
Walking: Occasionally
Sitting: Frequently
Reaching: Occasionally
Lifting/Carrying: Up to 10 lbs: Occasionally
Lifting/Carrying: 10 lbs to 50 lbs: Occasionally
Lifting/Carrying: More than 50 lbs: Occasionally
Pushing/Pulling: Up to 10 lbs: Occasionally
Pushing/Pulling: 10 lbs to 50 lbs: Occasionally
Pushing/Pulling: More than 50 lbs: Occasionally
Use computer/keyboard: Frequently
EDUCATION:
Required: Bachelor's degree in Biomedical Engineering, Electrical Engineering, Computer Engineering, Physics, Applied Mathematics, Science, Engineering, Computer Science, Statistics, Computational Biology, or related field.
Preferred: PhD in Biomedical Engineering, Electrical Engineering, Computer Engineering, Physics, Applied Mathematics, Science, Engineering, Computer Science, Statistics, Computational Biology, or related field.
EXPERIENCE:
Required: Three years of experience in scientific software or industry development/analysis. With a Master's degree, one year of experience required. With a PhD, no experience required.
Preferred: Single cell sequencing, next generation sequencing, publications.
It is the policy of The University of Texas MD Anderson Cancer Center to provide equal employment opportunity without regard to race, color, religion, age, national origin, sex, gender, sexual orientation, gender identity/expression, disability, protected veteran status, genetic information, or any other basis protected by institutional policy or by federal, state or local laws unless such distinction is required by law.
Additional Information
Requisition ID: 167724
Employment Status: Full-Time
Employee Status: Regular
Work Week: Days
Minimum Salary: US Dollar (USD) 103,000
Midpoint Salary: US Dollar (USD) 129,000
Maximum Salary: US Dollar (USD) 155,000
FLSA: exempt and not eligible for overtime pay
Fund Type: Soft
Work Location: Hybrid Onsite/Remote
Pivotal Position: Yes
Referral Bonus Available?: Yes
Relocation Assistance Available?: Yes
Science Jobs: Yes