RTI International
Data Scientist, Innovation Programs and Policy
RTI International, Durham, North Carolina, United States, 27703
Overview
RTI International has an opening for a data scientist/modeler in the innovation economics and policy practice within the Center for Applied Economics and Strategy (CAES). Our team in CAES draws on a range of quantitative and qualitative skill sets to produce high-quality research for government agencies, foundations, and non-profit organizations. Our work requires implementing the best available theory and methods in assessing research, development and innovation programs and their economic outcomes to report cogent and actionable results to our clients.
The successful candidate will work with a team of economists and policy analysts to apply machine learning and data science approaches to assess research, development and innovation policies and programs, and technological innovations arising from different sectors of the research and development ecosystem, including biomedicine. They will do this by building new models or using existing standard models and methods. The successful candidate will have training in data science, machine learning, economic modeling or related disciplines, and experience applying modeling best practices in the maintenance and enhancement of core datasets and identifying fit-for-purpose technical approaches in partnership with senior staff and project teams. The role will execute quantitative analyses and clearly articulate results to a wide range of audiences. We seek a candidate with a clear interest in R&D and innovation programs, a versatile technical skill set, demonstrated R and/or Python programming experience, and strong writing skills.
Responsibilities
The successful candidate will be expected to contribute to the following task areas:
* Conduct literature reviews to identify best available data and/or best practices for data processing and analysis.
* Design, execute, and communicate data analysis and research.
* Work with modern open-source programming languages used in data science, such as Python, SQL, and R.
* Lead the maintenance of datasets core to our analyses in replicable code and version-controlled repositories (e.g., via GitHub).
* Participate in and/or lead projects, tasks, and staff in a wide variety of technical activities: data pipeline development and orchestration, data wrangling/munging, data infrastructure, ETL (Extract, Transform, Load) processes, exploratory data analyses (EDA), DataOps, AI and machine learning, deep learning, natural language processing (NLP), microsimulation modeling, MLOps, generative AI and Large Language Models (LLMs), computer vision, automation, cloud computing, social media analytics, privacy analytics, rapid prototyping, data visualization, and user-centered design.
* Develop ETL pipelines and conduct analyses using a wide variety of sources (e.g., relational databases, text and unstructured files, sensor data, image and video data, streaming data).
* Contribute to the development of research, development and innovation economic models, incorporate underlying datasets, and understand data availability and limitations.
* Develop post-processing routines for analyzing, reporting, and visualizing model outputs.
* Visualize data in clear and compelling graphics, including dynamic implementations (e.g., in Tableau or R Markdown).
* Maintain currency with key issues and concepts in innovation topic areas, including research policies and programs, R&D workforce policy, research assessment, research commercialization, and technology transfer.
* Work with a wide variety of data content and formats, including grant award data, research output data (such as patents and publications), economic data including industry, household, and national accounts information, global financial data, and government statistical datasets for R&D and innovation.
* Collaborate effectively with project team members and with external scientists.
* Manage workflow in a timely, realistic, and cost-effective manner to meet client expectations.
* Contribute to grant, cooperative agreement, and contract proposals.
* Present research methods and findings via technical reports, journal articles, and presentations.
Qualifications
Qualified applicants should have the following:
* Master's degree and three years of relevant work experience, or a Bachelor's degree and five years of relevant work experience.
* Demonstrated experience collecting, processing, managing and analyzing data, including experience with and knowledge of best coding practices for one or more of the following: R, Python, Julia, or related programs.
* Demonstrated proficiency with advanced analytics and data science techniques, such as AI and machine learning, generative AI and Large Language Models (LLMs), deep learning, natural language processing (NLP), computer vision, optimization, and simulation modeling (e.g., agent-based models, microsimulation models).
* Demonstrated interest in innovation, research & development programs, or science & technology policy issues.
* Excellent verbal and written communication skills, including the ability to communicate complex issues to a wide range of audiences, and a track record of publications and/or quantitative analytical outputs. Candidates selected to interview for the position will be required to provide writing samples, preferably with a focus related to one of the topic areas above.
Preferred
* Demonstrated experience analyzing grant funding data sets and grant funding outcomes.
* Demonstrated experience with economic analysis of private R&D initiatives.
* Demonstrated experience with Named-Entity Recognition (NER), Optical Character Recognition (OCR, e.g., Tesseract), and transformer models.
* Demonstrated experience developing and programming machine learning or other types of models for predictive analysis.
* Experience working on topics related to government support for research & development, biomedical research & development, and science & technology policy and workforce.
EEO & Pay Equity Statements
For San Francisco, CA USA Job Postings Only: Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records. Further information is available here.
RTI accepts applications to our job openings from candidates with criminal histories or conviction records in accordance with all applicable laws, including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act.
The anticipated pay range for this role is listed below. Our pay ranges represent national averages and may vary by location, as a geographic differential may be applied to some locations within the United States. RTI considers multiple factors when making an offer, including, for example, the established salary range, internal budget, business needs, and the applicant's education and years of work experience. Further, salary is merely one element of our offer.
At RTI, we demonstrate our commitment to rewarding individual and team achievement through a total rewards package. This package includes (among other things) a competitive base salary, a generous paid time off policy, merit-based annual increases, bonus opportunities, and a robust recognition program. Other benefits include a competitive range of insurance plans (including health, dental, life, and short-term and long-term disability), access to a retirement savings program such as a 401(k) plan, paid parental leave for all parents, financial assistance with adoption expenses or infertility treatments, financial reimbursement for education and developmental opportunities, an employee assistance program, and numerous other offerings to support a healthy work-life balance.
Equal Pay Act Minimum/Range
$115,000-$143,000