Global Commerce and Information, Inc.
Data Scientist
Global Commerce and Information, Inc., Gwynn Oak, Maryland, United States, 21207
Your Success is Our Success.
Global CI is an award-winning 30-year IT Services company founded on the principles of providing high-quality, value-added technology consulting services. Our vision is to create a better future by improving the lives of the people we serve through emerging technologies. Join us and together we will advance the future of technology services.
Global CI offers competitive compensation and non-salary benefits to all eligible employees.
Job Description
Role Description:
Formulate, design, and deliver AI/Client-based decision-making frameworks and models for business outcomes. Measure and justify the value of AI/Client-based solutions.
We are seeking a skilled Data Scientist with deep expertise in developing, fine-tuning, and integrating AI models, particularly in natural language processing (NLP). This role will focus heavily on analyzing unstructured medical records, developing AI models for extracting insights, and incorporating human-in-the-loop feedback to improve model performance. You will work closely with software engineers and other stakeholders to ensure that AI solutions are effectively integrated into the overall system architecture.
Required Qualifications & Experience:
• 5+ years of experience in AI/Client development with a strong focus on NLP using frameworks such as TensorFlow, PyTorch, and Hugging Face
• Expertise in Python, with experience in libraries like Transformers, NLTK, SpaCy, Gensim, and data manipulation tools such as Pandas and NumPy
• Experience working with human-in-the-loop systems, integrating clinician feedback to refine AI models
• Ability to effectively articulate technical challenges and solutions
• Strong communicator with excellent written and verbal communication skills
• Knowledge of Agile development methodologies
• Identify and analyze user requirements to generate stories and tasks for team backlog
• Prioritize and execute tasks throughout the software development life cycle
• Create custom NLP algorithms and annotators to evaluate medical record data
• Create custom tools to enable analysts to perform data research
• Solid understanding of statistical modeling, data analysis, and performance evaluation metrics
• Demonstrated experience analyzing and processing unstructured clinical data (e.g., electronic health records, physician notes, imaging reports), using techniques such as tokenization, lemmatization, and text representations (e.g., TF-IDF, BERT embeddings)
• Familiarity with healthcare data formats and standards such as HL7, FHIR, ICD codes, and SNOMED
• Experience with cloud platforms (AWS, Azure), containerization (Docker), and using CI/CD pipelines for machine learning model deployment
• Knowledge of SQL (PostgreSQL, MySQL) and NoSQL (MongoDB, Elasticsearch) databases, and how to structure data pipelines for efficient data processing
• Develop and fine-tune AI models for natural language processing (NLP) tasks, including Named Entity Recognition (NER), text classification, and sentiment analysis, particularly with unstructured clinical records
• Conduct experiments to evaluate model performance, utilizing metrics such as precision, recall, and F1-score to iteratively improve models through hyperparameter tuning and training optimizations
• Analyze and preprocess large datasets, particularly unstructured medical records (e.g., physician notes, discharge summaries), using tools like Pandas, NLTK, and SpaCy
• Master's degree (Data Science, AI, Computer Science, or a related field) + 10 years of experience; or PhD + 4 years
Preferred Qualifications:
• Experience in healthcare, particularly working with unstructured medical records in clinical settings, leveraging NLP models for insight extraction
• Experience working with human-in-the-loop systems, incorporating clinician/end-user feedback and leveraging tools like SciPy and NumPy to improve AI model accuracy
• Educational background or practical training in a clinical setting, with exposure to clinical workflows and medical terminologies
• Familiarity with deep learning techniques, attention mechanisms, and transformers applied to healthcare data
Benefits include:
• Comprehensive medical, dental, vision, life, and short & long-term disability insurance + health savings account
• Matching 401k retirement plan + IRAs and Roth IRAs
• Generous paid time off and paid holidays
• Employee recruitment/referral bonus
• Paid community service hours
• Tuition reimbursement
• Employee discounts
At Global Commerce & Information, Inc. we celebrate, support, and are committed to creating a diverse and inclusive environment. We're proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, veteran status, or any other legally protected characteristics.
Global Commerce & Information, Inc. maintains a drug-free workplace.