Logo
IntelliBridge

Lead Data Scientist

IntelliBridge, Mc Lean, Virginia, us, 22107


Job Title:

Senior Data Engineer

Locations:

Remote

Clearance:

Eligible to obtain a Secret Clearance for upcoming work

Job SummaryIntelliBridge is seeking a Lead Data Scientist to lead a data science applied research team to develop novel methods of extracting insights and analysis of massive disparate datasets. As a lead data scientist, you will identify and source data to be consumed by our data lakehouse ELT processes and contribute to smart AI-enabled data pipelines. You will be responsible for using AI to mine, enrich, fuse, and visualize data to provide insight to our clients. Within this role, you'll actively collaborate with both technical and non-technical members of the data and development teams to define requirements and consistently deploy top-notch data products. Our primary goal is to deliver agile value to our stakeholders. We seek an individual with an insatiable curiosity about data, a genuine passion for comprehensively understanding datasets, and a keen attention to detail essential for ensuring accurate data comprehension.

Job ResponsibilitiesGuide and support the implementation of new data science solutionsDesign, architect, and support key datasets that provide structured and timely access to actionable business insights or decision-makingImplement, test, deploy, and maintain stable, secure, and scalable data mining, enrichment, fusion, and predictive AI/ML solutionsFine-tune and deploy Generative LLM AI models for specific tasksDevelop Retrieval Augmented Generation (RAG) services using open-source tools and models to provide AI assistant services that have access to internal data sourcesLead cross-functional teams to develop data-intensive software productsSupport all data staff in troubleshooting code issues, perform code reviews, and devise testing strategiesMonitor existing metrics, analyze data, and lead partnership with other Data and Analytics personnel to identify and implement system and process improvementsLead a team to develop processes that ingest multiple data sources, enrich data with AI/ML processes, and provide that data to other data consumersLead a team to maintain the infrastructure to support extraction, loading, and transformation (ELT) of data from a wide variety of data sourcesExpose AI/ML-enriched data via APIs, dashboards, and user applicationsUtilize DevOps Continuous Delivery best practicesConfigure and manage data analytic frameworks and pipelines using databases and toolsLead a team to design and manage custom data dashboards using Kibana and Power BI to display data insightsAdminister cloud computing and CI/CD pipelines to include Amazon Web Services (AWS)Contribute to MLOps processes including the deployment and integration of Generative AI LLMs

Position RequirementsMinimum of ten (10) years of Software or Data Science Experience or equivalentBachelor’s Degree in Computer Science, Information Technology, or a STEM fieldStrong understanding in data operations and data systemsProficient in Agile DevelopmentAbility to form strong cross-functional relationships and lead a project teamDemonstrated expertise in technical data science and engineering on complex applications, systems, software, and projectsSenior-level experience in analysis, design, development, testing, and implementation of applicationsExpertise in developing and maintaining data pipelines and databases for insights, analytics, and visualizationsDeep knowledge in machine learning and artificial intelligence for building enrichment data pipelines and AI backend servicesExcellent verbal and written communicationsGeneral knowledge of Generative Pre-Trained AI Large Language Models (GPT AI LLMs), as well as traditional machine learning approachesDevelop novel data mining processes using GenAI and LLMsLead teams through ML development, training, deployment, monitoring, and support lifecycles

Desired Skills And Abilities (salary Commensurate)Knowledgeable and experienced in:Data analysis and statisticsMachine LearningPythonGit and Git OperationsSQLAWSDockerTransformers and Natural Language Processing model fine-tuningExperience with machine learning processes, solutions, and applicationsExperience with data science algorithms such as boosted decision trees, logistic regression, and autoregressive integrated moving averagesExperience with all parts of the AI/ML lifecycle including training, deployment, and monitoringExperience in network analytics, knowledge graphs, and graph databasesExperience building data pipelines and working with data lakes or lakehouses using DatabricksAdvanced Degree (Master’s or PhD)

Additional Preferred Skills And AbilitiesExperience in multiple programming languagesExperience with python libraries including transformers, FastAPI, pydanticExperience with AWS services: S3, EC2, Athena, Glue, ECR, ECSExperience with Retrieval Augmented Generation (RAG) LLM systemsExperience working with and fine-tuning LLM AI modelsExperience working with CI/CD workflowsExperience with data augmentation techniques

About UsIntelliBridge delivers IT strategy, cloud, cybersecurity, application, data and analytics, enterprise IT, intelligence analysis, and mission operation support services to accelerate technical performance and efficiency for Defense, Civilian, and National Security & Federal Law Enforcement clients.

#J-18808-Ljbffr