IBM Computing
Foundation Models for Data Software Engineer Intern: 2025
IBM Computing, Yorktown Heights, New York, United States, 10598
IBM Foundation Models for Data Software Engineer Intern: 2025 in Yorktown Heights, New York
IntroductionAt IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, let's talk.Your Role and ResponsibilitiesThis is for a 2025 summer internship with the following start dates: May - August or June - September for quarter system schools.We are broadly interested in further improving the capabilities of foundation models (FMs) for a range of data management tasks such as data discovery, metadata enrichment, data access and retrieval with querying, and automated data-driven insights.We are looking for interns with research interest and expertise in implementation of interactive data workflows such as natural language to data insights, implementation of agentic solutions with design patterns like Think, Act, Observe, Reflect.Skills and tasks of interest include:LLM for code generation:
Implementation of pipelines that use foundational models for code generation specific to data tasks such as SQL for data retrieval.Agents and Reasoning:
Implementation of novel autonomous agentic systems to compete with Text-to-SQL on public leaderboards like BIRD and Spider 2.0.Knowledge Graphs, Multi-Modal FMs:
Implementation of techniques using combinations of foundational models, knowledge graphs, and multi-modal data for improving tasks such as data discovery and automated text-to-SQL.LLMs for DataOps:
Implementation of generative AI tooling for DataOps such as data integration and flows, similar to usage of LLMs for DevOps.Required Technical and Professional ExpertiseApplicants should be PhD & MS students pursuing graduate studies.Pursuing graduate studies in computer science and related fields.Strong programming skills in Python.Preferred Technical and Professional ExpertiseFamiliarity and working expertise with large language models (LLMs).Familiarity with knowledge graphs, RAG, agentic frameworks.Familiarity with SQL, Langgraph, Langchain.About IBMIBM's greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.Being You @ IBMIBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics.
#J-18808-Ljbffr
IntroductionAt IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, let's talk.Your Role and ResponsibilitiesThis is for a 2025 summer internship with the following start dates: May - August or June - September for quarter system schools.We are broadly interested in further improving the capabilities of foundation models (FMs) for a range of data management tasks such as data discovery, metadata enrichment, data access and retrieval with querying, and automated data-driven insights.We are looking for interns with research interest and expertise in implementation of interactive data workflows such as natural language to data insights, implementation of agentic solutions with design patterns like Think, Act, Observe, Reflect.Skills and tasks of interest include:LLM for code generation:
Implementation of pipelines that use foundational models for code generation specific to data tasks such as SQL for data retrieval.Agents and Reasoning:
Implementation of novel autonomous agentic systems to compete with Text-to-SQL on public leaderboards like BIRD and Spider 2.0.Knowledge Graphs, Multi-Modal FMs:
Implementation of techniques using combinations of foundational models, knowledge graphs, and multi-modal data for improving tasks such as data discovery and automated text-to-SQL.LLMs for DataOps:
Implementation of generative AI tooling for DataOps such as data integration and flows, similar to usage of LLMs for DevOps.Required Technical and Professional ExpertiseApplicants should be PhD & MS students pursuing graduate studies.Pursuing graduate studies in computer science and related fields.Strong programming skills in Python.Preferred Technical and Professional ExpertiseFamiliarity and working expertise with large language models (LLMs).Familiarity with knowledge graphs, RAG, agentic frameworks.Familiarity with SQL, Langgraph, Langchain.About IBMIBM's greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.Being You @ IBMIBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics.
#J-18808-Ljbffr