Loading…
Loading grant details…
| Funder | National Science Foundation (US) |
|---|---|
| Recipient Organization | University of California-San Diego |
| Country | United States |
| Start Date | Jan 15, 2025 |
| End Date | Dec 31, 2026 |
| Duration | 715 days |
| Number of Grantees | 1 |
| Roles | Principal Investigator |
| Data Source | National Science Foundation (US) |
| Grant ID | 2440196 |
The goal of this project is to empower Artificial Intelligence (AI) researchers to more easily search, discover, and use AI-ready data sets. This will potentially streamline and democratize AI research. The project will investigate and develop data discovery services using innovative techniques that are themselves based on AI methods and can extract data set information from scientific papers.
The resulting discovery services will be integrated into the National Data Platform Pilot (NSF award #2333609), providing scientists and students with an end-to-end research environment that connects them to national computing and storage resources. The project will conduct outreach and training efforts that will engage both scientists and students, particularly those at minority-serving institutions, who will help evaluate the technology.
This project advances data search and discovery capabilities by using AI techniques to automatically extract and store data citation information, which must frequently be inferred, from research publications. This capability will help scientists and students, particularly those new to AI research, to identify AI-ready data sets that are relevant to their research from related publications.
This removes startup impediments to creating new AI pipelines. Integrating these search and discovery services into the National Data Platform Pilot will enable users to more seamlessly conduct AI research on national-scale research resources that can scale beyond their personal computing and storage. The project uses AI-ready datasets from the National Artificial Intelligence Research Resource (NAIRR) to demonstrate and evaluate the effectiveness of the service.
It also develops a generalized approach to support the integration of additional AI-ready NAIRR datasets and open corpora. The project democratizes the discovery and use of data in support of AI and other research through outreach and community engagement activities, including integration with hands-on workshops and hackathons within the National Data Platform Pilot.
It also supports evaluating and reporting on the use and value of data by automatically producing usage statistics.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
University of California-San Diego
Complete our application form to express your interest and we'll guide you through the process.
Apply for This Grant