Loading…

Loading grant details…

Active COOPERATIVE AGREEMENT National Science Foundation (US)

Category II: Democratizing the Accelerator Ecosystem for Science and Discovery

$70M USD

Funder National Science Foundation (US)
Recipient Organization University of California-San Diego
Country United States
Start Date Jul 01, 2024
End Date Jun 30, 2029
Duration 1,825 days
Number of Grantees 5
Roles Principal Investigator; Co-Principal Investigator
Data Source National Science Foundation (US)
Grant ID 2404323
Grant Description

Accelerated computing has become an essential capability for advancing science and engineering. The growth of artificial intelligence (AI) and machine learning, and the performance benefits afforded by graphics processing units (GPUs) are driving researchers across nearly every domain to adopt GPUs. The National Science Foundation has made substantial investments in providing the community with GPU resources, expertise, training, and other programs in support of this transition.

The innovative Cosmos system at the San Diego Supercomputer Center (SDSC) features AMD’s MI300A accelerated processing unit (APU), which contains both a CPU and a GPU accelerator in a single chip together with high-bandwidth, unified memory. The unified memory facilitates an incremental programming approach, lowering the barrier to the adoption of GPUs by many communities, easing the process of porting and optimizing applications.

Cosmos enables researchers to exploit this innovative and powerful accelerator technology in an open software environment to expand the range of applications that can effectively use accelerators. The benefits of accelerating the applications described in the proposal will aid discoveries in materials science, genomics, astrophysics, large language models, artificial intelligence, and many other domains.

Cosmos nodes contain AMD MI300A APUs, each with high-bandwidth, unified memory, integrated into 4-socket nodes with all-to-all connectivity using AMD’s high-speed interconnect, which provides a socket-to-socket global memory interface. The system architecture is based on HPE’s EX2500, which provides a dense, energy-efficient, liquid-cooled system.

A high-performance, flash-based storage system provides the high IOPS and bandwidth needed for the anticipated mixed-application workload. The system can be cross-mounted to other SDSC systems to facilitate data sharing, software development, and benchmarking. Capacity storage is provided via a Ceph filesystem.

The project is structured as a three-year testbed phase, followed by a two-year allocations phase. During the testbed phase Cosmos project staff will collaborate with research teams covering several exemplar science and engineering applications including those from astronomy, neuroscience, molecular biology, structural engineering, machine learning and others.

Included are applications that have yet to be ported and those that can already run on GPUs but would benefit from the flexible and open architecture of the APU and its software ecosystem. Collaborations specifically target community codes, science gateways, and enabling middleware, where success in porting a single application brings along many users and institutions.

Integration with the Open Science Grid aims to further extend the benefits of the APU to thousands of users in the high-throughput computing community. Lessons learned and best practices developed from the research collaborations will be shared with the wider user community through project workshops, user training events, and participation in the AMD User Forum.

The allocations phase will incorporate lessons learned from the testbed phase regarding application porting to the APU, leading to software development resources, training materials, and publications that allow others to migrate their applications to realize the benefits of accelerated computing. During the allocations phase, Cosmos will be available to researchers through an NSF-approved allocation process.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

All Grantees

University of California-San Diego

Advertisement
Discover thousands of grant opportunities
Advertisement
Browse Grants on GrantFunds
Interested in applying for this grant?

Complete our application form to express your interest and we'll guide you through the process.

Apply for This Grant