Loading…
Loading grant details…
| Funder | National Science Foundation (US) |
|---|---|
| Recipient Organization | Texas A&M University Corpus Christi |
| Country | United States |
| Start Date | Oct 01, 2025 |
| End Date | Sep 30, 2027 |
| Duration | 729 days |
| Number of Grantees | 1 |
| Roles | Principal Investigator |
| Data Source | National Science Foundation (US) |
| Grant ID | 2451516 |
Distributed software systems are collections of computer programs that utilize computational resources across multiple computing devices to achieve a common, shared goal. They are used everywhere these days, from banking and reservation systems to online gaming and social media. Finding and fixing bugs in distributed systems is difficult because bugs can spread across multiple computers and cloud environments, making traditional troubleshooting methods ineffective.
A particularly common and challenging to manage type of problems are concurrency bugs that occur when multiple software processes within a program try to access and modify shared data simultaneously. This project introduces an automated tool to help developers identify and fix concurrency issues in distributed systems. It uses innovative techniques to simulate real-world scenarios, uncovers hidden communication patterns, and detects common but difficult-to-find problems.
Outcomes from this research will be included in college-level computer science courses on distributed systems and software engineering.
Rust has gained popularity for distributed system development due to its memory safety feature. However, current methods to trigger concurrency bugs in Rust code require substantial human effort and expertise, and effective automated testing tools are lacking. This project addresses these challenges in three ways.
First, it will conduct an empirical study of real-world modern distributed systems implemented in Rust to identify current trending issues and their root causes. Second, it will develop a static distributed system dependency analysis and a deterministic simulation testing framework driven by system-level scheduling, enabling automated schedule generation to expose concurrency and other emerging bugs.
Third, it will provide an open-source testing platform for public access and future integration. The project will produce a benchmark suite of prevalent and emerging bugs in modern distributed systems to facilitate high-quality bug reproduction. It will also serve as a foundation for future research in improving system reliability and enhance the security of distributed software systems.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Texas A&M University Corpus Christi
Complete our application form to express your interest and we'll guide you through the process.
Apply for This Grant