Loading…
Loading grant details…
| Funder | National Science Foundation (US) |
|---|---|
| Recipient Organization | University of California-Santa Barbara |
| Country | United States |
| Start Date | Apr 01, 2025 |
| End Date | Mar 31, 2030 |
| Duration | 1,825 days |
| Number of Grantees | 1 |
| Roles | Principal Investigator |
| Data Source | National Science Foundation (US) |
| Grant ID | 2443777 |
Modern network applications require high performance and security, yet smaller community and enterprise networks often lack the infrastructure to meet these demands. Machine learning (ML) has shown promise in improving network management, but existing ML models often fail when deployed outside controlled lab environments because they learn patterns that do not generalize well to real-world settings.
This project develops a closed-loop ML pipeline that iteratively refines training data collection to improve model generalizability. By analyzing the decision-making process of ML models and identifying statistical biases in training data, the project hopes to ensure that ML-based network solutions are robust, effective, and suitable for deployment in diverse network environments.
Technically, the project introduces a new ML framework that continuously improves generalization by iteratively refining training datasets. The research efforts are divided into two main thrusts: (1) designing a programmable data-collection platform that enables flexible and scalable training data acquisition across different network environments, and (2) developing methodologies that use explainable ML techniques to detect and address underspecification issues in network models.
The closed-loop approach ensures that ML models adapt over multiple iterations, mitigating learning shortcuts and spurious correlations that degrade performance in real-world settings. This project has significant broader impacts by enabling the development of generalizable ML models for networking, lowering the barrier to collecting high-quality training data, and fostering trust in ML-based network management solutions.
By making it easier to curate datasets across diverse network environments, the project supports reproducible research and improves ML adoption in production networks, particularly for research and education (R&E) and last-mile community networks with constrained resources.
All project outcomes, including datasets, software, and model implementations, will be made publicly available at https://clml.cs.ucsb.edu/. The repository will be maintained for at least five years, with regular updates based on research progress and community contributions. It will provide documentation, reproducible experiments, and open-source implementations to facilitate further research and practical adoption.
This long-term commitment to transparency and accessibility ensures that the project’s contributions benefit researchers, network operators, and educators.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
University of California-Santa Barbara
Complete our application form to express your interest and we'll guide you through the process.
Apply for This Grant