Active CONTINUING GRANT National Science Foundation (US)

CAREER: Developing Generalizable ML Models for Diverse Learning Problems in Network Operations

$1.24M USD

Funder	National Science Foundation (US)
Recipient Organization	University of California-Santa Barbara
Country	United States
Start Date	Apr 01, 2025
End Date	Mar 31, 2030
Duration	1,825 days
Number of Grantees	1
Roles	Principal Investigator
Data Source	National Science Foundation (US)
Grant ID	`2443777`

Grant Description

Modern network applications require high performance and security, yet smaller community and enterprise networks often lack the infrastructure to meet these demands. Machine learning (ML) has shown promise in improving network management, but existing ML models often fail when deployed outside controlled lab environments because they learn patterns that do not generalize well to real-world settings.

This project develops a closed-loop ML pipeline that iteratively refines training data collection to improve model generalizability. By analyzing the decision-making process of ML models and identifying statistical biases in training data, the project hopes to ensure that ML-based network solutions are robust, effective, and suitable for deployment in diverse network environments.

Technically, the project introduces a new ML framework that continuously improves generalization by iteratively refining training datasets. The research efforts are divided into two main thrusts: (1) designing a programmable data-collection platform that enables flexible and scalable training data acquisition across different network environments, and (2) developing methodologies that use explainable ML techniques to detect and address underspecification issues in network models.

The closed-loop approach ensures that ML models adapt over multiple iterations, mitigating learning shortcuts and spurious correlations that degrade performance in real-world settings. This project has significant broader impacts by enabling the development of generalizable ML models for networking, lowering the barrier to collecting high-quality training data, and fostering trust in ML-based network management solutions.

By making it easier to curate datasets across diverse network environments, the project supports reproducible research and improves ML adoption in production networks, particularly for research and education (R&E) and last-mile community networks with constrained resources.

All project outcomes, including datasets, software, and model implementations, will be made publicly available at https://clml.cs.ucsb.edu/. The repository will be maintained for at least five years, with regular updates based on research progress and community contributions. It will provide documentation, reproducible experiments, and open-source implementations to facilitate further research and practical adoption.

This long-term commitment to transparency and accessibility ensures that the project’s contributions benefit researchers, network operators, and educators.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

All Grantees

University of California-Santa Barbara

Interested in applying for this grant?

Complete our application form to express your interest and we'll guide you through the process.

Apply for This Grant

CAREER: Developing Generalizable ML Models for Diverse Learning Problems in Network Operations

Grant Description

All Grantees

Interested in applying for this grant?

Quick Summary

Related Grants