NSF HDR ML Anomaly Detection Challenge
ORAL
Abstract
The National Science Foundation's (NSF) initiative, Harnessing the Data Revolution (HDR) Big Idea, is a nationwide effort to facilitate innovative modes of data-driven discovery addressing fundamental questions at the frontiers of science and engineering. To raise interest in critical issues within the HDR community, we aim to develop a cross-domain ML challenge for anomaly detection across the five science-focused HDR institutes. This interdisciplinary challenge seeks to provide participants with diverse datasets spanning various scientific domains covering climate science, phylogenetics, materials science, and the identification of astrophysical anomalies from LIGO. The participants should design a novel foundation model for anomaly detection. The algorithm should be able to find anomalies across many different datasets with one metric. We aim to utilize the open-source benchmark ecosystem Codabench to host the challenge and ensure a Findable, Accessible, Interoperable, and Reusable (FAIR) dataset and workflow for the participants from other communities to contribute. The intention is to facilitate contributions from participants across different communities, promoting collaboration and advancing the broader goal of data-driven discovery.
–
Presenters
-
Yuan-Tang Chou
University of Washington
Authors
-
Yuan-Tang Chou
University of Washington
-
Elham E Khoda
University of Washington
-
Shih-Chieh Hsu
University of Washington
-
Philip C Harris
Massachusetts Institute of Technology
-
Maria Dadarlat
Purdue University
-
Seungbin Park
Purdue University
-
Ekaterina Govorkova
Massachusetts Institute of Technology