Learning Augmentation from Data: A Case Study in Scientific Diagnostic Simulations
POSTER
Abstract
Deep neural networks are a popular machine learning technique for analyzing and extracting useful information from data. However, neural networks are usually overparameterized and require large amounts of example training data to avoid overfitting and maximize generalizability. For image classification applications, data augmentation has shown great success as a means of increasing the effective size of datasets. However, automated data augmentation for scientific applications has not been thoroughly explored. Scientific datasets would especially benefit from data augmentation since simulations and experimental data can be prohibitively expensive. In this paper, we explore the idea of population-based augmentation (PBA) to learn on scientific datasets. PBA involves multiple neural networks, each with different data augmentation schedules and policies, that are trained simultaneously and evaluated at intervals to find the best augmentation policy. We provide experimental results from an automated analysis of proton beam imaging energy spectrometer data (PROBIES).
*This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under contract DE-AC52-07NA27344, and supported in part by LDRDs 20-ERD-048 21-ERD-015, and 21-ERD-026, DOE Early Career SCW1651, and DOE-SC SCW1720 and SCW1722.
Presenters
-
Sam A Jacobs
- Lawrence Livermore National Laboratory