Generalization error in the spherical perceptron
ORAL
Abstract
The spherical perceptron is a simple linear classifier that can be mapped onto the sphere packing problem. Its analytical solvability has led to extensive research, including precise phase boundary calculations and detailed analyses of the solution space. However, the aspect of generalization error has been largely unexplored due to the absence of a method for incorporating generalizable features. In our research, we quantify the generalization error by directly modeling the data's noise statistics and linking them to the error. Our findings reveal the presence of the double descent phenomenon, which has received much attention lately. Notably, we find that this phenomenon only occurs when the system experiences a jamming transition that maintains replica symmetry. We present numerical and theoretical evidence supporting this observation.
* This research has been supported by the POSCO Science Fellowship of POSCO TJ Park Foundation.
–
Presenters
-
Gilhan Kim
Seoul Natl Univ
Authors
-
Gilhan Kim
Seoul Natl Univ
-
Yongjoo Baek
Seoul National University
-
Hyungjoon Soh
Seoul Natl Univ