Machine Learning Meets Statistical Physics II
FOCUS · MAR-L69 · ID: 3096900
Presentations
-
Variational gradient descent: enhancing generalization with automatically learned landscape-dependent noise.
ORAL · Invited
–
Presenters
-
David Hathcock
- IBM Thomas J. Watson Research Center
Authors
-
David Hathcock
- IBM Thomas J. Watson Research Center
-
Yuhai Tu
- IBM Thomas J. Watson Research Center
-
-
Top-Down approach to dynamical coarse-graining using Differentiable Generalized Langevin Equation
ORAL
–
Publication: Jeong, Jinu, Ishan Nadkarni, and Narayana Aluru. "DiffGLE: Differentiable Coarse-Grained Dynamics using Generalized Langevin Equation." arXiv preprint arXiv:2410.08424 (2024).
Presenters
-
Ishan Mangesh Nadkarni
- The University of Texas at Austin
Authors
-
Ishan Mangesh Nadkarni
- The University of Texas at Austin
-
Jinu Jeong
- University of Illinois at Urbana−Champaign, Urbana
- The University of Illinois at Urbana-Champaign
-
Narayana R Aluru
- The University of Texas at Austin
- University of Texas at Austin
-
-
The Manifold Packing Loss Function: A Physics-Inspired Approach to Contrastive Self-Supervised Learning
ORAL
–
Presenters
-
Guanming Zhang
- New York University (NYU)
Authors
-
Guanming Zhang
- New York University (NYU)
-
David J Heeger
- New York University
-
Stefano Martiniani
- New York University (NYU)
-
-
Mutual Information Can Be Estimated when Undersampled Data Have Low-Dimensional Latent Structure
ORAL
–
Presenters
-
Eslam Abdelaleem
- Georgia Institute of Technology
Authors
-
Eslam Abdelaleem
- Georgia Institute of Technology
-
K. Michael Martini
- Emory University
-
Ilya M Nemenman
- Emory University
-
-
Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks
ORAL
–
Publication: Tianyu He, Darshil Doshi, Aritra Das, Andrey Gromov; "Learning to grok: Emergence of in-context learning and skill compostion in modular arithmetic tasks"; NeurIPS 2024 (Oral)
Presenters
-
Darshil H Doshi
- University of Maryland College Park
Authors
-
Darshil H Doshi
- University of Maryland College Park
-
Tianyu He
- University of Maryland College Park
-
Aritra Das
- University of Maryland, College Park
- University of Maryland College Park
-
Andrey Gromov
- University of Maryland College Park
-
-
Specialization-generalization transition in exemplar-based in-context learning
ORAL
–
Presenters
-
Chase Waring Goddard
- Princeton University
Authors
-
Chase Waring Goddard
- Princeton University
-
Lindsay Maleckar Smith
- Princeton University
-
Vudtiwat Ngampruetikorn
- University of Sydney
-
David J Schwab
- CUNY Graduate Center
- The Graduate Center, CUNY
- CUNY
-
-
Diffusion Models as an Extension of Variational Autoencoders
ORAL
–
Presenters
-
Kentaro Kaba
- Institute of Science Tokyo
Authors
-
Kentaro Kaba
- Institute of Science Tokyo
-
Reo Shimizu
- Tohoku University
-
Masayuki Ohzeki
- Graduate School of Information Sciences, Tohoku University, Department of Physics, Institute of Science Tokyo, Sigma-i Co., Ltd.
- Institute of Science Tokyo, Tohoku University, Sigma-i Co., Ltd.,
- Graduate School of Information Sciences, Tohoku University; Department of Physics, Institute of Science Tokyo; Sigma-i Co., Ltd.
-
Yuki Sughiyama
- Tohoku University
-
-
Origins and mitigation of double descent in sparse sensing
ORAL
–
Publication: Andrei A. Klishin, Samuel E. Otto, J. Nathan Kutz, Krithika Manohar, in preparation (2024)
Presenters
-
Andrei A. Klishin
- University of Hawaiʻi at Mānoa
Authors
-
Andrei A. Klishin
- University of Hawaiʻi at Mānoa
-
Samuel E Otto
- Cornell University
-
J. Nathan Kutz
- University of Washington, AI Institute for Dynamic Systems
-
Krithika Manohar
- University of Washington
-
-
Statistical Mechanics of Double Descent in Deep Learning: a Phase Transition Perspective
ORAL
–
Presenters
-
Chan Li
- University of California, San Diego
Authors
-
Chan Li
- University of California, San Diego
-
Nigel Goldenfeld
- University of California, San Diego
-
-
LLMs Learn Physical Rules of Dynamical Systems: A Geometric Investigation of Emergent Algorithms
ORAL
–
Publication: T. J.B. Liu, N. Boullé, R. Sarfati, & C. J. Earls, LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law, EMNLP (2024)
Liu, T.J., Boull'e, N., Sarfati, R., & Earls, C.J. Density estimation with LLMs: a geometric investigation of in-context learning trajectories, (2024)Presenters
-
Toni Jianbang Liu
- Cornell University
Authors
-
Toni Jianbang Liu
- Cornell University
-
Raphael Sarfati
- Cornell University
-
Christopher Earls
- Cornell University
- Cornell university
-
Nicolas Boulle
- Imperial College London
-
-
Physics-Inspired Model Compression of Neural Networks
ORAL
–
Presenters
-
Daniel T Bernstein
- Princeton University
Authors
-
Daniel T Bernstein
- Princeton University
-
David J Schwab
- CUNY Graduate Center
- The Graduate Center, CUNY
- CUNY
-
-
Long-range order in classification tasks
ORAL
–
Publication: Zhang, YH., Sipling, C., Qiu, E. et al. Collective dynamics and long-range order in thermal neuristor networks. Nat Commun 15, 6986 (2024). https://doi.org/10.1038/s41467-024-51254-4
Computing with long-range order: when, why, and how. In preparation.Presenters
-
Yuan-Hang Zhang
- University of California, San Diego
Authors
-
Yuan-Hang Zhang
- University of California, San Diego
-
Chesson Sipling
- University of California, San Diego
-
Massimiliano Di Ventra
- University of California, San Diego
-