Machine Learning Meets Statistical Physics II
FOCUS · MAR-L69 · ID: 3096900
Presentations
-
Variational gradient descent: enhancing generalization with automatically learned landscape-dependent noise.
ORAL · Invited
–
Presenters
-
David Hathcock
IBM Thomas J. Watson Research Center
Authors
-
David Hathcock
IBM Thomas J. Watson Research Center
-
Yuhai Tu
IBM Thomas J. Watson Research Center
-
-
Top-Down approach to dynamical coarse-graining using Differentiable Generalized Langevin Equation
ORAL
–
Publication: Jeong, Jinu, Ishan Nadkarni, and Narayana Aluru. "DiffGLE: Differentiable Coarse-Grained Dynamics using Generalized Langevin Equation." arXiv preprint arXiv:2410.08424 (2024).
Presenters
-
Ishan Mangesh Nadkarni
The University of Texas at Austin
Authors
-
Ishan Mangesh Nadkarni
The University of Texas at Austin
-
Jinu Jeong
University of Illinois at Urbana−Champaign, Urbana, The University of Illinois at Urbana-Champaign
-
Narayana R Aluru
The University of Texas at Austin, University of Texas at Austin
-
-
The Manifold Packing Loss Function: A Physics-Inspired Approach to Contrastive Self-Supervised Learning
ORAL
–
Presenters
-
Guanming Zhang
New York University (NYU)
Authors
-
Guanming Zhang
New York University (NYU)
-
David J Heeger
New York University
-
Stefano Martiniani
New York University (NYU)
-
-
Mutual Information Can Be Estimated when Undersampled Data Have Low-Dimensional Latent Structure
ORAL
–
Presenters
-
Eslam Abdelaleem
Georgia Institute of Technology
Authors
-
Eslam Abdelaleem
Georgia Institute of Technology
-
K. Michael Martini
Emory University
-
Ilya M Nemenman
Emory University
-
-
Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks
ORAL
–
Publication: Tianyu He, Darshil Doshi, Aritra Das, Andrey Gromov; "Learning to grok: Emergence of in-context learning and skill compostion in modular arithmetic tasks"; NeurIPS 2024 (Oral)
Presenters
-
Darshil H Doshi
University of Maryland College Park
Authors
-
Darshil H Doshi
University of Maryland College Park
-
Tianyu He
University of Maryland College Park
-
Aritra Das
University of Maryland, College Park, University of Maryland College Park
-
Andrey Gromov
University of Maryland College Park
-
-
Specialization-generalization transition in exemplar-based in-context learning
ORAL
–
Presenters
-
Chase Waring Goddard
Princeton University
Authors
-
Chase Waring Goddard
Princeton University
-
Lindsay Maleckar Smith
Princeton University
-
Vudtiwat Ngampruetikorn
University of Sydney
-
David J Schwab
CUNY Graduate Center, The Graduate Center, CUNY, CUNY
-
-
Diffusion Models as an Extension of Variational Autoencoders
ORAL
–
Presenters
-
Kentaro Kaba
Institute of Science Tokyo
Authors
-
Kentaro Kaba
Institute of Science Tokyo
-
Reo Shimizu
Tohoku University
-
Masayuki Ohzeki
Graduate School of Information Sciences, Tohoku University, Department of Physics, Institute of Science Tokyo, Sigma-i Co., Ltd., Institute of Science Tokyo, Tohoku University, Sigma-i Co., Ltd.,, Graduate School of Information Sciences, Tohoku University; Department of Physics, Institute of Science Tokyo; Sigma-i Co., Ltd.
-
Yuki Sughiyama
Tohoku University
-
-
Origins and mitigation of double descent in sparse sensing
ORAL
–
Publication: Andrei A. Klishin, Samuel E. Otto, J. Nathan Kutz, Krithika Manohar, in preparation (2024)
Presenters
-
Andrei A. Klishin
University of Hawaiʻi at Mānoa
Authors
-
Andrei A. Klishin
University of Hawaiʻi at Mānoa
-
Samuel E Otto
Cornell University
-
J. Nathan Kutz
University of Washington, AI Institute for Dynamic Systems
-
Krithika Manohar
University of Washington
-
-
Statistical Mechanics of Double Descent in Deep Learning: a Phase Transition Perspective
ORAL
–
Presenters
-
Chan Li
University of California, San Diego
Authors
-
Chan Li
University of California, San Diego
-
Nigel Goldenfeld
University of California, San Diego
-
-
LLMs Learn Physical Rules of Dynamical Systems: A Geometric Investigation of Emergent Algorithms
ORAL
–
Publication: T. J.B. Liu, N. Boullé, R. Sarfati, & C. J. Earls, LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law, EMNLP (2024)<br><br>Liu, T.J., Boull'e, N., Sarfati, R., & Earls, C.J. Density estimation with LLMs: a geometric investigation of in-context learning trajectories, (2024)
Presenters
-
Toni Jianbang Liu
Cornell University
Authors
-
Toni Jianbang Liu
Cornell University
-
Raphael Sarfati
Cornell University
-
Christopher Earls
Cornell University, Cornell university
-
Nicolas Boulle
Imperial College London
-
-
Physics-Inspired Model Compression of Neural Networks
ORAL
–
Presenters
-
Daniel T Bernstein
Princeton University
Authors
-
Daniel T Bernstein
Princeton University
-
David J Schwab
CUNY Graduate Center, The Graduate Center, CUNY, CUNY
-
-
Long-range order in classification tasks
ORAL
–
Publication: Zhang, YH., Sipling, C., Qiu, E. et al. Collective dynamics and long-range order in thermal neuristor networks. Nat Commun 15, 6986 (2024). https://doi.org/10.1038/s41467-024-51254-4<br>Computing with long-range order: when, why, and how. In preparation.
Presenters
-
Yuan-Hang Zhang
University of California, San Diego
Authors
-
Yuan-Hang Zhang
University of California, San Diego
-
Chesson Sipling
University of California, San Diego
-
Massimiliano Di Ventra
University of California, San Diego
-