[Sds-seminars] [Sds-announce] FDS Colloquium: Eran Malach, 1/15/25, KT 13th Floor, Rm. 1327, 12pm-1pm, "Learning Hard Problems with Neural Networks and Language Models"
elizavette.torres at yale.edu
elizavette.torres at yale.edu
Mon Jan 13 13:36:13 EST 2025
- Previous message (by thread): [Sds-seminars] [Sds-announce] S&DS Seminar, Theo McKenzie, 1/13/25, KT 13th Floor, Rm. 1327, 12pm-1pm, "Random Matrices as a Model of Discrete Quantum Chaos"
- Next message (by thread): [Sds-seminars] [Sds-announce] S&DS Seminar, Eric Sun, 1/27/25, KT 13th Floor, Rm. 1327, 12pm-1pm, "Machine Learning for Aging and Spatial Omics"
- Messages sorted by:
[ date ]
[ thread ]
[ subject ]
[ author ]
FDS Colloquium
Eran Malach, Harvard University
Date: Wednesday, January 15, 2025
Time: 12:00PM to 1:00PM
Location: Kline Tower, 13th Floor, Rm. 1327 See map
<http://maps.google.com/?q=219+Prospect+Street%2C+New+Haven%2C+CT%2C+06511%2
C+us>
219 Prospect Street
New Haven, CT 06511
Website <https://www.eranmalach.com/>
Webcast:
https://yale.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=6f3e7e6c-c1ef-4
066-b4f2-b2590138c39
Learning Hard Problems with Neural Networks and Language Models
Information and Abstract: Modern machine learning models, and in particular
large language models, can now solve surprisingly complex mathematical
reasoning problems. In this talk I will explore how neural networks and
autoregressive language models can learn to solve computationally hard
reasoning tasks. I will begin by discussing the sparse parity problem, a
theoretical proxy for studying the challenges of learning complex functions
with Stochastic Gradient Descent (SGD). I will show that the computational
resources required for learning sparse parities with SGD scale exponentially
with the "sparsity" of the problem, making it computationally hard to learn.
Next, I will demonstrate how introducing step-by-step supervision through
auto-regressive language models overcomes these barriers, enabling simple
models trained on next-token prediction to efficiently learn any
Turing-computable function. These results serve as a basis for studying
machine learning with language models, with implications on data structure,
architecture design and training paradigms.
Bio: Eran Malach is a postdoc Research Fellow in the Kempner Institute at
Harvard University. Previously, he did his PhD at the School of Computer
Science and Engineering in the Hebrew University of Jerusalem, advised by
Prof. Shai Shalev-Shwartz. His research focus is Machine Learning and
Theoretical Foundations of Deep Learning and Language Models. He is mainly
interested in computational aspects of learning and optimization. He also
worked in Mobileye, where he developed machine learning and computer vision
algorithms for driver-assistance systems and self-driving cars. His research
is supported by the Rothschild Fellowship, the William F. Milton Fund and
the OpenAI Superalignment Fast Grant.
Lunch at 11:30am in room 1307
Talk at 12:00-1:00pm in room 1327A
For more details and upcoming events visit our website at
<https://statistics.yale.edu/calendar> https://statistics.yale.edu/calendar.
Department of Statistics and Data Science
Yale University
Kline Tower
219 Prospect Street
New Haven, CT 06511
<https://statistics.yale.edu/> https://statistics.yale.edu/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.yale.edu/pipermail/sds-seminars/attachments/20250113/a781c987/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image002.jpg
Type: image/jpeg
Size: 32188 bytes
Desc: not available
URL: <http://mailman.yale.edu/pipermail/sds-seminars/attachments/20250113/a781c987/attachment.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image003.jpg
Type: image/jpeg
Size: 4853 bytes
Desc: not available
URL: <http://mailman.yale.edu/pipermail/sds-seminars/attachments/20250113/a781c987/attachment-0001.jpg>
-------------- next part --------------
--
Sds-announce mailing list
Sds-announce at mailman.yale.edu
https://mailman.yale.edu/mailman/listinfo/sds-announce
- Previous message (by thread): [Sds-seminars] [Sds-announce] S&DS Seminar, Theo McKenzie, 1/13/25, KT 13th Floor, Rm. 1327, 12pm-1pm, "Random Matrices as a Model of Discrete Quantum Chaos"
- Next message (by thread): [Sds-seminars] [Sds-announce] S&DS Seminar, Eric Sun, 1/27/25, KT 13th Floor, Rm. 1327, 12pm-1pm, "Machine Learning for Aging and Spatial Omics"
- Messages sorted by:
[ date ]
[ thread ]
[ subject ]
[ author ]
More information about the Sds-seminars
mailing list