ESE & CIS Spring Seminar – “Beyond the black box: characterizing and improving how neural networks learn”

February 13 at 11:00 AM - 12:00 PM

The predominant paradigm in deep learning practice treats neural networks as “black boxes”. This leads to economic and environmental costs, as brute-force scaling remains the main driver of performance, and to safety issues, as robust reasoning and alignment remain challenging. My research opens up the neural network black box with mathematical and statistical analyses of how networks learn, and yields engineering insights that improve the efficiency and transparency of these models. In this talk, I will present characterizations of (1) how large language models can learn to reason with abstract symbols and (2) how hierarchical structure in data guides deep learning, and I will conclude with (3) new tools to distill trained neural networks into lightweight and transparent models.

Enric Boix-Adsera

Ph.D. Candidate, MIT

Enric Boix is a Ph.D. candidate at MIT, supervised by Guy Bresler and Philippe Rigollet. His research focuses on building a mathematical science of deep learning. He aims to characterize the fundamental mechanisms driving how neural networks learn, so as to enable more efficient and more trustworthy deep learning systems. His research has been supported by an NSF Graduate Research Fellowship, a Siebel Fellowship, and an Apple AI/ML Fellowship.

Details

Date:
February 13
Time:
11:00 AM - 12:00 PM
Website:
https://upenn.zoom.us/j/99074346805?pwd=cm5pNFo3YnZtNGt2QTFhZ05mQTBFQT09

Organizers

Electrical and Systems Engineering
Computer and Information Science

Venue

Raisler Lounge (Room 225), Towne Building
220 South 33rd Street
Philadelphia, PA 19104 United States