Machine Learning talk: Andrew Wilson, NYU Building Models that Learn and Generalize

Thursday, September 8, 2022
3:30 p.m.
Online zoom talk

TALK: "How Do We Build Models That Learn and Generalize?"
 
SPEAKER: Andrew Wilson, Associate Professor at New York University 
 
DATE: Thursday, September 8, 2022
TIME3:30—4:30 p.m.
 
PASSWORD: 828w
 
ABSTRACTTo answer scientific questions and reason about data, we must build models and perform inference within those models. But how should we approach model construction and inference to make the most successful predictions? How do we represent uncertainty and prior knowledge? How flexible should our models be? Should we use a single model, or multiple different models? Should we follow a different procedure depending on how much data are available? How do we learn desirable constraints, such as rotation, translation, or reflection symmetries, when they don't improve standard training loss? How do we select between models that are entirely consistent with any observed data? What if our test data are drawn from a different but semantically related distribution?
 
In this lecture, I will present a philosophy for model construction, grounded in probability theory. I will exemplify this approach with methods that exploit loss surface geometry for scalable and practical Bayesian deep learning, and resolutions to seemingly mysterious generalization behaviour such as double descent. I will also consider prior specification, model selection, generalized Bayesian inference, domain shift, and automatic constraint (symmetry) learning.
 
REFERENCES:
 
BIO: Andrew Gordon Wilson is an associate professor at New York University with a joint appointment in the Courant Institute of Mathematical Sciences and the Center for Data Science Computer Science Department. Before that, he was an assistant professor at Cornell and a research fellow at Carnegie Mellon. Wilson graduated with his Ph.D. in machine learning from the University of Cambridge in 2014. He is specifically interested in developing flexible, interpretable, and scalable machine learning models, often involving deep learning, Gaussian processes, and kernel learning.

remind we with google calendar

 

May 2025

SU MO TU WE TH FR SA
27 28 29 30 1 2 3
4 5 6 7 8 9 10
11 12 13 14 15 16 17
18 19 20 21 22 23 24
25 26 27 28 29 30 31
1 2 3 4 5 6 7
Submit an Event