Dissertation talk: Statistics meets Optimization - Computational guarantees for statistical learning algorithms

Presentation: Dissertation Talk: EE | May 4 | 2-3 p.m. | 400 Cory Hall

 Fanny Yang

 Electrical Engineering and Computer Sciences (EECS)

Modern technological advances have prompted massive-scale data collection in many fields. This has led to an increasing need for scalable machine learning algorithms and statistical methods to draw conclusions about the world. Two principal challenges arise in this context: How can we collect data efficiently so that a reduced sample size suffices to draw conclusions with high confidence? How should we design the learning algorithm, and how long should we run it? In this talk, I will address these questions for concrete problems in estimation and testing.

I will start by discussing the regularization effect of stopping gradient-type methods before convergence in non-parametric estimation. In particular, we show how both early stopping and penalty regularization can be explained by localized complexities. In the second part of the talk, I will turn to efficient data collection and demonstrate how adaptive sampling algorithms can reduce the number of samples required for simultaneous false discovery rate control and best-arm identification in multiple testing.
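To give a rough flavor of the first topic, here is a minimal illustrative sketch (in Python, with a Gaussian kernel and synthetic data, both assumptions for this example) of functional gradient descent for non-parametric least-squares regression, stopped early with a simple holdout rule. The holdout rule stands in for the data-dependent, localized-complexity stopping rule analyzed in the talk; it is not the method itself.

```python
import numpy as np

# Illustrative sketch: functional gradient descent (kernel boosting) on
# squared loss, stopped early via a held-out set. The talk's theory uses
# a stopping rule based on localized complexities; holdout stopping is
# used here only to show the regularization effect of stopping early.

rng = np.random.default_rng(0)
n = 200
x = rng.uniform(0, 1, n)
y = np.sin(2 * np.pi * x) + 0.3 * rng.standard_normal(n)  # noisy nonparametric target

idx = rng.permutation(n)
tr, ho = idx[:150], idx[150:]                              # train / holdout split

def gauss_kernel(a, b, bw=0.1):
    # Gaussian kernel matrix between two 1-D sample vectors (assumed bandwidth)
    return np.exp(-(a[:, None] - b[None, :]) ** 2 / (2 * bw ** 2))

K_tr = gauss_kernel(x[tr], x[tr])
K_ho = gauss_kernel(x[ho], x[tr])

alpha = np.zeros(len(tr))                      # f(.) = sum_i alpha_i k(., x_i)
step = 1.0 / np.linalg.eigvalsh(K_tr).max()    # conservative step size

best_err, best_alpha, patience = np.inf, alpha.copy(), 0
for t in range(2000):
    resid = K_tr @ alpha - y[tr]
    alpha -= step * resid                      # functional gradient step on squared loss
    err = np.mean((K_ho @ alpha - y[ho]) ** 2) # holdout error tracks generalization
    if err < best_err:
        best_err, best_alpha, patience = err, alpha.copy(), 0
    else:
        patience += 1
        if patience >= 20:                     # stop once holdout error keeps rising
            break

print(f"stopped at iteration {t}, holdout MSE {best_err:.3f}")
```

Running the loop to full convergence would interpolate the noise; stopping when the holdout error turns upward plays the role that an explicit penalty would otherwise play.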

 fanny-yang@berkeley.edu