Dissertation Talk: Exploratory model analysis for machine learning
Seminar | April 26 | 11 a.m.-12 p.m. | 405 Soda Hall
Biye Jiang, UC Berkeley, Department of EECS
Machine learning is growing in importance in many different fields. However, it is still very hard for users to tune hyper-parameters when optimizing their models, or perform a comprehensive and interpretable diagnosis for complex models like deep neural nets. Existing developer tool like TensorBoard only provides limited functionality which usually visualizes model statistics based on metrics predefined before the training starts. Almost nothing can be adjusted during the training. But the real model optimization and diagnosis procedures actually involve lots of interaction and continuous experiments. To tackle those challenges, we developed a framework which allows users to perform exploratory model analysis based on the hardware accelerated machine learning toolkit BIDMach. Our system is unique that we allow users to interactively plug in live visualizations and adjust hyper-parameters during training, also, we use Monte Carlo style algorithms to allow users explore the entire model space under different user preferences rather than only reaching the local optimums.
We demonstrate the usage of our system in several real-world applications. For problems like advertisement optimization or clustering where multiple optimization objectives exist , users can incorporate secondary criteria into the model-generation process and make trade-offs in an interactive way. For deep convolution neural net diagnostic, users can use our LDAM (Langevin Dynamic Activation Maximization) algorithm to systematically explore the images that stimulate a given neuron. We conduct experiments and user studies on several public dataset to show how users from different background can benefit from our tools.
Advisor: John Canny