Robust Visual Recognition Using Multilayer Generative Neural Networks

Tang, Yichuan

Robust Visual Recognition Using Multilayer Generative Neural Networks

Files

TangMMathThesis.pdf (2.09 MB)

Date

2010-08-25T18:38:27Z

Authors

Tang, Yichuan

Publisher

University of Waterloo

Abstract

Deep generative neural networks such as the Deep Belief Network and Deep Boltzmann Machines have been used successfully to model high dimensional visual data. However, they are not robust to common variations such as occlusion and random noise. In this thesis, we explore two strategies for improving the robustness of DBNs. First, we show that a DBN with sparse connections in the first layer is more robust to variations that are not in the training set. Second, we develop a probabilistic denoising algorithm to determine a subset of the hidden layer nodes to unclamp. We show that this can be applied to any feedforward network classifier with localized first layer connections. By utilizing the already available generative model for denoising prior to recognition, we show significantly better performance over the standard DBN implementations for various sources of noise on the standard and Variations MNIST databases.