Global-connected network with generalized ReLU activation
Date
2019-12
Authors
Chen, Zhi
Ho, Pin-Han
Publisher
Elsevier
Abstract
Recent progress has shown that exploiting hidden-layer neurons in convolutional neural networks (CNNs), combined with a carefully designed activation function, can yield better classification results in computer vision. This paper first introduces a novel deep learning (DL) architecture that mitigates the vanishing-gradient problem by connecting earlier hidden-layer neurons directly to the last hidden layer, whose output is fed into the softmax layer for classification. We then design a generalized linear rectifier function as the activation function, which can approximate arbitrarily complex functions through training of its parameters. We show that our design achieves comparable performance on a number of object recognition and video action benchmarks, including MNIST, CIFAR-10/100, SVHN, Fashion-MNIST, STL-10, and the UCF YouTube Action video dataset, with significantly fewer parameters and a shallower network architecture. This not only reduces the computational burden and memory usage during training, but also makes the design applicable to low-computation, low-memory mobile scenarios at inference time.
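The abstract describes two ideas: a trainable, ReLU-like piecewise-linear activation, and "global" connections that feed every hidden layer directly into the classifier. The sketch below illustrates both in PyTorch, under stated assumptions: the paper's exact parameterization and layer widths are not given here, so the class names (GeneralizedReLU, GlobalConnectedNet), the sum-of-shifted-ReLUs form, the segment count num_segments, and the channel sizes are all hypothetical choices for illustration, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class GeneralizedReLU(nn.Module):
    """Piecewise-linear activation with trainable slopes and breakpoints.

    Hedged sketch: a sum of shifted ReLUs with learned per-segment
    slopes can approximate complex piecewise-linear functions (cf.
    PReLU); the paper's exact form may differ.
    """

    def __init__(self, num_segments: int = 4):
        super().__init__()
        # Trainable per-segment slopes and thresholds (assumed form).
        self.slopes = nn.Parameter(torch.ones(num_segments))
        self.thresholds = nn.Parameter(torch.linspace(0.0, 1.0, num_segments))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        out = torch.zeros_like(x)
        for s, t in zip(self.slopes, self.thresholds):
            # Each term contributes slope s beyond breakpoint t.
            out = out + s * F.relu(x - t)
        return out


class GlobalConnectedNet(nn.Module):
    """Hypothetical CNN in which every hidden layer is pooled and
    concatenated into the final feature vector, so the softmax
    classifier sees features from all depths and gradients reach
    early layers directly (mitigating vanishing gradients)."""

    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.act = GeneralizedReLU()
        self.conv1 = nn.Conv2d(3, 32, 3, padding=1)
        self.conv2 = nn.Conv2d(32, 64, 3, padding=1)
        self.conv3 = nn.Conv2d(64, 128, 3, padding=1)
        self.pool = nn.AdaptiveAvgPool2d(1)
        # Classifier consumes the concatenation of all hidden layers.
        self.fc = nn.Linear(32 + 64 + 128, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h1 = self.act(self.conv1(x))
        h2 = self.act(self.conv2(F.max_pool2d(h1, 2)))
        h3 = self.act(self.conv3(F.max_pool2d(h2, 2)))
        # Global connections: pool each hidden layer and concatenate
        # before the softmax layer.
        feats = torch.cat(
            [self.pool(h).flatten(1) for h in (h1, h2, h3)], dim=1
        )
        return self.fc(feats)  # logits; softmax is applied in the loss


# Example: a CIFAR-10-sized input produces 10 class logits.
logits = GlobalConnectedNet(num_classes=10)(torch.randn(2, 3, 32, 32))
print(logits.shape)  # torch.Size([2, 10])
```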
Description
The final publication is available at Elsevier via https://doi.org/10.1016/j.patcog.2019.07.006. © 2019. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/
Keywords
computer vision, deep learning, activation, convolutional neural network