» version anglaise » APPENDICES » News » SEMINAR 29 Sept.2014 : Do Deep Nets Really Need to Be Deep ?

SEMINAR 29 Sept.2014 : Do Deep Nets Really Need to Be Deep ?

Do Deep Nets Really Need to Be Deep ?

Abstract :
Currently, deep neural networks are the state of the art on problems such as speech recognition and computer vision. We show that by using a method called model compression that shallow feed-forward nets can learn the complex functions previously learned by deep nets and achieve accuracies previously only achievable with deep models. Moreover, in some cases the shallow neural nets can learn these deep functions using the same number of parameters as the original deep models. On the TIMIT phoneme recognition and CIFAR-10 image recognition tasks, shallow nets can be trained that perform similarly to complex, well-engineered, deeper convolutional architectures. Our success in training shallow neural nets to mimic deeper models suggests that there may be better algorithms for training shallow nets than those currently available.

PDF - 609.3 ko