Clicky

Using Second-Order Information in Training Deep Neural Networks by Yi Ren and similar books you'll love - Bookscovery

Home > Authors > Yi Ren > Using Second-Order Information in Training Deep Neural Networks

Using Second-Order Information in Training Deep Neural Networks

Yi Ren

In this dissertation, we are concerned with the advancement of optimization algorithms for training deep learning models, and in particular about practical second-order methods that take into account the structure of deep neural networks (DNNs). Although first-order methods such as stochastic gradient descent have long been the predominant optimization algorithm used in deep learning, second-order methods are of interest because of their ability to use curvature information to accelerate the optimization process. After the presentation of some background information in Chapter 1, Chapters 2 and 3 focus on the development of practical quasi-Newton methods for training DNNs. We analyze the Kronecker-factored structure of the Hessian matrix of multi-layer perceptrons and convolutional neural networks and consequently propose block-diagonal Kronecker-factored quasi-Newton methods named...

Recent activity

Rate this book to see your activity here.

9 Books Similar to Using Second-Order Information in Training Deep Neural Networks by Yi Ren

Bookscovery readers who liked Using Second-Order Information in Training Deep Neural Networks also like Chinese for Beginners, Cong ming de xiao tu and Mandarin Chinese for Beginners. How many of these have you read?

Comments and reviews of Using Second-Order Information in Training Deep Neural Networks

Please sign in to leave a comment