Search results for: gradient descent

Number of results: 137,892

2009
Thord Andersson Gunnar Läthén Reiner Lenz Magnus Borga

Level set methods are a popular way to solve the image segmentation problem in computer image analysis. A contour is implicitly represented by the zero level of a signed distance function, and evolved according to a motion equation in order to minimize a cost function. This function defines the objective of the segmentation problem and also includes regularization constraints. Gradient descent ...
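
As a hedged illustration of the principle this abstract describes, the sketch below minimizes a cost made of a data term plus a regularizer by stepping against its gradient. The toy quadratic cost and the names cost_grad, lam, and step are illustrative, not the authors' level set formulation.

```python
import numpy as np

def cost_grad(x, target, lam=0.1):
    """Gradient of E(x) = 0.5*||x - target||^2 + 0.5*lam*||x||^2 (data term + regularizer)."""
    return (x - target) + lam * x

x = np.zeros(4)                          # initial "contour" parameters
target = np.array([1.0, 2.0, -1.0, 0.5])
step = 0.2                               # descent step (time step of the motion equation)
for _ in range(300):
    x -= step * cost_grad(x, target)     # x evolves downhill on the cost

print(x)   # approaches target / (1 + lam), the regularized minimizer
```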

Journal: :SIAM J. Financial Math. 2017
Justin A. Sirignano Konstantinos Spiliopoulos

We consider stochastic gradient descent for continuous-time models. Traditional approaches for the statistical estimation of continuous-time models, such as batch optimization, can be impractical for large datasets where observations occur over a long period of time. Stochastic gradient descent provides a computationally efficient method for such statistical learning problems. The stochastic gr...
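
A minimal sketch of the computational idea, assuming a toy linear model: each observation in the stream is touched once, the parameters are updated immediately, and the dataset is never stored. The setup and learning-rate schedule are illustrative, not the paper's continuous-time estimator.

```python
import numpy as np

rng = np.random.default_rng(0)
theta_true = np.array([2.0, -1.0])
theta = np.zeros(2)

for t in range(1, 10001):            # stream of observations, one pass over the data
    x = rng.normal(size=2)
    y = x @ theta_true + 0.1 * rng.normal()
    grad = (x @ theta - y) * x       # gradient of 0.5 * (x . theta - y)^2
    theta -= (0.5 / t) * grad        # decaying learning rate

print(theta)    # close to theta_true without ever revisiting old data
```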

2015
David E. Carlson Volkan Cevher Lawrence Carin

Restricted Boltzmann Machines (RBMs) are widely used as building blocks for deep learning models. Learning typically proceeds by using stochastic gradient descent, and the gradients are estimated with sampling methods. However, the gradient estimation is a computational bottleneck, so better use of the gradients will speed up the descent algorithm. To this end, we first derive upper bounds on t...
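
To make the sampled gradient estimate concrete, here is the standard one-step contrastive divergence (CD-1) update for a tiny binary RBM. This is the common textbook estimator, not necessarily the exact variant the paper bounds; the sizes and learning rate are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

nv, nh = 6, 4
W = 0.01 * rng.normal(size=(nv, nh))            # RBM weights (biases omitted)
v0 = rng.integers(0, 2, size=nv).astype(float)  # one training visible vector

# Positive phase: hidden activations driven by the data.
ph0 = sigmoid(v0 @ W)
h0 = (rng.random(nh) < ph0).astype(float)

# Negative phase: one Gibbs step to get a model-driven sample (CD-1).
pv1 = sigmoid(h0 @ W.T)
v1 = (rng.random(nv) < pv1).astype(float)
ph1 = sigmoid(v1 @ W)

grad_W = np.outer(v0, ph0) - np.outer(v1, ph1)  # sampled gradient estimate
W += 0.1 * grad_W                               # one stochastic gradient step
```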

1997
Peter Sollich David Barber

We analyse online (gradient descent) learning of a rule from a finite set of training examples at non-infinitesimal learning rates, calculating exactly the time-dependent generalization error for a simple model scenario. In the thermodynamic limit, we close the dynamical equation for the generating function of an infinite hierarchy of order parameters using 'within-sample self-averaging'. The res...

2002
Nicol N. Schraudolph Thore Graepel

The method of conjugate gradients provides a very effective way to optimize large, deterministic systems by gradient descent. In its standard form, however, it is not amenable to stochastic approximation of the gradient. Here we explore a number of ways to adopt ideas from conjugate gradient in the stochastic setting, using fast Hessian-vector products to obtain curvature information cheaply. I...
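
One cheap way to realize the "fast Hessian-vector products" mentioned above is a central finite difference of the gradient, sketched here on a toy quadratic. The authors may use exact analytic products instead, so treat this as an assumption-laden stand-in.

```python
import numpy as np

A = np.array([[3.0, 1.0],
              [1.0, 2.0]])    # SPD matrix, so f(w) = 0.5 * w^T A w has Hessian A

def grad(w):
    return A @ w              # gradient of the toy objective

w = np.array([1.0, -1.0])     # current parameters
v = np.array([0.5, 2.0])      # direction to probe for curvature

eps = 1e-6
Hv = (grad(w + eps * v) - grad(w - eps * v)) / (2 * eps)  # Hessian-vector product
print(Hv, A @ v)              # the two should agree
```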

Recently, we have demonstrated a new and efficient method to simultaneously reconstruct two unknown interfering wavefronts. A three-dimensional interference pattern was analyzed and then Zernike polynomials and the stochastic parallel gradient descent algorithm were used to expand and calculate wavefronts. In this paper, as one of the applications of this method, the reflected wavefronts from t...
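
The stochastic parallel gradient descent (SPGD) update can be sketched as follows: perturb all coefficients at once, evaluate the quality metric twice, and step in proportion to the measured change. In the wavefront application the coefficients would be Zernike expansion coefficients and the metric an interferometric quality measure; the quadratic metric, delta, and gain below are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
a_true = np.array([0.3, -0.7, 0.1])               # "ideal" coefficients

def metric(a):
    return -np.sum((a - a_true) ** 2)             # quality metric, maximal at a_true

a = np.zeros(3)                                   # coefficient estimates
delta, gain = 0.05, 2.0                           # perturbation size and update gain
for _ in range(3000):
    d = delta * rng.choice([-1.0, 1.0], size=3)   # parallel random perturbation
    dJ = metric(a + d) - metric(a - d)            # two metric evaluations
    a += gain * dJ * d                            # SPGD update

print(a)   # converges toward a_true
```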

The purpose of this study is to analyze the performance of the backpropagation algorithm in feed-forward neural networks as the training patterns and the second momentum term are varied. The analysis is conducted on 250 different three-letter lowercase English words. These words are presented to two vertical segmentation programs, designed in MATLAB and based on portions (1...
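
To make the "second momentum term" concrete, here is a hedged sketch of gradient descent whose update mixes the current gradient with the updates from one and two steps back. The study's exact formulation is not given in the snippet, so the coefficients mu1, mu2, and lr are assumptions.

```python
def grad(w):
    return 2.0 * w                 # gradient of the toy cost f(w) = w^2

w = 5.0
v_prev, v_prev2 = 0.0, 0.0         # updates from one and two steps back
lr, mu1, mu2 = 0.1, 0.9, 0.05      # learning rate, first and second momentum terms

for _ in range(300):
    v = -lr * grad(w) + mu1 * v_prev + mu2 * v_prev2
    w += v
    v_prev2, v_prev = v_prev, v

print(w)   # converges toward 0, the minimizer
```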

2017
Yiming Wang Vijayaditya Peddinti Hainan Xu Xiaohui Zhang Daniel Povey Sanjeev Khudanpur

In this paper we describe a modification to Stochastic Gradient Descent (SGD) that improves generalization to unseen data. It consists of doing two steps for each minibatch: a backward step with a small negative learning rate, followed by a forward step with a larger learning rate. The idea was initially inspired by ideas from adversarial training, but we show that it can be viewed as a crude w...
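
A minimal sketch of the two-step minibatch update described above, on a noisy toy quadratic: each iteration first takes a small step with a negative learning rate, then a larger forward step. The scale alpha and rate lr are illustrative guesses, not the paper's settings.

```python
import numpy as np

rng = np.random.default_rng(0)
w = np.array([4.0, -3.0])
lr, alpha = 0.1, 0.3               # forward rate and backstitch scale (assumed values)

def minibatch_grad(w):
    # Noisy gradient of f(w) = ||w||^2, standing in for a minibatch gradient.
    return 2.0 * w + 0.05 * rng.normal(size=w.shape)

for _ in range(200):
    g1 = minibatch_grad(w)
    w += alpha * lr * g1           # backward step: small negative learning rate
    g2 = minibatch_grad(w)
    w -= (1.0 + alpha) * lr * g2   # forward step with a larger learning rate

print(w)   # settles near the origin, the minimizer
```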

Journal: :CoRR 2011
Wei Xu

For large scale learning problems, it is desirable if we can obtain the optimal model parameters by going through the data in only one pass. Polyak and Juditsky (1992) showed that asymptotically the test performance of the simple average of the parameters obtained by stochastic gradient descent (SGD) is as good as that of the parameters which minimize the empirical cost. However, to our knowled...
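
The Polyak and Juditsky (1992) averaging cited here can be sketched as keeping a running mean of the SGD iterates during a single pass and reporting that mean as the final model. The linear regression setup and the t^(-1/2) learning-rate schedule are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(1)
theta_true = np.array([1.0, -2.0, 0.5])
theta = np.zeros(3)
theta_bar = np.zeros(3)                  # running average of the SGD iterates

for t in range(1, 20001):                # a single pass over streaming data
    x = rng.normal(size=3)
    y = x @ theta_true + 0.1 * rng.normal()
    theta -= (0.2 / t**0.5) * (x @ theta - y) * x   # SGD step, lr ~ t^(-1/2)
    theta_bar += (theta - theta_bar) / t            # incremental Polyak average

print(theta_bar)   # the averaged iterate is reported as the final model
```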

2014
Mohammad Taha Bahadori Yi Chang Bo Long Yan Liu

In this paper, we propose to study the problem of heterogeneous transfer ranking, a transfer learning problem with heterogeneous features, in order to utilize the rich, large-scale labeled data available in popular languages to help the ranking task in less popular languages. We develop a large-margin algorithm, namely LM-HTR, to solve the problem by mapping the input features in both the source domain and...
