stochastic gradient descent

نتایج جستجو برای: stochastic gradient descent

تعداد نتایج: 258150 فیلتر نتایج به سال:

Asynchronous Distributed Semi-Stochastic Gradient Optimization

2016

Ruiliang Zhang Shuai Zheng James T. Kwok

With the recent proliferation of large-scale learning problems, there have been a lot of interest on distributed machine learning algorithms, particularly those that are based on stochastic gradient descent (SGD) and its variants. However, existing algorithms either suffer from slow convergence due to the inherent variance of stochastic gradients, or have a fast linear convergence rate but at t...

متن کامل

Generalization Bounds for Randomized Learning with Application to Stochastic Gradient Descent

2016

Ben London

Randomized algorithms are central to modern machine learning. In the presence of massive datasets, researchers often turn to stochastic optimization to solve learning problems. Of particular interest is stochastic gradient descent (SGD), a first-order method that approximates the learning objective and gradient by a random point estimate. A classical question in learning theory is, if a randomi...

متن کامل

Generalization Error Bounds with Probabilistic Guarantee for SGD in Nonconvex Optimization

Journal: :CoRR 2018

Yi Zhou Yingbin Liang Huishuai Zhang

The success of deep learning has led to a rising interest in the generalization property of the stochastic gradient descent (SGD) method, and stability is one popular approach to study it. Existing works based on stability have studied nonconvex loss functions, but only considered the generalization error of the SGD in expectation. In this paper, we establish various generalization error bounds...

متن کامل

Efficient Low-Rank Stochastic Gradient Descent Methods for Solving Semidefinite Programs

2014

Jianhui Chen Tianbao Yang Shenghuo Zhu

We propose a low-rank stochastic gradient descent (LR-SGD) method for solving a class of semidefinite programming (SDP) problems. LR-SGD has clear computational advantages over the standard SGD peers as its iterative projection step (a SDP problem) can be solved in an efficient manner. Specifically, LR-SGD constructs a low-rank stochastic gradient and computes an optimal solution to the project...

متن کامل

Stochastic Optimization with Importance Sampling

Journal: :CoRR 2014

Peilin Zhao Tong Zhang

Uniform sampling of training data has been commonly used in traditional stochastic optimization algorithms such as Proximal Stochastic Gradient Descent (prox-SGD) and Proximal Stochastic Dual Coordinate Ascent (prox-SDCA). Although uniform sampling can guarantee that the sampled stochastic quantity is an unbiased estimate of the corresponding true quantity, the resulting estimator may have a ra...

متن کامل

Gradient - Based Optimization of Markov Reward Processes :

2000

Peter Marbach John N. Tsitsiklis

We consider a discrete time, nite state Markov reward process that depends on a set of parameters. In earlier work, we proposed a class of (stochastic) gradient descent methods that tune the parameters in order to optimize the average reward, using a single (possibly simulated) sample path of the process of interest. The resulting algorithms can be implemented online, and have the property that...

متن کامل

Probabilistic Line Searches for Stochastic Optimization

Journal: :Journal of Machine Learning Research 2015

Maren Mahsereci Philipp Hennig

In deterministic optimization problems, line search routines are a standard tool ensuring stability and efficiency. In the stochastic setting, no direct equivalent has so far been formulated, because uncertain gradients do not allow for a strict sequence of decisions collapsing the search space. We construct a probabilistic version of the line search paradigm, by combining the structure of exis...

متن کامل

Hyphenation with Conditional Random Field

2012

Panqu Wang

In this project, we approach the problem of English-word hyphenation using a linear-chain conditional random field model. We measure the effectiveness of different feature combinations and two different learning methods: Collins perceptron and stochastic gradient following. We achieve the accuracy rate of 77.95% using stochastic gradient descent.

متن کامل

Gradient Convergence in Gradient methods with Errors

Journal: :SIAM Journal on Optimization 2000

Dimitri P. Bertsekas John N. Tsitsiklis

We consider the gradient method xt+1 = xt + γt(st + wt), where st is a descent direction of a function f : �n → � and wt is a deterministic or stochastic error. We assume that ∇f is Lipschitz continuous, that the stepsize γt diminishes to 0, and that st and wt satisfy standard conditions. We show that either f(xt) → −∞ or f(xt) converges to a finite value and ∇f(xt) → 0 (with probability 1 in t...

متن کامل

Fast Online Policy Gradient Learning with SMD Gain Vector Adaptation

2005

Nicol N. Schraudolph Douglas Aberdeen Jin Yu

Reinforcement learning by direct policy gradient estimation is attractive in theory but in practice leads to notoriously ill-behaved optimization problems. We improve its robustness and speed of convergence with stochastic meta-descent, a gain vector adaptation method that employs fast Hessian-vector products. In our experiments the resulting algorithms outperform previously employed online sto...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید