gradient descent

نتایج جستجو برای: gradient descent

تعداد نتایج: 137892 فیلتر نتایج به سال:

Conditional Accelerated Lazy Stochastic Gradient Descent

2017

Guanghui Lan Sebastian Pokutta Yi Zhou Daniel Zink

In this work we introduce a conditional accelerated lazy stochastic gradient descent algorithm with optimal number of calls to a stochastic first-order oracle and convergence rate O( 1 ε2 ) improving over the projection-free, Online Frank-Wolfe based stochastic gradient descent of Hazan and Kale [2012] with convergence rate O( 1 ε4 ).

متن کامل

Stochastic gradient descent algorithms for strongly convex functions at O(1/T) convergence rates

Journal: :CoRR 2013

Shenghuo Zhu

With a weighting scheme proportional to t, a traditional stochastic gradient descent (SGD) algorithm achieves a high probability convergence rate of O(κ/T ) for strongly convex functions, instead of O(κ ln(T )/T ). We also prove that an accelerated SGD algorithm also achieves a rate of O(κ/T ).

متن کامل

Training an articulatory synthesizer with continuous acoustic data

2013

Santitham Prom-on Peter Birkholz Yi Xu

This paper reports preliminary results of our effort to address the acoustic-to-articulatory inversion problem. We tested an approach that simulates speech production acquisition as a distal learning task, with acoustic signals of natural utterances in the form of MFCC as input, VocalTractLab — a 3D articulatory synthesizer controlled by target approximation models as the learner, and stochasti...

متن کامل

Matrix Factorization using Window Sampling and Negative Sampling for Improved Word Representations

Journal: :CoRR 2016

Alexandre Salle Aline Villavicencio Marco Idiart

In this paper, we propose LexVec, a new method for generating distributed word representations that uses low-rank, weighted factorization of the Positive Point-wise Mutual Information matrix via stochastic gradient descent, employing a weighting scheme that assigns heavier penalties for errors on frequent cooccurrences while still accounting for negative co-occurrence. Evaluation on word simila...

متن کامل

Analog VLSI Implementation of Gradient Descent

1992

David B. Kirch Douglas Kerns Kurt W. Fleischer Alan H. Barr

We describe an analog VLSI implementation of a multi-dimensional gradient estimation and descent technique for minimizing an onchip scalar function fO. The implementation uses noise injection and multiplicative correlation to estimate derivatives, as in [Anderson, Kerns 92]. One intended application of this technique is setting circuit parameters on-chip automatically, rather than manually [Kir...

متن کامل

Statistical inference using SGD

Journal: :CoRR 2017

Tianyang Li Liu Liu Anastasios Kyrillidis Constantine Caramanis

We present a novel method for frequentist statistical inference in M -estimation problems, based on stochastic gradient descent (SGD) with a fixed step size: we demonstrate that the average of such SGD sequences can be used for statistical inference, after proper scaling. An intuitive analysis using the OrnsteinUhlenbeck process suggests that such averages are asymptotically normal. From a prac...

متن کامل

Abstract - Training Non-linear Structured Prediction Models with Stochastic Gradient Descent

2008

Thomas Gärtner

Training Non-linear Structured Prediction Models with Stochastic Gradient Descent Thomas Gärtner [email protected] Shankar Vembu [email protected] Fraunhofer IAIS, Schloß Birlinghoven, 53754 Sankt Augustin, Germany

متن کامل

Use it or Lose it: Selective Memory and Forgetting in a Perpetual Learning Machine

Journal: :CoRR 2015

Andrew J. R. Simpson

In a recent article we described a new type of deep neural network– a Perpetual Learning Machine (PLM) – which is capable of learning ‘on the fly’ like a brain by existing in a state of Perpetual Stochastic Gradient Descent (PSGD). Here, by simulating the process of practice, we demonstrate both selective memory and selective forgetting when we introduce statistical recall biases during PSGD. F...

متن کامل

Link Prediction via Matrix Factorization

2011

Aditya Krishna Menon Charles Elkan

We propose to solve the link prediction problem in graphs using a supervised matrix factorization approach. The model learns latent features from the topological structure of a (possibly directed) graph, and is shown to make better predictions than popular unsupervised scores. We show how these latent features may be combined with optional explicit features for nodes or edges, which yields bett...

متن کامل

Generalization Error Bounds for Aggregation by Mirror Descent with Averaging

2005

Anatoli Juditsky Alexander V. Nazin Alexandre B. Tsybakov Nicolas Vayatis

We consider the problem of constructing an aggregated estimator from a finite class of base functions which approximately minimizes a convex risk functional under the l1 constraint. For this purpose, we propose a stochastic procedure, the mirror descent, which performs gradient descent in the dual space. The generated estimates are additionally averaged in a recursive fashion with specific weig...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید