Search results for: gradient descent
Number of results: 137892
This paper considers the problem of supervised learning with linear methods when both features and labels can be corrupted, either in the form of heavy-tailed data and/or corrupted rows. We introduce a combination of coordinate gradient descent as a learning algorithm together with robust estimators of the partial derivatives. This leads to statistical learning methods that have a numerical complexity nearly identical to non-robust ones based on empiric...
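As a rough illustration of the idea (not the paper's exact construction), the sketch below runs coordinate gradient descent on a least-squares objective and replaces the plain average of the per-sample partial derivatives with a median-of-means estimate, one standard choice of robust estimator. All function names, step sizes, and the toy data are assumptions for illustration only.

```python
import numpy as np

def median_of_means(values, n_blocks=10):
    """Robust mean estimate: split values into blocks, average each block,
    and return the median of the block means."""
    values = np.asarray(values)
    blocks = np.array_split(np.random.permutation(values), n_blocks)
    return np.median([b.mean() for b in blocks])

def robust_coordinate_descent(X, y, n_epochs=50, lr=0.1, n_blocks=10):
    """Coordinate gradient descent for least squares where each partial
    derivative is estimated with median-of-means instead of a plain average
    (a sketch of the idea; the paper's estimator and step sizes may differ)."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(n_epochs):
        for j in range(d):
            residuals = X @ w - y                     # per-sample residuals
            per_sample_grad_j = residuals * X[:, j]   # per-sample partial derivative wrt w_j
            g_j = median_of_means(per_sample_grad_j, n_blocks)
            w[j] -= lr * g_j                          # update only coordinate j
    return w

# toy usage with a few corrupted labels
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
w_true = np.array([1.0, -2.0, 0.5, 0.0, 3.0])
y = X @ w_true + 0.1 * rng.normal(size=200)
y[:5] += 50.0  # corrupted rows
print(robust_coordinate_descent(X, y))
```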
This paper applies an idea of adaptive momentum for the nonlinear conjugate gradient method to accelerate optimization problems in sparse recovery. Specifically, we consider two types of minimization problems: a (single) differentiable function, and the sum of a differentiable and a non-smooth function. In the first case, we adopt a fixed step size to avoid the traditional line search and establish a convergence analysis of the proposed algorithm for a quadratic proble...
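The truncated abstract does not give the adaptive momentum rule, so the sketch below only shows the overall structure it describes: a nonlinear conjugate gradient iteration with a fixed step size instead of a line search, using the Fletcher-Reeves coefficient as the "momentum" together with a simple restart safeguard. Function names and constants are illustrative assumptions, not the paper's scheme.

```python
import numpy as np

def ncg_fixed_step(grad, x0, alpha=0.05, n_iters=200):
    """Nonlinear conjugate gradient with a fixed step size (no line search).
    The coefficient beta couples the new search direction to the previous one;
    here it is the Fletcher-Reeves formula, with a restart whenever the
    direction stops being a descent direction."""
    x = x0.copy()
    g = grad(x)
    d = -g
    for _ in range(n_iters):
        x = x + alpha * d                            # fixed step size instead of a line search
        g_new = grad(x)
        beta = (g_new @ g_new) / (g @ g + 1e-12)     # Fletcher-Reeves "momentum"
        d = -g_new + beta * d
        if d @ g_new >= 0:                           # safeguard: restart on non-descent direction
            d = -g_new
        g = g_new
    return x

# quadratic test problem: minimize 0.5 x^T A x - b^T x
A = np.array([[3.0, 0.5], [0.5, 1.0]])
b = np.array([1.0, -2.0])
print(ncg_fixed_step(lambda x: A @ x - b, np.zeros(2)), np.linalg.solve(A, b))
```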
The plug-and-play framework makes it possible to integrate advanced image denoising priors into optimization algorithms, to efficiently solve a variety of restoration tasks generally formulated as maximum a posteriori (MAP) estimation problems. The alternating direction method of multipliers (ADMM) and regularization by denoising (RED) are two examples of such methods that made a breakthrough in restoration. However, ...
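For context, a minimal plug-and-play ADMM loop looks roughly like the following, with the prior's proximal step replaced by a call to a denoiser. The denoiser here is a trivial neighbour-averaging stand-in and the whole sketch is only meant to show the iteration structure under a simple denoising data term, not the method proposed in the paper.

```python
import numpy as np

def toy_denoiser(z, strength=0.25):
    """Stand-in for an advanced denoising prior: blend each pixel with the
    average of its 4 neighbours. In practice this would be BM3D, a CNN, etc."""
    padded = np.pad(z, 1, mode="edge")
    neighbours = (padded[:-2, 1:-1] + padded[2:, 1:-1] +
                  padded[1:-1, :-2] + padded[1:-1, 2:]) / 4.0
    return (1 - strength) * z + strength * neighbours

def pnp_admm_denoise(y, sigma2=0.1, rho=1.0, n_iters=30):
    """Plug-and-play ADMM sketch for the MAP problem
    min_x 0.5/sigma2 * ||x - y||^2 + prior(x),
    where the prior's proximal operator is replaced by a denoiser call."""
    x = y.copy(); z = y.copy(); u = np.zeros_like(y)
    for _ in range(n_iters):
        # x-update: proximal step of the quadratic data-fidelity term (closed form)
        x = (y / sigma2 + rho * (z - u)) / (1.0 / sigma2 + rho)
        # z-update: prior prox replaced by the plugged-in denoiser
        z = toy_denoiser(x + u)
        # u-update: scaled dual ascent
        u = u + x - z
    return x

y = np.random.default_rng(1).normal(0.5, 0.3, size=(32, 32))
print(pnp_admm_denoise(y).shape)
```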
In this paper, we present multiple approaches for improving the performance of gradient descent when utilizing multiple compute resources. The proposed approaches span a solution space ranging from equivalence to running on a single compute device to delaying gradient updates a fixed number of times. We present a new approach, asynchronous layer-wise gradient descent, that maximizes overlap of la...
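The delayed-update end of that solution space can be sketched as follows: gradients are computed at the current iterate but applied only a fixed number of steps later. This toy example (the names and the quadratic objective are assumptions) shows only the staleness mechanism, not the paper's asynchronous layer-wise scheme.

```python
from collections import deque
import numpy as np

def delayed_gradient_descent(grad, w0, lr=0.1, delay=3, n_iters=100):
    """Gradient descent where each update uses a gradient computed `delay`
    steps earlier, mimicking the staleness introduced when updates are
    postponed so communication can overlap with computation."""
    w = w0.copy()
    pending = deque()
    for _ in range(n_iters):
        pending.append(grad(w))              # gradient computed at the current iterate
        if len(pending) > delay:
            w = w - lr * pending.popleft()   # ...but applied only `delay` steps later
    return w

# toy quadratic: minimize ||w - 1||^2
print(delayed_gradient_descent(lambda w: 2 * (w - 1.0), np.zeros(4)))
```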
In this paper, we propose to train the RBF neural network using a global descent method. Essentially, the method imposes a monotonic transformation on the training objective to improve numerical sensitivity without altering the relative orders of all local extrema. A gradient descent search which inherits the global descent property is derived to locate the global solution of an error objective...
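A minimal sketch of gradient descent on a monotonically transformed objective is shown below; the transformation g(E) = log(1 + E) is chosen purely for illustration and is not the transformation used in the paper, but it shows why the extrema and their relative order are preserved while the effective step size changes with g'(E).

```python
import numpy as np

def transformed_gd(error, grad_error, w0, lr=0.5, n_iters=200):
    """Gradient descent on g(E(w)) for a strictly increasing g. The chain rule
    gives grad g(E(w)) = g'(E(w)) * grad E(w), so stationary points and their
    relative ordering are unchanged; only the step is rescaled by g'(E)."""
    w = w0.copy()
    for _ in range(n_iters):
        E = error(w)
        scale = 1.0 / (1.0 + E)          # g'(E) for g(E) = log(1 + E)
        w = w - lr * scale * grad_error(w)
    return w

# toy error surface with a single minimum at w = (2, -1)
target = np.array([2.0, -1.0])
error = lambda w: np.sum((w - target) ** 2)
grad_error = lambda w: 2 * (w - target)
print(transformed_gd(error, grad_error, np.zeros(2)))
```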
Stochastic gradient descent procedures have gained popularity for parameter estimation from large data sets. However, their statistical properties are not well understood, in theory. And in practice, avoiding numerical instability requires careful tuning of key parameters. Here, we introduce implicit stochastic gradient descent procedures, which involve parameter updates that are implicitly def...
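For the least-squares case the implicit update has a simple closed form, which the sketch below uses; variable names and the decaying learning-rate schedule are illustrative assumptions rather than details taken from the paper.

```python
import numpy as np

def implicit_sgd_least_squares(X, y, lr0=1.0, n_epochs=2):
    """Implicit SGD for least squares. Standard SGD updates
        theta <- theta - lr * (x.theta - y) * x
    with the gradient at the *current* iterate. The implicit procedure instead
    evaluates the gradient at the *new* iterate,
        theta_new = theta - lr * (x.theta_new - y) * x,
    which for squared loss solves in closed form to shrinking the step by
    1 / (1 + lr * ||x||^2). This built-in shrinkage is what reduces the
    sensitivity to the learning rate."""
    n, d = X.shape
    theta = np.zeros(d)
    t = 0
    for _ in range(n_epochs):
        for i in range(n):
            t += 1
            lr = lr0 / t                              # decaying learning rate
            x_i, y_i = X[i], y[i]
            resid = x_i @ theta - y_i
            theta = theta - (lr / (1.0 + lr * x_i @ x_i)) * resid * x_i
    return theta

rng = np.random.default_rng(2)
X = rng.normal(size=(500, 3))
theta_true = np.array([1.0, -1.0, 2.0])
y = X @ theta_true + 0.1 * rng.normal(size=500)
print(implicit_sgd_least_squares(X, y, lr0=5.0))  # stable even with a large initial rate
```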
Stochastic gradient descent-based algorithms are typically used as the general optimization tools for most deep learning models. A Restricted Boltzmann Machine (RBM) is a probabilistic generative model that can be stacked to construct deep architectures. For RBMs with Bernoulli inputs, a non-Euclidean algorithm such as stochastic spectral descent (SSD) has been specifically designed to speed up th...
We study whether a depth-two neural network can learn another depth-two network using gradient descent. Assuming a linear output node, we show that the question of whether gradient descent converges to the target function is equivalent to the following question in electrodynamics: Given k fixed protons in R^d, and k electrons, each moving due to the attractive force from the protons and repulsive...
Reactive (memoryless) policies are sufficient in completely observable Markov decision processes (MDPs), but some kind of memory is usually necessary for optimal control of a partially observable MDP. Policies with finite memory can be represented as finite-state automata. In this paper, we extend Baird and Moore’s VAPS algorithm to the problem of learning general finite-state automata. Because...