نتایج جستجو برای: data augmentation

تعداد نتایج: 2428395  

2015
Brian McFee Eric J. Humphrey Juan Pablo Bello

Predictive models for music annotation tasks are practically limited by a paucity of well-annotated training data. In the broader context of large-scale machine learning, the concept of “data augmentation” — supplementing a training set with carefully perturbed samples — has emerged as an important component of robust systems. In this work, we develop a general software framework for augmenting...

Journal: :CoRR 2015
Hugh Perkins Minjie Xu Jun Zhu Bo Zhang

As one of the most popular classifiers, linear SVMs still have challenges in dealing with very large-scale problems, even though linear or sub-linear algorithms have been developed recently on single machines. Parallel computing methods have been developed for learning large-scale SVMs. However, existing methods rely on solving local sub-optimization problems. In this paper, we develop a novel ...

2016
William Fithian

If a document is about travel, we may expect that short snippets of the document should also be about travel. We introduce a general framework for incorporating these types of invariances into a discriminative classifier. The framework imagines data as being drawn from a slice of a Lévy process. If we slice the Lévy process at an earlier point in time, we obtain additional pseudo-examples, whic...

Journal: :CoRR 2017
Luke Taylor Geoff S. Nitschke

Deep artificial neural networks require a large corpus of training data in order to effectively learn, where collection of such training data is often expensive and laborious. Data augmentation overcomes this issue by artificially inflating the training set with label preserving transformations. Recently there has been extensive use of generic data augmentation to improve Convolutional Neural N...

2011
Vivekananda Roy

The sandwich algorithm (SA) is an alternative to the data augmentation (DA) algorithm that uses an extra simulation step at each iteration. In this paper, we show that the sandwich algorithm always converges at least as fast as the DA algorithm, in the Markov operator norm sense. We also establish conditions under which the spectrum of SA dominates that of DA. An example illustrates the results.

2017
Chuanhai Zhang Wallapak Tavanapong Johnny S. Wong Piet C. de Groen Jung-Hwan Oh

Many medical image classification tasks share a common unbalanced data problem. That is images of the target classes, e.g., certain types of diseases, only appear in a very small portion of the entire dataset. Nowadays, large co llections of medical images are readily available. However, it is costly and may not even be feasible for medical experts to manually comb through a huge unlabeled data...

Journal: :Statistics and Computing 2015
Peter Neal Theodore Kypraios

Data augmentation is a common tool in Bayesian statistics, especially in the application of MCMC. Data augmentation is used where direct computation of the posterior density, π(θ |x), of the parameters θ , given the observed data x, is not possible. We show that for a range of problems, it is possible to augment the data by y, such that, π(θ |x,y) is known, and π(y|x) can easily be computed. In...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید