نتایج جستجو برای: deep seq2seq network

تعداد نتایج: 847003  

2017
Aosen Wang Hua Zhou Wenyao Xu Xin Chen

In recent years, deep neural network exhibits its powerful superiority on information discrimination in many computer vision applications. However, the capacity of deep neural network architecture is still a mystery to the researchers. Intuitively, larger capacity of neural network can always deposit more information to improve the discrimination ability of the model. But, the learnable paramet...

2017
Ji Feng Zhi-Hua Zhou

In many real world applications, the concerned objects are with multiple labels, and can be represented as a bag of instances. Multi-instance Multi-label (MIML) learning provides a framework for handling such task and has exhibited excellent performance in various domains. In a MIML setting, the feature representation of instances usually has big impact on the final performance; inspired by the...

Journal: :CoRR 2017
Yunhao Tang Alp Kucukelbir

We propose a framework that directly tackles the probability distribution of the value function parameters in Deep Q Network (DQN), with powerful variational inference subroutines to approximate the posterior of the parameters. We will establish the equivalence between our proposed surrogate objective and variational inference loss. Our new algorithm achieves efficient exploration and performs ...

Journal: :CoRR 2017
Kevin Cremanns Dirk Roos

The correlation length-scale θ next to the noise variance σ are the most used hyperparameters for the Gaussian processes GP . Typically stationary covariance functions k(xi,xj) are used, which are only dependent on the distances between input points τ = ||xi − xj || and thus invariant to the translations in the input space X . The optimization of the hyperparameters is commonly done by maximizi...

Journal: :CoRR 2016
Chaoyun Zhang Mingjun Zhong Zongzuo Wang Nigel H. Goddard Charles A. Sutton

Energy disaggregation (a.k.a nonintrusive load monitoring, NILM), a single-channel blind source separation problem, aims to decompose the mains which records the whole house electricity consumption into appliance-wise readings. This problem is difficult because it is inherently unidentifiable. Recent approaches have shown that the identifiability problem could be reduced by introducing domain k...

سیدصالحی, سیده زهره , سیدصالحی, سید علی ,

In this paper, we propose efficient method for pre-training of deep bottleneck neural network (DBNN). Pre-training is used for initial value of network weights convergence of DBNN is difficult because of different local minimums. While with efficient initial value for network weights can avoided some local minimums. This method divides DBNN to multi single hidden layer and adjusts them, then we...

2010
Yuanqing Lin Tong Zhang Shenghuo Zhu Kai Yu

This paper proposes a principled extension of the traditional single-layer flat sparse coding scheme, where a two-layer coding scheme is derived based on theoretical analysis of nonlinear functional approximation that extends recent results for local coordinate coding. The two-layer approach can be easily generalized to deeper structures in a hierarchical multiple-layer manner. Empirically, it ...

Journal: :CoRR 2017
Junying Li Zichen Yang Haifeng Liu Deng Cai

Recently, learning equivariant representations has attracted considerable research attention. Dieleman et al. introduce four operations which can be inserted into convolutional neural network to learn deep representations equivariant to rotation. However, feature maps should be copied and rotated four times in each layer in their approach, which causes much running time and memory overhead. In ...

Journal: :CoRR 2014
Ludovic Denoyer Patrick Gallinari

Neural Networks sequentially build high-level features through their successive layers. We propose here a new neural network model where each layer is associated with a set of candidate mappings. When an input is processed, at each layer, one mapping among these candidates is selected according to a sequential decision process. The resulting model is structured according to a DAG like architect...

Journal: :CoRR 2016
Xin Wang Siu-Ming Yiu

Based on API call sequences, semantic-aware and machine learning (ML) based malware classifiers can be built for malware detection or classification. Previous works concentrate on crafting and extracting various features from malware binaries, disassembled binaries or API calls via static or dynamic analysis and resorting to ML to build classifiers. However, they tend to involve too much featur...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید