lstm

Empirical Evaluation of A New Approach to Simplifying Long Short-term Memory (LSTM)

Journal: :CoRR 2016

Yuzhen Lu

The standard LSTM, although it succeeds in the modeling long-range dependences, suffers from a highly complex structure that can be simplified through modifications to its gate units. This paper was to perform an empirical comparison between the standard LSTM and three new simplified variants that were obtained by eliminating input signal, bias and hidden unit signal from individual gates, on t...

متن کامل

Reducing the Computational Complexity of Two-Dimensional LSTMs

2017

Bo Li Tara N. Sainath

Long Short-Term Memory Recurrent Neural Networks (LSTMs) are good at modeling temporal variations in speech recognition tasks, and have become an integral component of many state-of-the-art ASR systems. More recently, LSTMs have been extended to model variations in the speech signal in two dimensions, namely time and frequency [1, 2]. However, one of the problems with two-dimensional LSTMs, suc...

متن کامل

LSTM, GRU, Highway and a Bit of Attention: An Empirical Overview for Language Modeling in Speech Recognition

2016

Kazuki Irie Zoltán Tüske Tamer Alkhouli Ralf Schlüter Hermann Ney

Popularized by the long short-term memory (LSTM), multiplicative gates have become a standard means to design artificial neural networks with intentionally organized information flow. Notable examples of such architectures include gated recurrent units (GRU) and highway networks. In this work, we first focus on the evaluation of each of the classical gated architectures for language modeling fo...

متن کامل

Towards Online-Recognition with Deep Bidirectional LSTM Acoustic Models

2016

Albert Zeyer Ralf Schlüter Hermann Ney

Online-Recognition requires the acoustic model to provide posterior probabilities after a limited time delay given the online input audio data. This necessitates unidirectional modeling and the standard solution is to use unidirectional long short-term memory (LSTM) recurrent neural networks (RNN) or feedforward neural networks (FFNN). It is known that bidirectional LSTMs are more powerful and ...

متن کامل

Model-free prediction of noisy chaotic time series by deep learning

Journal: :CoRR 2017

Kyongmin Yeo

We present a deep neural network for a model-free prediction of a chaotic dynamical system from noisy observations. The proposed deep learning model aims to predict the conditional probability distribution of a state variable. The Long Short-Term Memory network (LSTM) is employed to model the nonlinear dynamics and a softmax layer is used to approximate a probability distribution. The LSTM mode...

متن کامل

Forward-Backward Convolutional LSTM for Acoustic Modeling

2017

Shigeki Karita Atsunori Ogawa Marc Delcroix Tomohiro Nakatani

An automatic speech recognition (ASR) performance has greatly improved with the introduction of convolutional neural network (CNN) or long-short term memory (LSTM) for acoustic modeling. Recently, a convolutional LSTM (CLSTM) has been proposed to directly use convolution operation within the LSTM blocks and combine the advantages of both CNN and LSTM structures into a single architecture. Thi...

متن کامل

Self-Guiding Multimodal LSTM - when we do not have a perfect training dataset for image captioning

Journal: :CoRR 2017

Yang Xian Yingli Tian

In this paper, a self-guiding multimodal LSTM (sg-LSTM) image captioning model is proposed to handle uncontrolled imbalanced real-world image-sentence dataset. We collect FlickrNYC dataset from Flickr as our testbed with 306, 165 images and the original text descriptions uploaded by the users are utilized as the ground truth for training. Descriptions in FlickrNYC dataset vary dramatically rang...

متن کامل

Traffic Prediction Based on Random Connectivity in Deep Learning with Long Short-Term Memory

Journal: :CoRR 2017

Yuxiu Hua Zhifeng Zhao Rongpeng Li Xianfu Chen Zhiming Liu Honggang Zhang

Traffic prediction plays an important role in evaluating the performance of telecommunication networks and attracts intense research interests. A significant number of algorithms and models have been put forward to analyse traffic data and make prediction. In the recent big data era, deep learning has been exploited to mine the profound information hidden in the data. In particular, Long Short-...

متن کامل

Deep Learning for Query Semantic Domains Classification

2016

I-Ting Fang

6 Long Short Term Memory (LSTM), a type of recurrent neural network, has 7 been widely used for Language Model. One of the application is speech query 8 domain classification where LSTM is shown to be more effective than 9 traditional statistic models and feedforward neural networks. Different from 10 speech queries, text queries to search engines are usually shorter and lack of 11 correct gram...

متن کامل

Hashtag Recommendation with Topical Attention-Based LSTM

2016

Yang Li Ting Liu Jing Jiang Liang Zhang

Microblogging services allow users to create hashtags to categorize their posts. In recent years, the task of recommending hashtags for microblogs has been given increasing attention. However, most of existing methods depend on hand-crafted features. Motivated by the successful use of long short-term memory (LSTM) for many natural language processing tasks, in this paper, we adopt LSTM to learn...

متن کامل