Next-Flow: Hybrid Multi-Tasking with Next-Frame Prediction to Boost Optical-Flow Estimation in the Wild
نویسنده
چکیده
CNN-based optical flow estimation has attracted attention recently, mainly due to its impressively high frame rates. These networks perform well on synthetic datasets, but they are still far behind the classical methods in realworld videos. This is because there is no ground truth optical flow for training these networks on real data. In this paper, we boost CNN-based optical flow estimation in real scenes with the help of the freely available self-supervised task of next-frame prediction. To this end, we train the network in a hybrid way, providing it with a mixture of synthetic and real videos. With the help of a sample-variant multi-tasking architecture, the network is trained on different tasks depending on the availability of ground-truth. We also experiment with the prediction of “next-flow” instead of estimation of the current flow, which is intuitively closer to the task of next-frame prediction and yields favorable results. We demonstrate the improvement in optical flow estimation on the real-world KITTI benchmark. Additionally, we test the optical flow indirectly in an action classification scenario. As a side product of this work, we report significant improvements over state-of-the-art in the task of next-frame prediction.
منابع مشابه
Hybrid Models Performance Assessment to Predict Flow of Gamasyab River
Awareness of the level of river flow and its fluctuations at different times is one of the significant factor to achieve sustainable development for water resource issues. Therefore, the present study two hybrid models, Wavelet- Adaptive Neural Fuzzy Interference System (WANFIS) and Wavelet- Artificial Neural Network (WANN) are used for flow prediction of Gamasyab River (Nahavand, Hamedan, Iran...
متن کاملHybrid Models Performance Assessment to Predict Flow of Gamasyab River
Awareness of the level of river flow and its fluctuations at different times is one of the significant factor to achieve sustainable development for water resource issues. Therefore, the present study two hybrid models, Wavelet- Adaptive Neural Fuzzy Interference System (WANFIS) and Wavelet- Artificial Neural Network (WANN) are used for flow prediction of Gamasyab River (Nahavand, Hamedan, Iran...
متن کاملSpatio-temporal video autoencoder with differentiable memory
We describe a new spatio-temporal video autoencoder, based on a classic spatial image autoencoder and a novel nested temporal autoencoder. The temporal encoder is represented by a differentiable visual memory composed of convolutional long short-term memory (LSTM) cells that integrate changes over time. Here we target motion changes and use as temporal decoder a robust optical flow prediction m...
متن کاملDifferentiable Memory
We describe a new spatio-temporal video autoencoder, based on a classic spatial image autoencoder and a novel nested temporal autoencoder. The temporal encoder is represented by a differentiable visual memory composed of convolutional long short-term memory (LSTM) cells that integrate changes over time. Here we target motion changes and use as temporal decoder a robust optical flow prediction m...
متن کاملEstimation of optical flow for large displacements
In this paper we present a new method to estimate optical flow for large displacements. It is based on prediction of global flow field parameters, performs better than multiresolution estimation methods and has been verified using standard test sequences as well as real-world data. Global flow field parameters can be estimated from optical flow measurements in all flow regions. They can then be...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1612.03777 شماره
صفحات -
تاریخ انتشار 2016