Volume 10, Issue 1
  • ISSN: 2213-2759
  • E-ISSN: 1874-4796

Abstract

Background: Deep neural network based methods have achieved great progress in a variety of computer vision tasks, as described in various patents. However, modeling temporal dependencies for recognizing object movement from videos remains a challenging task. Method: In this paper, we propose a multi-timescale gated neural network for encoding temporal dependencies in videos. The model stacks multiple gated layers into a recurrent pyramid, making it possible to hierarchically model not only pairwise but also long-term dependencies across video frames. Additionally, the model incorporates convolutional neural networks into its structure, which exploits the pictorial nature of the frames and reduces the number of model parameters. Result: We evaluated the proposed model on the synthetic bouncing-MNIST dataset, the standard action recognition benchmark UCF101, and the facial expression benchmark CK+. The experimental results reveal that, on all tasks, the proposed model outperforms the existing approach to building deep stacked gated models and achieves superior performance compared with several recent state-of-the-art techniques. Conclusion: From the experimental results, we conclude that the proposed model is able to adapt its structure to different time scales and can be applied to motion estimation, action recognition, tracking, etc.
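The abstract does not give implementation details, but the general idea it describes — convolutional gated recurrent layers stacked so that higher layers integrate over longer timescales — can be sketched as follows. This is a hypothetical, minimal NumPy illustration, not the authors' architecture: the GRU-style gates, the 3x3 kernels, the single-channel maps, and the power-of-two update clocks per layer are all illustrative assumptions.

```python
import numpy as np

def conv2d(x, w):
    # "Same"-padded convolution of a single-channel map x with a 3x3 kernel w
    # (illustrative scalar-channel version; a real model would use multi-channel convs).
    H, W = x.shape
    pad = np.pad(x, 1)
    out = np.zeros_like(x)
    for i in range(H):
        for j in range(W):
            out[i, j] = np.sum(pad[i:i + 3, j:j + 3] * w)
    return out

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

class ConvGatedLayer:
    """One convolutional gated recurrent layer (GRU-style gates with conv transforms)."""
    def __init__(self, shape, rng):
        self.h = np.zeros(shape)  # hidden state, same spatial size as the input map
        # 3x3 kernels: update gate (z), reset gate (r), candidate state (c)
        self.wz, self.uz, self.wr, self.ur, self.wc, self.uc = (
            rng.standard_normal((3, 3)) * 0.1 for _ in range(6))

    def step(self, x):
        z = sigmoid(conv2d(x, self.wz) + conv2d(self.h, self.uz))
        r = sigmoid(conv2d(x, self.wr) + conv2d(self.h, self.ur))
        c = np.tanh(conv2d(x, self.wc) + conv2d(r * self.h, self.uc))
        self.h = (1 - z) * self.h + z * c  # gated state update
        return self.h

class MultiTimescaleStack:
    """Stack of gated layers where layer k updates only every 2**k frames,
    so higher layers integrate information over longer timescales."""
    def __init__(self, n_layers, shape, seed=0):
        rng = np.random.default_rng(seed)
        self.layers = [ConvGatedLayer(shape, rng) for _ in range(n_layers)]

    def step(self, frame, t):
        inp = frame
        for k, layer in enumerate(self.layers):
            if t % (2 ** k) == 0:      # slower clock for higher layers
                inp = layer.step(inp)
            else:
                inp = layer.h          # pass the frozen state upward
        return inp
```

Feeding a sequence of frames through `MultiTimescaleStack.step` yields a top-layer state that changes only every few frames, which is one simple way to realize the multi-timescale behavior the abstract describes.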

/content/journals/cseng/10.2174/2213275910666170502144924
2017-02-01