Beyond Frame-level CNN: Saliency-Aware 3-D CNN With LSTM for Video Action Recognition

Volume: 24, Issue: 4, Pages: 510 - 514
Published: Apr 1, 2017
Abstract
Human activity recognition in videos with convolutional neural network (CNN) features has received increasing attention in multimedia understanding. Taking videos as a sequence of frames, a new record was recently set on several benchmark datasets by feeding frame-level CNN sequence features to long short-term memory (LSTM) model for video activity recognition. This recurrent model-based visual recognition pipeline is a natural choice for...
Paper Details
Title
Beyond Frame-level CNN: Saliency-Aware 3-D CNN With LSTM for Video Action Recognition
Published Date
Apr 1, 2017
Volume
24
Issue
4
Pages
510 - 514
Citation AnalysisPro
  • Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
  • Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.