SciTePress - Publication Details
loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Felipe F. Costa ; Priscila T. M. Saito and Pedro H. Bugatti

Affiliation: Department of Computing, Federal Unversity of Technology - Parana, 1640 Alberto Carazzai Ave., Cornelio Procopio, Brazil

Keyword(s): Deep Learning, Graph Convolutional Network, Computer Vision, Action Classification.

Abstract: Video classification methods have been evolving through proposals based on end-to-end deep learning architectures. Several works have testified that end-to-end models are effective for the learning of intrinsic video features, especially when compared to the handcrafted ones. In general, convolutional neural networks are used for deep learning in videos. Usually, when applied to such contexts, these vanilla deep learning networks cannot identify variations based on temporal information. To do so, memory-based cells (e.g. long-short term memory), or even optical flow techniques are used in conjunction with the convolutional process. However, despite their effectiveness, those methods neglect global analysis, processing only a small quantity of frames in each batch during the learning and inference process. Moreover, they also completely ignore the semantic relationship between different videos that belong to the same context. Thus, the present work aims to fill these gaps by u sing information grouping concepts and contextual detection through graph-based convolutional neural networks. The experiments show that our method achieves up to 87% of accuracy in a well-known public video dataset. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 8.209.245.224

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Costa, F. ; Saito, P. and Bugatti, P. (2021). Video Action Classification through Graph Convolutional Networks. In Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2021) - Volume 4: VISAPP; ISBN 978-989-758-488-6; ISSN 2184-4321, SciTePress, pages 490-497. DOI: 10.5220/0010321304900497

@conference{visapp21,
author={Felipe F. Costa and Priscila T. M. Saito and Pedro H. Bugatti},
title={Video Action Classification through Graph Convolutional Networks},
booktitle={Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2021) - Volume 4: VISAPP},
year={2021},
pages={490-497},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010321304900497},
isbn={978-989-758-488-6},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2021) - Volume 4: VISAPP
TI - Video Action Classification through Graph Convolutional Networks
SN - 978-989-758-488-6
IS - 2184-4321
AU - Costa, F.
AU - Saito, P.
AU - Bugatti, P.
PY - 2021
SP - 490
EP - 497
DO - 10.5220/0010321304900497
PB - SciTePress