Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/78007

TítuloDeep learning for activity recognition using audio and video
Autor(es)Reinolds, Francisco
Neto, Cristiana
Machado, José Manuel
Palavras-chaveaction recognition
violence detection
real-time video stream
neural networks
audio classifiers
video classifiers
DataMar-2022
EditoraMDPI
RevistaElectronics
CitaçãoReinolds, F.; Neto, C.; Machado, J. Deep Learning for Activity Recognition Using Audio and Video. Electronics 2022, 11, 782. https://doi.org/10.3390/electronics11050782
Resumo(s)Neural networks have established themselves as powerhouses in what concerns several types of detection, ranging from human activities to their emotions. Several types of analysis exist, and the most popular and successful is video. However, there are other kinds of analysis, which, despite not being used as often, are still promising. In this article, a comparison between audio and video analysis is drawn in an attempt to classify violence detection in real-time streams. This study, which followed the CRISP-DM methodology, made use of several models available through PyTorch in order to test a diverse set of models and achieve robust results. The results obtained proved why video analysis has such prevalence, with the video classification handily outperforming its audio classification counterpart. Whilst the audio models attained on average 76% accuracy, video models secured average scores of 89%, showing a significant difference in performance. This study concluded that the applied methods are quite promising in detecting violence, using both audio and video.
TipoArtigo
URIhttps://hdl.handle.net/1822/78007
DOI10.3390/electronics11050782
e-ISSN2079-9292
Versão da editorahttps://www.mdpi.com/2079-9292/11/5/782
Arbitragem científicayes
AcessoAcesso aberto
Aparece nas coleções:CAlg - Artigos em revistas internacionais / Papers in international journals

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
electronics-11-00782.pdf2,11 MBAdobe PDFVer/Abrir

Este trabalho está licenciado sob uma Licença Creative Commons Creative Commons

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID