Use this identifier to cite or link to this record: https://hdl.handle.net/1822/86310

Full record
DC Field | Value | Language
dc.contributor.author | Veloso, Bruno | por
dc.contributor.author | Durães, Dalila | por
dc.contributor.author | Novais, Paulo | por
dc.date.accessioned | 2023-09-06T11:00:37Z | -
dc.date.issued | 2022 | -
dc.identifier.citation | Veloso, B., Durães, D., Novais, P. (2022). Analysis of Machine Learning Algorithms for Violence Detection in Audio. In: González-Briones, A., et al. Highlights in Practical Applications of Agents, Multi-Agent Systems, and Complex Systems Simulation. The PAAMS Collection. PAAMS 2022. Communications in Computer and Information Science, vol 1678. Springer, Cham. https://doi.org/10.1007/978-3-031-18697-4_17 | por
dc.identifier.isbn | 978-3-031-18696-7 | -
dc.identifier.issn | 1865-0929 | -
dc.identifier.uri | https://hdl.handle.net/1822/86310 | -
dc.description.abstract | Violence has always been part of humanity, however, there are different types of violence, with physical violence being the most recurrent in our daily lives. This type of violence increasingly affects many people’s lives, so it is essential to try to combat violence. In recent years, human action recognition has been extensively studied, but mainly in video, an important computer vision area. Audio appears as a factor capable of circumventing these problems. Audio sensors can be omnidirectional, requiring less processing power and hardware and software performance when compared to the video. The audio can represent emotions. It is not affected by lighting or temperature problems, nor does it need to be at a favourable angle to capture the intended information. That said, audio is seen as the best way to recognize violence, applied with Machine Learning/Deep Learning/Transfer Learning techniques. In this paper we test a Convolutional Neural Network (CNN), a ResNet50, VGG16 and VGG19, in order to classify audios. Later we see that CNN obtains the best results, with a 92.44% accuracy in the test set. ResNet50 was the worst model used, obtaining an 86.34% accuracy. For the VGG models, both show a good potential but did not get better results than CNN. | por
dc.description.sponsorship | This work is supported by: FCT Fundação para a Ciência e Tecnologia within the RD Units Project Scope: UIDB/00319/2020. | por
dc.language.iso | eng | por
dc.publisher | Springer, Cham | por
dc.relation | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F00319%2F2020/PT | por
dc.rights | restrictedAccess | por
dc.subject | Audio action recognition | por
dc.subject | Audio violence detection | por
dc.subject | Deep learning | por
dc.subject | Transfer learning | por
dc.title | Analysis of machine learning algorithms for violence detection in audio | por
dc.type | conferencePaper | por
dc.peerreviewed | yes | por
dc.relation.publisherversion | https://link.springer.com/chapter/10.1007/978-3-031-18697-4_17 | por
oaire.citationStartPage | 210 | por
oaire.citationEndPage | 221 | por
oaire.citationVolume | 1678 CCIS | por
dc.date.updated | 2023-08-01T00:05:47Z | -
dc.identifier.doi | 10.1007/978-3-031-18697-4_17 | por
dc.date.embargo | 10000-01-01 | -
dc.identifier.eisbn | 978-3-031-18697-4 | -
sdum.export.identifier | 12674 | -
sdum.journal | Communications in Computer and Information Science | por
oaire.version | AM | por
Appears in collections: CAlg - Artigos em livros de atas/Papers in proceedings

Files in this record:
File | Description | Size | Format
PAAMS22_paper_8476.pdf (restricted access) | | 2.79 MB | Adobe PDF
