Utilize este identificador para referenciar este registo:
https://hdl.handle.net/1822/52786
Título: | Multilingual voice control for endoscopic procedures |
Autor(es): | Afonso, Simão Pedro Oliveira Laranjo, Isabel Maria Cunha Braga, Joel Teles Alves, Victor Neves, José |
Palavras-chave: | Automatic speech recognition Endoscopic procedures Hidden Markov Models Pocketsphinx Sphinxtrain |
Data: | Jun-2015 |
Editora: | Springer International Publishing AG |
Revista: | Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (LNICST) |
Resumo(s): | In this paper it is present a solution to improve the current endoscopic exams’ workflow. These exams require complex procedures, such as using both hands to manipulate buttons and pressing a foot pedal at the same time, to perform simple tasks like capturing frames for posterior analysis. In addition to this downside, the act of capturing frames freezes the video. The developed software module was integrated with the MIVbox device, a device for the acquisition, processing and storage of the endoscopic results It uses libraries developed by the PocketSphinx project to recognize a small amount of commands. The module was fine-tuned for the Portuguese language which presents some specific difficulties with speech recognition. It was obtained a Word Error Rate (WER) of 23.3% for the English model and 29.1% for the Portuguese one. |
Tipo: | Artigo em ata de conferência |
URI: | https://hdl.handle.net/1822/52786 |
ISBN: | 978-3-319-19655-8 |
DOI: | 10.1007/978-3-319-19656-5_33 |
ISSN: | 1867-8211 |
Versão da editora: | https://link.springer.com/chapter/10.1007/978-3-319-19656-5_33 |
Arbitragem científica: | yes |
Acesso: | Acesso restrito UMinho |
Aparece nas coleções: |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
HIOT14.pdf Acesso restrito! | 241,25 kB | Adobe PDF | Ver/Abrir |