Utilize este identificador para referenciar este registo:
https://hdl.handle.net/1822/2047
Título: | Spectral normalization MFCC derived features for robust speech recognition |
Autor(es): | Lima, C. S. Tavares, Adriano Silva, Carlos A. Oliveira, Jorge F. |
Palavras-chave: | Robust speech recognition Features mapping |
Data: | Set-2004 |
Citação: | SPECOM'2004. INTERNATIONAL CONFERENCE SPEECH AND COMPUTER, 9, Saint Petersburg, 2004. |
Resumo(s): | This paper presents a method for extracting MFCC parameters from a normalised power spectrum density. The underlined spectral normalisation method is based on the fact that the speech regions with less energy need more robustness, since in these regions the noise is more dominant, thus the speech is more corrupted. Less energy speech regions contain usually sounds of unvoiced nature where are included nearly half of the consonants, and are by nature the least reliable ones due to the effective noise presence even when the speech is acquired under controlled conditions. This spectral normalisation was tested under additive artificial white noise in an Isolated Speech Recogniser and showed very promising results [1]. It is well known that concerned to speech representation, MFCC parameters appear to be more effective than power spectrum based features. This paper shows how the cepstral speech representation can take advantage of the above-referred spectral normalisation and shows some results in the continuous speech recognition paradigm in clean and artificial noise conditions. |
Tipo: | Artigo em ata de conferência |
URI: | https://hdl.handle.net/1822/2047 |
Arbitragem científica: | yes |
Acesso: | Acesso aberto |
Aparece nas coleções: | DEI - Artigos em atas de congressos internacionais |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
SPECOM2004.pdf | 209,49 kB | Adobe PDF | Ver/Abrir |