Spectral normalization MFCC derived features for robust speech recognition

Please use this identifier to cite or link to this record: https://hdl.handle.net/1822/2047

Title:	Spectral normalization MFCC derived features for robust speech recognition
Author(s):	Lima, C. S.; Tavares, Adriano; Silva, Carlos A.; Oliveira, Jorge F.
Keywords:	Robust speech recognition; Features mapping
Date:	Sep-2004
Citation:	SPECOM'2004. INTERNATIONAL CONFERENCE SPEECH AND COMPUTER, 9, Saint Petersburg, 2004.
Abstract(s):	This paper presents a method for extracting MFCC parameters from a normalised power spectral density. The underlying spectral normalisation method is based on the observation that speech regions with less energy need more robustness, since noise is more dominant in these regions and the speech is therefore more corrupted. Low-energy speech regions usually contain unvoiced sounds, which include nearly half of the consonants, and are by nature the least reliable because noise is effectively present even when the speech is acquired under controlled conditions. This spectral normalisation was tested under additive artificial white noise in an isolated speech recogniser and showed very promising results [1]. It is well known that, for speech representation, MFCC parameters tend to be more effective than power-spectrum-based features. This paper shows how the cepstral speech representation can take advantage of the above spectral normalisation and presents results for continuous speech recognition in clean and artificial noise conditions.
Type:	Conference paper
URI:	https://hdl.handle.net/1822/2047
Peer reviewed:	yes
Access:	Open access
Appears in collections:	DEI - Artigos em atas de congressos internacionais
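
Illustrative note: the pipeline named in the abstract (power spectrum, spectral normalisation, MFCC extraction) can be sketched roughly as below. This is a minimal sketch under stated assumptions, not the authors' implementation. The record does not describe the paper's normalisation rule, so `identity_normalisation` is a hypothetical placeholder that leaves the spectrum unchanged; the filter count, the number of coefficients, the mel formula, and all function names are likewise assumptions chosen for illustration.

```python
import math


def hz_to_mel(f):
    # Common mel-scale approximation (an assumption; the record names no formula).
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filterbank(n_filters, n_bins, sample_rate):
    """Triangular filters equally spaced on the mel scale.

    n_bins is the number of one-sided spectrum bins (n_fft // 2 + 1).
    """
    nyquist = sample_rate / 2.0
    mel_max = hz_to_mel(nyquist)
    # n_filters + 2 edge frequencies, equally spaced in mel.
    edges = [mel_to_hz(mel_max * i / (n_filters + 1)) for i in range(n_filters + 2)]
    bin_hz = [nyquist * k / (n_bins - 1) for k in range(n_bins)]
    bank = []
    for j in range(1, n_filters + 1):
        lo, mid, hi = edges[j - 1], edges[j], edges[j + 1]
        row = []
        for f in bin_hz:
            if lo < f <= mid:
                row.append((f - lo) / (mid - lo))
            elif mid < f < hi:
                row.append((hi - f) / (hi - mid))
            else:
                row.append(0.0)
        bank.append(row)
    return bank


def identity_normalisation(power_spectrum):
    # Hypothetical placeholder: the paper's normalisation is not given in this record.
    return list(power_spectrum)


def mfcc_from_power_spectrum(power_spectrum, sample_rate, n_filters=20,
                             n_ceps=12, normalise=identity_normalisation,
                             floor=1e-10):
    """Return n_ceps cepstral coefficients (c0 included) for one frame."""
    spectrum = normalise(power_spectrum)
    bank = mel_filterbank(n_filters, len(spectrum), sample_rate)
    # Log of each mel-band energy, floored to avoid log(0).
    log_e = [math.log(max(sum(w * p for w, p in zip(row, spectrum)), floor))
             for row in bank]
    # Unnormalised DCT-II of the log filterbank energies.
    return [sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters)
                for j, e in enumerate(log_e))
            for c in range(n_ceps)]


frame = [1.0] * 129  # flat one-sided power spectrum, e.g. from a 256-point FFT
coeffs = mfcc_from_power_spectrum(frame, sample_rate=8000)
```

A different normalisation can be tried by passing any callable that maps a power spectrum to a list of the same length as the `normalise` argument; the rest of the chain stays unchanged.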

Files in this record:

File	Description	Size	Format
SPECOM2004.pdf		209.49 kB	Adobe PDF
