Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/17403

TítuloEmploying compact intra-genomic language models to predict genomic sequences and characterize their entropy
Autor(es)Deusdado, Sérgio
Carvalho, Paulo
Palavras-chaveDNA entropy estimation
Genomic sequence modeling
Language models
genomic sequences modeling
DataJun-2010
EditoraSpringer
RevistaAdvances in Intelligent and Soft Computing
Resumo(s)Probabilistic models of languages are fundamental to understand and learn the profile of the subjacent code in order to estimate its entropy, enabling the verification and prediction of “natural” emanations of the language. Language models are devoted to capture salient statistical characteristics of the distribution of sequences of words, which transposed to the genomic language, allow modeling a predictive system of the peculiarities and regularities of genomic code in different inter and intra-genomic conditions. In this paper, we propose the application of compact intra-genomic language models to predict the composition of genomic sequences, aiming to achieve valuable resources for data compression and to contribute to enlarge the similarity analysis perspectives in genomic sequences. The obtained results encourage further investigation and validate the use of language models in biological sequence analysis.
TipoArtigo em ata de conferência
Descriçãohttp://iwpacbb2010.di.uminho.pt/
URIhttps://hdl.handle.net/1822/17403
ISBN9783642132131
ISSN1867-5662
Arbitragem científicayes
AcessoAcesso aberto
Aparece nas coleções:DI/CCTC - Artigos (papers)

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
IWPACBB_2010_deusdado_carvalho_CR.pdfIWPACBB-2010-cr.pdf (paper)261,88 kBAdobe PDFVer/Abrir

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID