Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/6293

TítuloProtein sequence pattern mining with constraints
Autor(es)Ferreira, Pedro Gabriel
Azevedo, Paulo J.
Palavras-chaveBioinformatics
Databases
Data2005
EditoraSpringer
RevistaLecture Notes in Computer Science
Citação“Lecture notes in computer science”. ISSN 0302-9743. 3721 (Nov. 2005) 96-107.
Resumo(s)Considering the characteristics of biological sequence databases, which typically have a small alphabet, a very long length and a relative small size (several hundreds of sequences), we propose a new sequence mining algorithm (gIL). gIL was developed for linear sequence pattern mining and results from the combination of some of the most efficient techniques used in sequence and itemset mining. The algorithm exhibits a high adaptability, yielding a smooth and direct introduction of various types of features into the mining process, namely the extraction of rigid and arbitrary gap patterns. Both breadth or a depth first traversal are possible. The experimental evaluation, in synthetic and real life protein databases, has shown that our algorithm has superior performance to state-of-the art algorithms. The use of constraints has also proved to be a very useful tool to specify user interesting patterns
TipoArtigo em ata de conferência
URIhttps://hdl.handle.net/1822/6293
ISBN978-3-540-29244-9
DOI10.1007/11564126_14
ISSN0302-9743
Arbitragem científicayes
AcessoAcesso aberto
Aparece nas coleções:DI/CCTC - Artigos (papers)

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
PKDD05_pja_1.pdfDocumento principal181,26 kBAdobe PDFVer/Abrir

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID