Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/37041

TítuloEnabling network inference methods to handle missing data and outliers
Autor(es)Folch-Fortuny, Abel
Villaverde, Alejandro F.
Ferrer, Alberto
Banga, Julio R.
Palavras-chaveNetwork inference
Missing data
Outlier detection
Projection to latent structures
Trimmed scores regression
Information theory
Mutual information
Data2015
EditoraBioMed Central (BMC)
RevistaBMC Bioinformatics
Resumo(s)The inference of complex networks from data is a challenging problem in biological sciences, as well as in a wide range of disciplines such as chemistry, technology, economics, or sociology. The quantity and quality of the data greatly affect the results. While many methodologies have been developed for this task, they seldom take into account issues such as missing data or outlier detection and correction, which need to be properly addressed before network inference. Results Here we present an approach to (i) handle missing data and (ii) detect and correct outliers based on multivariate projection to latent structures. The method, called trimmed scores regression (TSR), enables network inference methods to analyse incomplete datasets by imputing the missing values coherently with the latent data structure. Furthermore, it substitutes the faulty values in a dataset by proper estimations. We provide an implementation of this approach, and show how it can be integrated with any network inference method as a preliminary data curation step. This functionality is demonstrated with a state of the art network inference method based on mutual information distance and entropy reduction, MIDER. Conclusion The methodology presented here enables network inference methods to analyse a large number of incomplete and faulty datasets that could not be reliably analysed so far. Our comparative studies show the superiority of TSR over other missing data approaches used by practitioners. Furthermore, the method allows for outlier detection and correction.
TipoArtigo
URIhttps://hdl.handle.net/1822/37041
DOI10.1186/s12859-015-0717-7
ISSN1471-2105
e-ISSN1471-2105
Versão da editorahttp://www.biomedcentral.com/bmcbioinformatics
Arbitragem científicayes
AcessoAcesso aberto
Aparece nas coleções:CEB - Publicações em Revistas/Séries Internacionais / Publications in International Journals/Series

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
document_22359_1.pdf852,56 kBAdobe PDFVer/Abrir

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID