Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/89568

TítuloA comparative study of data mining techniques applied to renal-cell carcinomas
Autor(es)Duarte, Ana Rita C.
Peixoto, Hugo
Machado, José Manuel
Palavras-chaveData Mining
Life expectancy
RapidMiner
Renal-Cell Carcinoma
Survival
Data2022
EditoraSpringer
RevistaLecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (LNICST)
CitaçãoDuarte, A., Peixoto, H., Machado, J. (2022). A Comparative Study of Data Mining Techniques Applied to Renal-Cell Carcinomas. In: Spinsante, S., Silva, B., Goleva, R. (eds) IoT Technologies for Health Care. HealthyIoT 2021. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 432. Springer, Cham. https://doi.org/10.1007/978-3-030-99197-5_5
Resumo(s)Despite being one of the deadliest diseases and the enormous evolution in fighting it, the best methods to predict kidney cancer, namely Renal-Cell Carcinomas (RCC), are not well-known. One of the solutions to accelerate the current knowledge about RCC is through the use of Data Mining techniques based on patients' personal and clinical data. Therefore, it is crucial to understand which techniques are the most suitable to extract knowledge about this disease. In this paper, we followed the CRISP-DM methodology to simulate different techniques to determine the ones with the best predictive performance. For this purpose, we used a dataset of 821 records of RCC patients, obtained from The Cancer Genome Atlas. The present work tests different Data Mining techniques, that can be used to predict the 5-year life expectancy of patients with renal cancer and to predict the number of days to death for patients who have a life expectancy of less than 5 years. The results obtained demonstrated that the best algorithm for estimating the vital status at 5 years was Random Forest. This algorithm presented an accuracy of 87.65% and an AUROC of 0.931. For the prediction of days to death, the best performance was obtained with the k-Nearest Neighbors algorithm with a root mean square error of 354.6 days. The work suggested that Data Mining techniques can help to understand the influence of various risk factors on the life expectancy of patients with RCC.
TipoArtigo em ata de conferência
URIhttps://hdl.handle.net/1822/89568
ISBN978-3-030-99196-8
e-ISBN978-3-030-99197-5
DOI10.1007/978-3-030-99197-5_5
ISSN1867-8211
Versão da editorahttps://link.springer.com/chapter/10.1007/978-3-030-99197-5_5
Arbitragem científicayes
AcessoAcesso restrito UMinho
Aparece nas coleções:CAlg - Artigos em livros de atas/Papers in proceedings

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
renalcellcarcinomas.pdf
Acesso restrito!
390,94 kBAdobe PDFVer/Abrir

Este trabalho está licenciado sob uma Licença Creative Commons Creative Commons

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID