Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/66785

TítuloChallenging SQL-on-Hadoop performance with Apache Druid
Autor(es)Correia, José
Costa, Carlos A. P.
Santos, Maribel Yasmina
Palavras-chaveBig Data
Big Data Warehouse
SQL-on-Hadoop
Druid
OLAP
Data2019
EditoraSpringer Verlag
RevistaLecture Notes in Business Information Processing
Resumo(s)In Big Data, SQL-on-Hadoop tools usually provide satisfactory performance for processing vast amounts of data, although new emerging tools may be an alternative. This paper evaluates if Apache Druid, an innovative column-oriented data store suited for online analytical processing workloads, is an alternative to some of the well-known SQL-on-Hadoop technologies and its potential in this role. In this evaluation, Druid, Hive and Presto are benchmarked with increasing data volumes. The results point Druid as a strong alternative, achieving better performance than Hive and Presto, and show the potential of integrating Hive and Druid, enhancing the potentialities of both tools.
TipoArtigo em ata de conferência
URIhttps://hdl.handle.net/1822/66785
ISBN9783030204846
DOI10.1007/978-3-030-20485-3_12
ISSN1865-1348
Versão da editorahttps://link.springer.com/chapter/10.1007%2F978-3-030-20485-3_12
Arbitragem científicayes
AcessoAcesso aberto
Aparece nas coleções:CAlg - Artigos em livros de atas/Papers in proceedings

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
BIS_2019_paper_137.pdf642,43 kBAdobe PDFVer/Abrir

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID