Use this identifier to cite or link to this record:
https://hdl.handle.net/1822/38340
Title: | Task clustering on ETL systems – A pattern-oriented approach |
Author(s): | Oliveira, Bruno; Belo, O. |
Keywords: | Data Warehousing Systems; ETL Conceptual Modelling; Task Clustering; ETL Patterns; ETL Skeletons; BPMN; Kettle; BPMN specification models |
Date: | 20-Jul-2015 |
Publisher: | SCITEPRESS |
Abstract: | Usually, data warehousing populating processes are data-oriented workflows composed of dozens of granular tasks that are responsible for integrating data coming from different data sources. Specific subsets of these tasks can be grouped together with their relationships to form higher-level constructs. Increasing task granularity allows for the generalization of processes, simplifying their views and providing methods to carry expertise over to new applications. Well-proven practices can be used to describe general solutions that use basic skeletons configured and instantiated according to a set of specific integration requirements. Patterns can be applied to ETL processes aiming not only to simplify a possible conceptual representation but also to reduce the gap that often exists between two design perspectives. In this paper, we demonstrate the feasibility and effectiveness of an ETL pattern-based approach using task clustering, analyzing a real-world ETL scenario through the definitions of two commonly used clusters of tasks: a data lookup cluster and a data conciliation and integration cluster. |
Type: | Conference proceedings paper |
URI: | https://hdl.handle.net/1822/38340 |
ISBN: | 9789897581038 |
Peer reviewed: | yes |
Access: | Restricted access (UMinho) |
Appears in collections: |
Files in this record:
File | Description | Size | Format | |
---|---|---|---|---|
2015-CI-Data-Oliveira&Belo-CRP.pdf (restricted access) | Full published article. | 1.65 MB | Adobe PDF | View/Open |