Abstract
As companies grow (organically or inorganically), Data Administration (i.e. Stage 5 of Nolan’s IT growth model) becomes the next logical step in their IT evolution. Designing a Data Warehouse model, especially in the presence of legacy systems, is a challenging task. A lot of time and effort is consumed in understanding the existing data requirements, performing Dimensional and Fact modeling etc. This problem is further exacerbated if enterprise outsource their IT needs to external vendors. In such a situation no individual has a complete and in-depth view of the existing data setup. For such settings, a tool that can assist in building a data warehouse model from existing data models such that there is minimal impact to the business can be of immense value. In this paper we present the D’MART tool which addresses this problem. D’MART analyzes the existing data model of the enterprise and proposes alternatives for building the new data warehouse model. D’MART models the problem of identifying Fact/Dimension attributes of a warehouse model as a graph cut on a Dependency Analysis Graph (DAG). The DAG is built using the existing data models and the BI Report generation (SQL) scripts. The D’MART tool also uses the DAG for generation of ETL scripts that can be used to populate the newly proposed data warehouse from data present in the existing schemas. D’MART was developed and validated as part of an engagement with Indian Railways which operates one of the largest rail networks in the world.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Arora, S., Rao, S., Vazirani, U.: Expander flows, geometric embeddings and graph partitioning (2009)
Chowdhary, P., Mihaila, G., Lei, H.: Model driven data warehousing for business performance management (2006)
Doan, A., Domingos, P., Halevy, A.: Reconciling schemas of disparate data sources: a machine-learning approach (2001)
Edmonds, J., Karp, R.: Theoretical improvements in algorithmic efficiency for network flow problems (1972)
Golfarelli, M., Maio, D., Rizzi, S.: Conceptual design of data warehouses from e/r schemes (1998)
Kimball, R.: The Data Warehouse Toolkit: Practical Techniques For Building Dimensional Data Warehouse. John Wiley & Sons (1996)
Rahm, E., Bernstein, P.: A survey of approaches to automatic schema matching (2001)
Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms. MIT Press and McGraw-Hill (2009)
Westerman, P.: Data Warehousing using the Wal-Mart Model. Morgan Kaufmann (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Negi, S., Bhide, M.A., Batra, V.S., Mohania, M.K., Bajpai, S. (2012). D’MART: A Tool for Building and Populating Data Warehouse Model from Existing Reports and Tables. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds) Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32281-5_11
Download citation
DOI: https://doi.org/10.1007/978-3-642-32281-5_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32280-8
Online ISBN: 978-3-642-32281-5
eBook Packages: Computer ScienceComputer Science (R0)