Abstract
Many applications like digital libraries use Near-line Tertiary Storage Systems (TSS) to store their massive data. TSS consists of main memory, disks and tertiary devices. Typically highly referenced or recent data are stored on disks and historical data are stored on tertiary storage devices. We call it Data Dispatching: the determination of what kind of data should be stored on disks and what kind of data should be stored on tertiary storage devices. Traditionally it was up to the Database Management System (DBMS) administrator to dispatch data by hand. But DBMS has to take the responsibility if we want to bring tertiary storage devices under the control of DBMS. We proved in this paper that the data dispatching is an optimal problem and can be reduced to the famous binary knapsack problem. Experimental results showed that the average response time of TSS could be decreased by using optimal data dispatch method.
Supported by National Natural Science Foundation of China under Grant No.60273082; the National High-Tech Research and Development Plan of China under Grant No.2001AA41541; the National Grand Fundamental Research 973 Program of China under Grant No.G1999032704.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Sarawagi, S., Stonebraker, M.: Efficient Organization of Large Multidimensional Arrays. In: ICDE, pp. 328–336 (1994)
Chen, L.T., Drach, R., Keating, M., Louis, S., Rotem, D., Shoshani, A.: Efficient Organization and Access of Multi-Dimensional Datasets on Tertiary Storage Systems. Information Systems Journal 20, 155–183 (1995)
Christodoulakis, S., Triantafillou, P., Zioga, F.A.: Principles of Optimally Placing Data in Tertiary Storage Libraries. In: VLDB, pp. 236–245 (1997)
Stonebraker, M.: Managing Persistent Objects in a Multi-level Store. In: SIGMOD Conference, pp. 2–11 (1991)
Sarawagi, S.: Database Systems for Efficient Access to Tertiary Memory. In: Proc. IEEE Symposium on Mass Storage Sys., pp. 120–126 (1995)
Yu, J., DeWitt, D.: Query Pre-execution and Batching in Paradise: A Two-pronged Approach to the Efficient Processing of Queries on Tape-resident Data Sets. In: SSDBM, pp. 64–78 (1997)
Kraiss, A., Weikum, G.: Vertical Data Migration in Large Near-Line Document Archives Based on Markov-Chain Predictions. In: VLDB, pp. 246–255 (1997)
Triantafillou, P., Papadakis, T.: On-Demand Data Elevation in a Hierarchical Multimedia Storage Server. In: VLDB, pp. 226–235 (1997)
Levitin, A.: Introduction to The Design & Analysis of Algorithms. Addison-Wesley, Reading (2003)
Frew, J., Dozier, J.: Data Management for Earth System Science. SIGMOD Record 26, 27–31 (1997)
Barclay, T., Slutz, D.R., Gray, J.: TerraServer: A Spatial Data Warehouse. In: SIGMOD Conference, pp. 307–318 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, B., Li, J., Zhang, Y. (2004). Optimal Data Dispatching Methods in Near-Line Tertiary Storage System. In: Li, Q., Wang, G., Feng, L. (eds) Advances in Web-Age Information Management. WAIM 2004. Lecture Notes in Computer Science, vol 3129. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27772-9_74
Download citation
DOI: https://doi.org/10.1007/978-3-540-27772-9_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22418-1
Online ISBN: 978-3-540-27772-9
eBook Packages: Springer Book Archive