{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,19]],"date-time":"2025-03-19T14:40:37Z","timestamp":1742395237082,"version":"3.30.2"},"publisher-location":"New York, NY, USA","reference-count":31,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2004,6,13]]},"DOI":"10.1145\/1007568.1007655","type":"proceedings-article","created":{"date-parts":[[2004,7,20]],"date-time":"2004-07-20T15:55:38Z","timestamp":1090338938000},"page":"767-778","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":27,"title":["When one sample is not enough"],"prefix":"10.1145","author":[{"given":"Panagiotis G.","family":"Ipeirotis","sequence":"first","affiliation":[{"name":"Columbia University"}]},{"given":"Luis","family":"Gravano","sequence":"additional","affiliation":[{"name":"Columbia University"}]}],"member":"320","published-online":{"date-parts":[[2004,6,13]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/382979.383040"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/304182.304224"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/215206.215328"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1008708725988"},{"key":"e_1_3_2_1_5_1","first-page":"2002","article-title":"Database selection using actual physical and acquired logical collection resources in a massive domain-specific operational environment","author":"Conrad J. G.","year":"2002","unstructured":"J. G. Conrad , X. S. Guo , P. Jackson , and M Meziou . Database selection using actual physical and acquired logical collection resources in a massive domain-specific operational environment . In VLDB 2002 , 2002 . J. G. Conrad, X. S. Guo, P. Jackson, and M Meziou. Database selection using actual physical and acquired logical collection resources in a massive domain-specific operational environment. In VLDB 2002, 2002.","journal-title":"VLDB"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/336597.336628"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"Maximum likelihood from incomplete data via the EM algorithm","author":"Dempster A.","year":"1977","unstructured":"A. Dempster , N. Laird , and D. Rubin . Maximum likelihood from incomplete data via the EM algorithm . Journal of the Royal Statistical Society, B(39) , 1977 . A. Dempster, N. Laird, and D. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, B(39), 1977.","journal-title":"Journal of the Royal Statistical Society, B(39)"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/313238.313257"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/312624.312684"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/314516.314517"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/253260.253299"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/320248.320252"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/635484.635485"},{"key":"e_1_3_2_1_14_1","volume-title":"Overview of the Fourth Text REtrieval Conference (TREC-4). In NIST Special Publication 500-236: The Fourth Text REtrieval Conference (TREC-4)","author":"Harman D.","year":"1996","unstructured":"D. Harman . Overview of the Fourth Text REtrieval Conference (TREC-4). In NIST Special Publication 500-236: The Fourth Text REtrieval Conference (TREC-4) , 1996 . D. Harman. Overview of the Fourth Text REtrieval Conference (TREC-4). In NIST Special Publication 500-236: The Fourth Text REtrieval Conference (TREC-4), 1996."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-21606-5"},{"key":"e_1_3_2_1_16_1","first-page":"2002","article-title":"Distributed search over the hidden web: Hierarchical database sampling and selection","author":"Ipeirotis P. G.","year":"2002","unstructured":"P. G. Ipeirotis and L. Gravano . Distributed search over the hidden web: Hierarchical database sampling and selection . In VLDB 2002 , 2002 . P. G. Ipeirotis and L. Gravano. Distributed search over the hidden web: Hierarchical database sampling and selection. In VLDB 2002, 2002.","journal-title":"VLDB"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/354756.354830"},{"key":"e_1_3_2_1_19_1","volume-title":"ICDE 2004","author":"Liu Z.","year":"2004","unstructured":"Z. Liu , C. Luo , J. Cho , and W. Chu . A probabilistic approach to metasearching with adaptive probing . In ICDE 2004 , 2004 . Z. Liu, C. Luo, J. Cho, and W. Chu. A probabilistic approach to metasearching with adaptive probing. In ICDE 2004, 2004."},{"key":"e_1_3_2_1_20_1","volume-title":"Freeman & Co.","author":"Mandelbrot B. B.","year":"1988","unstructured":"B. B. Mandelbrot . Fractal Geometry of Nature. W. H . Freeman & Co. , 1988 . B. B. Mandelbrot. Fractal Geometry of Nature. W. H. Freeman & Co., 1988."},{"key":"e_1_3_2_1_21_1","volume-title":"Applied Statistics","author":"Marques De S\u00e1 J. P.","year":"2003","unstructured":"J. P. Marques De S\u00e1 . Applied Statistics . Springer Verlag , 2003 . J. P. Marques De S\u00e1. Applied Statistics. Springer Verlag, 2003."},{"key":"e_1_3_2_1_22_1","volume-title":"ICML'98","author":"McCallum A.","year":"1998","unstructured":"A. McCallum , R. Rosenfeld , T. M. Mitchell , and A. Y. Ng . Improving text classification by shrinkage in a hierarchy of classes . In ICML'98 , 1998 . A. McCallum, R. Rosenfeld, T. M. Mitchell, and A. Y. Ng. Improving text classification by shrinkage in a hierarchy of classes. In ICML'98, 1998."},{"key":"e_1_3_2_1_23_1","volume-title":"VLDB'98","author":"Meng W.","year":"1998","unstructured":"W. Meng , K.-L. Liu , C. T. Yu , X. Wang , Y. Chang , and N. Rishe . Determining text databases to search in the Internet . In VLDB'98 , 1998 . W. Meng, K.-L. Liu, C. T. Yu, X. Wang, Y. Chang, and N. Rishe. Determining text databases to search in the Internet. In VLDB'98, 1998."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/820741.820982"},{"key":"e_1_3_2_1_25_1","volume-title":"Introduction to modern information retrieval","author":"Salton G.","year":"1983","unstructured":"G. Salton and M. J. McGill . Introduction to modern information retrieval . McGraw-Hill , 1983 . G. Salton and M. J. McGill. Introduction to modern information retrieval. McGraw-Hill, 1983."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/860435.860490"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/584792.584856"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.6028\/NIST.SP.500-240"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/290941.290974"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/312624.312687"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/319950.320005"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.5555\/646711.759510"}],"event":{"name":"SIGMOD\/PODS04: International Conference on Management of Data and Symposium on Principles Database and Systems","sponsor":["SIGMOD ACM Special Interest Group on Management of Data"],"location":"Paris France","acronym":"SIGMOD\/PODS04"},"container-title":["Proceedings of the 2004 ACM SIGMOD international conference on Management of data"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1007568.1007655","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,17]],"date-time":"2024-12-17T23:32:51Z","timestamp":1734478371000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1007568.1007655"}},"subtitle":["improving text database selection using shrinkage"],"short-title":[],"issued":{"date-parts":[[2004,6,13]]},"references-count":31,"alternative-id":["10.1145\/1007568.1007655","10.1145\/1007568"],"URL":"https:\/\/doi.org\/10.1145\/1007568.1007655","relation":{},"subject":[],"published":{"date-parts":[[2004,6,13]]},"assertion":[{"value":"2004-06-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}