{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,11]],"date-time":"2024-08-11T14:31:54Z","timestamp":1723386714983},"reference-count":110,"publisher":"Association for Computing Machinery (ACM)","issue":"3","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2009,7]]},"abstract":"Web clustering engines organize search results by topic, thus offering a complementary view to the flat-ranked list returned by conventional search engines. In this survey, we discuss the issues that must be addressed in the development of a Web clustering engine, including acquisition and preprocessing of search results, their clustering and visualization. Search results clustering, the core of the system, has specific requirements that cannot be addressed by classical clustering algorithms. We emphasize the role played by the quality of the cluster labels as opposed to optimizing only the clustering structure. We highlight the main characteristics of a number of existing Web clustering engines and also discuss how to evaluate their retrieval performance. 
Some directions for future research are finally presented.","DOI":"10.1145\/1541880.1541884","type":"journal-article","created":{"date-parts":[[2009,7,28]],"date-time":"2009-07-28T12:43:55Z","timestamp":1248785035000},"page":"1-38","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":240,"title":["A survey of Web clustering engines"],"prefix":"10.1145","volume":"41","author":[{"given":"Claudio","family":"Carpineto","sequence":"first","affiliation":[{"name":"Fondazione Ugo Bordoni, Roma, Italy"}]},{"given":"Stanislaw","family":"Osi\u0144ski","sequence":"additional","affiliation":[{"name":"Carrot Search"}]},{"given":"Giovanni","family":"Romano","sequence":"additional","affiliation":[{"name":"Fondazione Ugo Bordoni, Roma, Italy"}]},{"given":"Dawid","family":"Weiss","sequence":"additional","affiliation":[{"name":"Poznan University of Technology, Poznan, Poland"}]}],"member":"320","published-online":{"date-parts":[[2009,7,30]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Principle-Based Parsing: Computation and Psycholinguistics","author":"Abney S.","unstructured":"Abney , S. 1991. Parsing by Chunks . In Principle-Based Parsing: Computation and Psycholinguistics , R. C. Berwick, S. P. Abney, and C. Tenny, Eds. Kluwer Academic Publishers , 257--278. Abney, S. 1991. Parsing by Chunks. In Principle-Based Parsing: Computation and Psycholinguistics, R. C. Berwick, S. P. Abney, and C. Tenny, Eds. 
Kluwer Academic Publishers, 257--278."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/168555.168572"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1148170.1148273"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1183614.1183648"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/WI.2006.131"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/792550.792552"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/11735106_15"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.v60:5"},{"key":"e_1_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Carpineto C. and Romano G. 2004a. Concept Data Analysis: Theory and Applications. Wiley. Carpineto C. and Romano G. 2004a. Concept Data Analysis: Theory and Applications. Wiley.","DOI":"10.1002\/0470011297"},{"key":"e_1_2_1_10_1","first-page":"985","article-title":"Exploiting the potential of concept lattices for information retrieval with CREDO","volume":"10","author":"Carpineto C.","year":"2004","unstructured":"Carpineto , C. and Romano , G. 2004 b. Exploiting the potential of concept lattices for information retrieval with CREDO . J. Univ. Comput. Sci. 10 , 8, 985 -- 1013 . Carpineto, C. and Romano, G. 2004b. Exploiting the potential of concept lattices for information retrieval with CREDO. J. Univ. Comput. Sci. 10, 8, 985--1013.","journal-title":"J. Univ. Comput. 
Sci."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/568727.568728"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s007780050061"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/332040.332418"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1065167.1065192"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/11575832_7"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1080\/713827120"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/133160.133214"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1085777.1085859"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2007.40"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1031171.1031289"},{"key":"e_1_2_1_22_1","unstructured":"Dom B. E. 2001. An information-theoretic external cluster-validity measure. Tech. rep. RJ-10219 IBM. Dom B. E. 2001. An information-theoretic external cluster-validity measure. Tech. rep. RJ-10219 IBM."},{"key":"e_1_2_1_24_1","unstructured":"Eades P. and Tamassia R. 1989. Algorithms for drawing graphs: an annotated bibliography. Tech. rep. CS-89-90 Department of Computer Science Brown University. Eades P. and Tamassia R. 1989. Algorithms for drawing graphs: an annotated bibliography. Tech. rep. CS-89-90 Department of Computer Science Brown University."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/568574.568575"},{"key":"e_1_2_1_26_1","unstructured":"Everitt B. S. Landau S. and Leese M. 2001. Cluster Analysis 4th Ed. Oxford University Press. Everitt B. S. Landau S. and Leese M. 2001. Cluster Analysis 4th Ed. Oxford University Press."},{"key":"e_1_2_1_27_1","doi-asserted-by":"crossref","unstructured":"Ferragina P. and Gulli A. 2004. 
The Anatomy of SnakeT: A Hierarchical Clustering Engine for Web-Page Snippets. In Proceedings of the 8th European Conference on Principles and Practice of Knowledge Discovery in Databases. Lecture Notes in Computer Science vol. 3202. Springer 506--508. Ferragina P. and Gulli A. 2004. The Anatomy of SnakeT: A Hierarchical Clustering Engine for Web-Page Snippets. In Proceedings of the 8th European Conference on Principles and Practice of Knowledge Discovery in Databases. Lecture Notes in Computer Science vol. 3202. Springer 506--508.","DOI":"10.1007\/978-3-540-30116-5_48"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1062745.1062760"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2005.113"},{"key":"e_1_2_1_31_1","doi-asserted-by":"crossref","unstructured":"Ganter B. and Wille R. 1999. Formal Concept Analysis: Mathematical Foundations. Springer. Ganter B. and Wille R. 1999. Formal Concept Analysis: Mathematical Foundations. Springer.","DOI":"10.1007\/978-3-642-59830-2"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1080\/15427951.2006.10129133"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the 11th Italian Symposium on Advanced Database Systems (SEBD), S. Flesca, S. Greco, D. Sacc&#224;, and E. Zumpano, Eds. Rubettino Editore, 507--518","author":"Giannotti F.","unstructured":"Giannotti , F. , Nanni , M. , Pedreschi , D. , and Samaritani , F . 2003. WebCat: Automatic categorization of Web search results . In Proceedings of the 11th Italian Symposium on Advanced Database Systems (SEBD), S. Flesca, S. Greco, D. Sacc&#224;, and E. Zumpano, Eds. Rubettino Editore, 507--518 . Giannotti, F., Nanni, M., Pedreschi, D., and Samaritani, F. 2003. WebCat: Automatic categorization of Web search results. In Proceedings of the 11th Italian Symposium on Advanced Database Systems (SEBD), S. Flesca, S. Greco, D. Sacc&#224;, and E. Zumpano, Eds. 
Rubettino Editore, 507--518."},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of the 3rd International Conference on Statistical Analysis of Textual Data (JADT'95)","author":"Grefenstette G.","year":"1995","unstructured":"Grefenstette , G. 1995 . Comparing two language identification schemes . In Proceedings of the 3rd International Conference on Statistical Analysis of Textual Data (JADT'95) . 263--268. Grefenstette, G. 1995. Comparing two language identification schemes. In Proceedings of the 3rd International Conference on Statistical Analysis of Textual Data (JADT'95). 263--268."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/11431053_33"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1012801612483"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/1076034.1076071"},{"key":"e_1_2_1_38_1","volume-title":"Clustering Algorithms","author":"Hartigan J. A.","unstructured":"Hartigan , J. A. 1975. Clustering Algorithms . Wiley . Hartigan, J. A. 1975. Clustering Algorithms. Wiley."},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/567498.567525"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/1121949.1121983"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/243199.243216"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/2945.841119"},{"key":"e_1_2_1_43_1","volume-title":"Proceedings of the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases. Lecture Notes in Computer Science","volume":"2838","author":"Hotho A.","unstructured":"Hotho , A. , Staab , S. , and Stumme , G . 2003. Explaining text clustering results using semantic structures . In Proceedings of the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases. Lecture Notes in Computer Science , vol. 2838 . Springer, 217--228. Hotho, A., Staab, S., and Stumme, G. 2003. 
Explaining text clustering results using semantic structures. In Proceedings of the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases. Lecture Notes in Computer Science, vol. 2838. Springer, 217--228."},{"key":"e_1_2_1_44_1","doi-asserted-by":"crossref","unstructured":"Husek D. Pokorny J. Rezankova H. and Snasel V. 2006. Data clustering: From documents to the Web. In Web Data Management Practices: Emerging Techniques and Technologies A. Vakali and G. Pallis Eds. Baker and Taylor 1--33. Husek D. Pokorny J. Rezankova H. and Snasel V. 2006. Data clustering: From documents to the Web. In Web Data Management Practices: Emerging Techniques and Technologies A. Vakali and G. Pallis Eds. Baker and Taylor 1--33.","DOI":"10.4018\/978-1-59904-228-2.ch001"},{"key":"e_1_2_1_45_1","unstructured":"Jain A. K. and Dubes R. C. 1988. Algorithms for Clustering Data. Prentice-Hall. Jain A. K. and Dubes R. C. 1988. Algorithms for Clustering Data. Prentice-Hall."},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/331499.331504"},{"key":"e_1_2_1_47_1","volume-title":"Proceedings of IEEE Visualization. IEEE Computer Society","author":"Johnson B.","unstructured":"Johnson , B. and Shneiderman , B . 1991. Treemaps: A space-filling approach to the visualization of hierarchical information structures . In Proceedings of IEEE Visualization. IEEE Computer Society , San Diego, 284--291. Johnson, B. and Shneiderman, B. 1991. Treemaps: A space-filling approach to the visualization of hierarchical information structures. In Proceedings of IEEE Visualization. 
IEEE Computer Society, San Diego, 284--291."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/1054972.1054991"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/345508.345650"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/1124772.1124878"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/1287620.1287621"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.667881"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/988672.988762"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/860435.860549"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/383952.384022"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/502585.502592"},{"key":"e_1_2_1_57_1","unstructured":"Leuski A. and Croft B. W. 1996. An evaluation of techniques for clustering search results. Tech. rep. IR-76 University of Massachusetts Amherst. Leuski A. and Croft B. W. 1996. An evaluation of techniques for clustering search results. Tech. rep. IR-76 University of Massachusetts Amherst."},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072228.1072372"},{"key":"e_1_2_1_59_1","volume-title":"Proceedings of the 20th International Conference on Machine Learning, August 21--24","author":"Liu T.","unstructured":"Liu , T. , Liu , S. , Chen , Z. , and Ma , W . -Y. 2003. An evaluation on feature selection for text clustering . In Proceedings of the 20th International Conference on Machine Learning, August 21--24 , T. Fawcett and N. Mishra, Eds. AAAI Press, 488--495. Liu, T., Liu, S., Chen, Z., and Ma, W.-Y. 2003. An evaluation on feature selection for text clustering. In Proceedings of the 20th International Conference on Machine Learning, August 21--24, T. Fawcett and N. Mishra, Eds. 
AAAI Press, 488--495."},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/1008992.1009026"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/1148170.1148310"},{"key":"e_1_2_1_62_1","unstructured":"Maarek Y. S. Fagin R. Ben-Shaul I. Z. and Pelleg D. 2000. Ephemeral document clustering for Web applications. Tech. rep. RJ 10186 IBM Research. Maarek Y. S. Fagin R. Ben-Shaul I. Z. and Pelleg D. 2000. Ephemeral document clustering for Web applications. Tech. rep. RJ 10186 IBM Research."},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1137\/0222058"},{"key":"e_1_2_1_64_1","volume-title":"Introduction to Information Retrieval","author":"Manning C. D.","unstructured":"Manning , C. D. , Raghavan , P. , and Sch &#252;tze, H. 2008. Introduction to Information Retrieval . Cambridge University Press . Manning, C. D., Raghavan, P., and Sch&#252;tze, H. 2008. Introduction to Information Retrieval. Cambridge University Press."},{"key":"e_1_2_1_65_1","volume-title":"Foundations of Statistical Natural Language Processing","author":"Manning C. D.","unstructured":"Manning , C. D. and Sch &#252;tze, H. 1999. Foundations of Statistical Natural Language Processing . MIT Press . Manning, C. D. and Sch&#252;tze, H. 1999. Foundations of Statistical Natural Language Processing. MIT Press."},{"key":"e_1_2_1_66_1","series-title":"Lecture Notes in Computer Science","volume-title":"Proceedings of the 25th European Conference on IR Research, (ECIR)","author":"Maslowska I.","unstructured":"Maslowska , I. 2003. Phrase-based hierarchical clustering of Web search results . In Proceedings of the 25th European Conference on IR Research, (ECIR) . Lecture Notes in Computer Science , vol. 2633 . Springer , 555--562. Maslowska, I. 2003. Phrase-based hierarchical clustering of Web search results. In Proceedings of the 25th European Conference on IR Research, (ECIR). Lecture Notes in Computer Science, vol. 2633. 
Springer, 555--562."},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/505282.505284"},{"key":"e_1_2_1_68_1","series-title":"Lecture Notes in Computer Science","volume-title":"Databases: PKDD","author":"Ngo C. L.","year":"2004","unstructured":"Ngo , C. L. and Nguyen , H. S . 2004 . A tolerance rough set approach to clustering Web search results. In Proceedings of the Knowledge Discovery in Databases: PKDD . Lecture Notes in Computer Science , vol. 3202 . Springer , 515--517. Ngo, C. L. and Nguyen, H. S. 2004. A tolerance rough set approach to clustering Web search results. In Proceedings of the Knowledge Discovery in Databases: PKDD. Lecture Notes in Computer Science, vol. 3202. Springer, 515--517."},{"key":"e_1_2_1_69_1","volume-title":"Proceedings of the 11th Text REtrieval Conference (TREC). National Institute of Standards and Technology (NIST).","author":"Osdin R.","unstructured":"Osdin , R. , Ounis , I. , and White , R. W . 2002. Using hierarchical clustering and summarisation approaches for Web retrieval . In Proceedings of the 11th Text REtrieval Conference (TREC). National Institute of Standards and Technology (NIST). Osdin, R., Ounis, I., and White, R. W. 2002. Using hierarchical clustering and summarisation approaches for Web retrieval. In Proceedings of the 11th Text REtrieval Conference (TREC). National Institute of Standards and Technology (NIST)."},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1007\/11735106_16"},{"key":"e_1_2_1_71_1","volume-title":"Proceedings of the International Intelligent Information Processing and Web Mining Conference. Advances in Soft Computing. Springer, 359--368","author":"Osi","unstructured":"Osi &#324;ski, S., Stefanowski , J. , and Weiss , D . 2004. Lingo: Search results clustering algorithm based on singular value decomposition . In Proceedings of the International Intelligent Information Processing and Web Mining Conference. Advances in Soft Computing. Springer, 359--368 . 
Osi&#324;ski, S., Stefanowski, J., and Weiss, D. 2004. Lingo: Search results clustering algorithm based on singular value decomposition. In Proceedings of the International Intelligent Information Processing and Web Mining Conference. Advances in Soft Computing. Springer, 359--368."},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2005.38"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1145\/1148170.1148271"},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1145\/564376.564412"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1109\/MITP.2006.91"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1007\/11527886_13"},{"key":"e_1_2_1_77_1","volume-title":"Readings in Information Retrieval","author":"Porter M. F.","unstructured":"Porter , M. F. 1997. An algorithm for suffix stripping . In Readings in Information Retrieval , K. S. Jones and P. Willett, Eds. Morgan Kaufmann , 313--316. Porter, M. F. 1997. An algorithm for suffix stripping. In Readings in Information Retrieval, K. S. Jones and P. Willett, Eds. Morgan Kaufmann, 313--316."},{"key":"e_1_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1145\/1149933.1149939"},{"key":"e_1_2_1_79_1","unstructured":"Rivadeneira W. and Bederson B. B. 2003. A study of search result clustering interfaces: Comparing textual and zoomable user interfaces. Tech. rep. HCIL-TR-2003-36 University of Maryland. Rivadeneira W. and Bederson B. B. 2003. A study of search result clustering interfaces: Comparing textual and zoomable user interfaces. Tech. rep. 
HCIL-TR-2003-36 University of Maryland."},{"key":"e_1_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.5555\/851039.856760"},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1145\/365024.365097"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1145\/988672.988675"},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1145\/1183614.1183667"},{"key":"e_1_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1145\/361219.361220"},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1145\/198366.198384"},{"key":"e_1_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.1145\/258525.258539"},{"key":"e_1_2_1_87_1","doi-asserted-by":"publisher","DOI":"10.1145\/505282.505283"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1007\/11908678_9"},{"key":"e_1_2_1_89_1","volume-title":"Proceedings of IEEE Symposium on InfoVis. IEEE Computer Society, 57--65","author":"Stasko J.","unstructured":"Stasko , J. and Zhang , E . 2000. Focus+context display and navigation techniques for enhancing radial, space-filling hierarchy visualizations . In Proceedings of IEEE Symposium on InfoVis. IEEE Computer Society, 57--65 . Stasko, J. and Zhang, E. 2000. Focus+context display and navigation techniques for enhancing radial, space-filling hierarchy visualizations. In Proceedings of IEEE Symposium on InfoVis. IEEE Computer Society, 57--65."},{"key":"e_1_2_1_90_1","volume-title":"Proceedings of the 1st International Atlantic Web Intelligence Conference. Lecture Notes in Computer Science","volume":"2663","author":"Stefanowski J.","unstructured":"Stefanowski , J. and Weiss , D . 2003a. Carrot2 and language properties in Web search results clustering . In Proceedings of the 1st International Atlantic Web Intelligence Conference. Lecture Notes in Computer Science , vol. 2663 . Springer, 240--249. Stefanowski, J. and Weiss, D. 2003a. Carrot2 and language properties in Web search results clustering. 
In Proceedings of the 1st International Atlantic Web Intelligence Conference. Lecture Notes in Computer Science, vol. 2663. Springer, 240--249."},{"key":"e_1_2_1_91_1","volume-title":"Proceedings of the International Intelligent Information Processing and Web Mining Conference. Advances in Soft Computing. Springer, 209--218","author":"Stefanowski J.","unstructured":"Stefanowski , J. and Weiss , D . 2003b. Web search results clustering in Polish: Experimental Evaluation of Carrot . In Proceedings of the International Intelligent Information Processing and Web Mining Conference. Advances in Soft Computing. Springer, 209--218 . Stefanowski, J. and Weiss, D. 2003b. Web search results clustering in Polish: Experimental Evaluation of Carrot. In Proceedings of the International Intelligent Information Processing and Web Mining Conference. Advances in Soft Computing. Springer, 209--218."},{"key":"e_1_2_1_92_1","volume-title":"Proceedings of the 4th International Conference on Knowledge Management. 353--360","author":"Stein B.","unstructured":"Stein , B. and Meyer zu Eissen, S. 2004. Topic identification: Framework and application . In Proceedings of the 4th International Conference on Knowledge Management. 353--360 . Stein, B. and Meyer zu Eissen, S. 2004. Topic identification: Framework and application. In Proceedings of the 4th International Conference on Knowledge Management. 353--360."},{"key":"e_1_2_1_93_1","volume-title":"Proceedings of the 3rd IASTED International Conference on Artificial Intelligence and Applications (AIA). Springer, 216--221","author":"Stein B.","unstructured":"Stein , B. , Meyer zu Eissen , S. , and Wibrock , F . 2003. On cluster validity and the information need of users . In Proceedings of the 3rd IASTED International Conference on Artificial Intelligence and Applications (AIA). Springer, 216--221 . Stein, B., Meyer zu Eissen, S., and Wibrock, F. 2003. On cluster validity and the information need of users. 
In Proceedings of the 3rd IASTED International Conference on Artificial Intelligence and Applications (AIA). Springer, 216--221."},{"key":"e_1_2_1_94_1","volume-title":"Proceedings of the 6th SIAM International Conference on Data Mining (SDM). 188--199","author":"Tagarelli A.","unstructured":"Tagarelli , A. and Greco , S . 2006. Toward semantic XML clustering . In Proceedings of the 6th SIAM International Conference on Data Mining (SDM). 188--199 . Tagarelli, A. and Greco, S. 2006. Toward semantic XML clustering. In Proceedings of the 6th SIAM International Conference on Data Mining (SDM). 188--199."},{"key":"e_1_2_1_95_1","doi-asserted-by":"publisher","DOI":"10.1145\/985692.985745"},{"key":"e_1_2_1_96_1","doi-asserted-by":"publisher","DOI":"10.1145\/1097047.1097063"},{"key":"e_1_2_1_97_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0306-4573(01)00048-6"},{"key":"e_1_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10796-005-2770-7"},{"key":"e_1_2_1_99_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF01206331"},{"key":"e_1_2_1_100_1","unstructured":"van Rijsbergen K. 1979. Information Retrieval. Butterworth-Heinemann. van Rijsbergen K. 1979. Information Retrieval. Butterworth-Heinemann."},{"key":"e_1_2_1_101_1","volume-title":"Proceedings of the 18th International Conference on Machine Learning. Morgan Kaufmann, 577--584","author":"Wagstaff K.","year":"2001","unstructured":"Wagstaff , K. , Cardie , C. , Rogers , S. , and Scr &#246;dl, S. 2001 . Constrained K-means clustering with background knowledge . In Proceedings of the 18th International Conference on Machine Learning. Morgan Kaufmann, 577--584 . Wagstaff, K., Cardie, C., Rogers, S., and Scr&#246;dl, S. 2001. Constrained K-means clustering with background knowledge. In Proceedings of the 18th International Conference on Machine Learning. 
Morgan Kaufmann, 577--584."},{"key":"e_1_2_1_102_1","doi-asserted-by":"publisher","DOI":"10.1145\/1277741.1277759"},{"key":"e_1_2_1_103_1","volume-title":"Proceedings of the 13th International Conference on Database and Expert Systems Applications (DEXA). Springer, 902--913","author":"Wang Y.","unstructured":"Wang , Y. and Kitsuregawa , M . 2002. On combining link and contents information for Web page clustering . In Proceedings of the 13th International Conference on Database and Expert Systems Applications (DEXA). Springer, 902--913 . Wang, Y. and Kitsuregawa, M. 2002. On combining link and contents information for Web page clustering. In Proceedings of the 13th International Conference on Database and Expert Systems Applications (DEXA). Springer, 902--913."},{"key":"e_1_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90027-1"},{"key":"e_1_2_1_106_1","volume-title":"Proceedings of the 14th International Conference on Machine Learning (ICMC). Morgan Kaufmann","author":"Yang Y.","unstructured":"Yang , Y. and Pedersen , J. O . 1997. A comparative study on feature selection in text categorization . In Proceedings of the 14th International Conference on Machine Learning (ICMC). Morgan Kaufmann , San Francisco, 412--420. Yang, Y. and Pedersen, J. O. 1997. A comparative study on feature selection in text categorization. In Proceedings of the 14th International Conference on Machine Learning (ICMC). Morgan Kaufmann, San Francisco, 412--420."},{"key":"e_1_2_1_107_1","volume-title":"Proceedings of the IEEE\/WIC International Conference on Web Intelligence. Springer, 344--350","author":"Ye S.","unstructured":"Ye , S. , Chua , T.-S. , and Kei , J. R . 2003. Querying and clustering Web pages about persons and organizations . In Proceedings of the IEEE\/WIC International Conference on Web Intelligence. Springer, 344--350 . Ye, S., Chua, T.-S., and Kei, J. R. 2003. Querying and clustering Web pages about persons and organizations. 
In Proceedings of the IEEE\/WIC International Conference on Web Intelligence. Springer, 344--350."},{"key":"e_1_2_1_108_1","doi-asserted-by":"publisher","DOI":"10.1145\/290941.290956"},{"key":"e_1_2_1_109_1","doi-asserted-by":"publisher","DOI":"10.1016\/S1389-1286(99)00054-7"},{"key":"e_1_2_1_110_1","doi-asserted-by":"publisher","DOI":"10.1145\/1008992.1009030"},{"key":"e_1_2_1_111_1","volume-title":"Proceedings of 6th Asia-Pacific Web Conference (APWeb). Lecture Notes in Computer Science","volume":"3007","author":"Zhang D.","unstructured":"Zhang , D. and Dong , Y . 2004. Semantic, hierarchical, online clustering of Web search results . In Proceedings of 6th Asia-Pacific Web Conference (APWeb). Lecture Notes in Computer Science , vol. 3007 . Springer, 69--78. Zhang, D. and Dong, Y. 2004. Semantic, hierarchical, online clustering of Web search results. In Proceedings of 6th Asia-Pacific Web Conference (APWeb). Lecture Notes in Computer Science, vol. 3007. Springer, 69--78."},{"key":"e_1_2_1_112_1","doi-asserted-by":"publisher","DOI":"10.1002\/int.v19:1\/2"},{"key":"e_1_2_1_113_1","doi-asserted-by":"publisher","DOI":"10.1145\/1060745.1060760"}],"container-title":["ACM Computing 
Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1541880.1541884","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,29]],"date-time":"2022-12-29T07:19:44Z","timestamp":1672298384000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1541880.1541884"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,7]]},"references-count":110,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2009,7]]}},"alternative-id":["10.1145\/1541880.1541884"],"URL":"https:\/\/doi.org\/10.1145\/1541880.1541884","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2009,7]]},"assertion":[{"value":"2007-12-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2008-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2009-07-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}