{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,5,1]],"date-time":"2024-05-01T04:10:16Z","timestamp":1714536616026},"reference-count":52,"publisher":"Wiley","issue":"2","license":[{"start":{"date-parts":[[2012,11,6]],"date-time":"2012-11-06T00:00:00Z","timestamp":1352160000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Statistical Analysis"],"published-print":{"date-parts":[[2013,4]]},"abstract":"Abstract<\/jats:title>Biclustering is desirable over traditional one\u2010dimensional clustering, and has been broadly applied to many domains such as bioinformatics and text mining. However, the existing biclustering methods can only deal with a data matrix of scalars. In this paper, we introduce a biclustering procedure that can handle a data matrix of scatter plots. To more accurately reflect the nature of data, we introduce a dissimilarity statistic based on \u2018data depth\u2019 to measure the discrepancy between two bivariate distributions without oversimplifying the nature of the underlying pattern. We then combine hypothesis testing with a searching algorithm to simultaneously cluster the rows and columns of the data matrix of scatter plots. We also propose novel painting metrics and construct heat maps to allow visualization of the biclusters. We demonstrate the utility and power of our proposed biclustering method through simulation studies and application to a microbe\u2013host interaction study. \u00a9 2012 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 6: 102\u2013115, 2013<\/jats:p>","DOI":"10.1002\/sam.11166","type":"journal-article","created":{"date-parts":[[2012,11,6]],"date-time":"2012-11-06T18:55:11Z","timestamp":1352228111000},"page":"102-115","source":"Crossref","is-referenced-by-count":1,"title":["Biclustering scatter plots using data depth measures"],"prefix":"10.1002","volume":"6","author":[{"given":"Zhanpan","family":"Zhang","sequence":"first","affiliation":[]},{"given":"Xinping","family":"Cui","sequence":"additional","affiliation":[]},{"given":"Daniel R.","family":"Jeske","sequence":"additional","affiliation":[]},{"given":"James","family":"Borneman","sequence":"additional","affiliation":[]}],"member":"311","published-online":{"date-parts":[[2012,11,6]]},"reference":[{"key":"e_1_2_8_2_2","doi-asserted-by":"publisher","DOI":"10.1038\/nature06244"},{"key":"e_1_2_8_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCBB.2004.2"},{"key":"e_1_2_8_4_2","unstructured":"Y.ChengandG.Church Biclustering of expression data In Proceedings of Eighth ISMB Conference AAAI Press 2000 93\u2013103."},{"key":"e_1_2_8_5_2","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.210134797"},{"key":"e_1_2_8_6_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/18.suppl_1.S136"},{"key":"e_1_2_8_7_2","doi-asserted-by":"publisher","DOI":"10.1101\/gr.648603"},{"key":"e_1_2_8_8_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bth166"},{"key":"e_1_2_8_9_2","doi-asserted-by":"crossref","unstructured":"I.Dhillon S.Mallela andD.Modha Information\u2010theoretic co\u2010clustering ' In Proceedings of ACM SIGKDD 2003 89\u201398.","DOI":"10.1145\/956750.956764"},{"key":"e_1_2_8_10_2","doi-asserted-by":"crossref","unstructured":"H.ShanandA.Banerjee Bayesian co\u2010clustering In Proceedings of IEEE ICDM 2008 530\u2013539.","DOI":"10.1109\/ICDM.2008.91"},{"key":"e_1_2_8_11_2","doi-asserted-by":"publisher","DOI":"10.1191\/0962280204sm373ra"},{"key":"e_1_2_8_12_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btl060"},{"key":"e_1_2_8_13_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cor.2007.01.005"},{"key":"e_1_2_8_14_2","unstructured":"Z.Zhang X.Cui D. R.Jeske X.Li J.Braun andJ.Borneman Clustering scatter plots using data depth measures In The 6th International Conference on Data Mining (DMIN'10) Las Vegas USA 2010 327\u2013333."},{"key":"e_1_2_8_15_2","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1080\/01621459.1993.10594317","article-title":"A quality index based on data depth and multivariate rank tests,","volume":"88","author":"Liu R. Y.","year":"1993","journal-title":"J Am Stat Assoc"},{"key":"e_1_2_8_16_2","doi-asserted-by":"publisher","DOI":"10.1214\/009053606000000876"},{"key":"e_1_2_8_17_2","first-page":"65","article-title":"A simple sequentially rejective multiple test procedure,","volume":"6","author":"Holm S.","year":"1979","journal-title":"Scand J Stat"},{"key":"e_1_2_8_18_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0016-5085(10)60060-1"},{"key":"e_1_2_8_19_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.dld.2005.01.010"},{"key":"e_1_2_8_20_2","article-title":"Host\u2010microbe relationships in inflammatory bowel disease detected by bacterial and metaproteomic analysis of the mucosal\u2010luminal interface,","author":"Presley L. L.","journal-title":"Inflamm Bowel Dis."},{"key":"e_1_2_8_21_2","unstructured":"D.ZhouandT.Shi Statistical inference based on distances between empirical distributions with application to AIRS level 3 data ' In Proceedings of the NASA Conference on Intelligent Data Understanding (CIDU) 2011."},{"key":"e_1_2_8_22_2","doi-asserted-by":"publisher","DOI":"10.1016\/0377-0427(87)90125-7"},{"key":"e_1_2_8_23_2","first-page":"49","article-title":"On the generalized distance in statistics,","volume":"12","author":"Mahalanobis P. C.","year":"1936","journal-title":"Proc Natl Acad India"},{"key":"e_1_2_8_24_2","unstructured":"J. W.Tukey Mathematics and picturing data In Proceedings of the 1974 International Congress of Mathematicians Vol. 2 Vancouver 1974 523\u2013531."},{"key":"e_1_2_8_25_2","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1176347507"},{"key":"e_1_2_8_26_2","doi-asserted-by":"publisher","DOI":"10.1056\/NEJM199005103221903"},{"key":"e_1_2_8_27_2","doi-asserted-by":"publisher","DOI":"10.1097\/MPG.0b013e31810e75a9"},{"key":"e_1_2_8_28_2","doi-asserted-by":"publisher","DOI":"10.1097\/00054725-200606000-00006"},{"key":"e_1_2_8_29_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.metabol.2006.03.006"},{"key":"e_1_2_8_30_2","doi-asserted-by":"publisher","DOI":"10.1136\/gut.52.6.847"},{"key":"e_1_2_8_31_2","doi-asserted-by":"publisher","DOI":"10.1136\/gut.2008.170019"},{"key":"e_1_2_8_32_2","doi-asserted-by":"publisher","DOI":"10.1007\/BF02919746"},{"key":"e_1_2_8_33_2","doi-asserted-by":"publisher","DOI":"10.1136\/gut.33.7.902"},{"key":"e_1_2_8_34_2","doi-asserted-by":"publisher","DOI":"10.1046\/j.1365-2249.2000.01168.x"},{"key":"e_1_2_8_35_2","doi-asserted-by":"publisher","DOI":"10.1046\/j.1365-2249.1996.17721.x"},{"key":"e_1_2_8_36_2","doi-asserted-by":"publisher","DOI":"10.1152\/ajpcell.1990.259.4.C577"},{"key":"e_1_2_8_37_2","doi-asserted-by":"publisher","DOI":"10.1056\/NEJMoa070265"},{"key":"e_1_2_8_38_2","doi-asserted-by":"publisher","DOI":"10.1002\/ibd.20347"},{"key":"e_1_2_8_39_2","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0906773106"},{"key":"e_1_2_8_40_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10620-006-9615-1"},{"key":"e_1_2_8_41_2","doi-asserted-by":"publisher","DOI":"10.1002\/ibd.20851"},{"key":"e_1_2_8_42_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00011-008-8120-8"},{"key":"e_1_2_8_43_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0002-9440(10)63636-X"},{"key":"e_1_2_8_44_2","doi-asserted-by":"publisher","DOI":"10.2353\/ajpath.2008.080444"},{"key":"e_1_2_8_45_2","doi-asserted-by":"publisher","DOI":"10.1111\/j.1365-3083.2007.01908.x"},{"key":"e_1_2_8_46_2","doi-asserted-by":"publisher","DOI":"10.1053\/j.gastro.2005.09.032"},{"key":"e_1_2_8_47_2","doi-asserted-by":"publisher","DOI":"10.1111\/j.1572-0241.2001.03881.x"},{"key":"e_1_2_8_48_2","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/25.17.3389"},{"key":"e_1_2_8_49_2","doi-asserted-by":"publisher","DOI":"10.1038\/ismej.2007.52"},{"key":"e_1_2_8_50_2","doi-asserted-by":"publisher","DOI":"10.1097\/01.mib.0000235828.09305.0c"},{"key":"e_1_2_8_51_2","doi-asserted-by":"publisher","DOI":"10.1002\/ibd.20903"},{"key":"e_1_2_8_52_2","doi-asserted-by":"publisher","DOI":"10.1002\/ibd.20330"},{"key":"e_1_2_8_53_2","doi-asserted-by":"publisher","DOI":"10.1002\/ibd.20783"}],"container-title":["Statistical Analysis and Data Mining: The ASA Data Science Journal"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fsam.11166","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fsam.11166","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/sam.11166","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,1]],"date-time":"2024-05-01T03:50:01Z","timestamp":1714535401000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/sam.11166"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,11,6]]},"references-count":52,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2013,4]]}},"alternative-id":["10.1002\/sam.11166"],"URL":"https:\/\/doi.org\/10.1002\/sam.11166","archive":["Portico"],"relation":{},"ISSN":["1932-1864","1932-1872"],"issn-type":[{"value":"1932-1864","type":"print"},{"value":"1932-1872","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,11,6]]}}}