{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,9,13]],"date-time":"2023-09-13T20:30:12Z","timestamp":1694637012305},"reference-count":17,"publisher":"Emerald","issue":"3","license":[{"start":{"date-parts":[[2009,8,21]],"date-time":"2009-08-21T00:00:00Z","timestamp":1250812800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,8,21]]},"abstract":"Purpose: The purpose of this paper is to present a novel control mechanism for avoiding overlapping among biclusters in expression data. Design\/methodology\/approach: Biclustering is a technique used in analysis of microarray data. One of the most popular biclustering algorithms is introduced by Cheng and Church (2000) (Ch&Ch). Even if this heuristic is successful at finding interesting biclusters, it presents several drawbacks. The main shortcoming is that it introduces random values in the expression matrix to control the overlapping. The overlapping control method presented in this paper is based on a matrix of weights, that is used to estimate the overlapping of a bicluster with already found ones. In this way, the algorithm is always working on real data and so the biclusters it discovers contain only original data. Findings: The paper shows that the original algorithm wrongly estimates the quality of the biclusters after some iterations, due to random values that it introduces. The empirical results show that the proposed approach is effective in order to improve the heuristic. 
It is also important to highlight that many interesting biclusters found by using our approach would have not been obtained using the original algorithm. Originality\/value: The original algorithm proposed by Ch&Ch is one of the most successful algorithms for discovering biclusters in microarray data. However, it presents some limitations, the most relevant being the substitution phase adopted in order to avoid overlapping among biclusters. The modified version of the algorithm proposed in this paper improves the original one, as proven in the experimentation.","DOI":"10.1108\/17563780910982707","type":"journal-article","created":{"date-parts":[[2009,10,5]],"date-time":"2009-10-05T14:45:26Z","timestamp":1254753926000},"page":"477-493","source":"Crossref","is-referenced-by-count":5,"title":["Improved biclustering on expression data through overlapping control"],"prefix":"10.1108","volume":"2","author":[{"given":"Beatriz","family":"Pontes","sequence":"first","affiliation":[]},{"given":"Federico","family":"Divina","sequence":"additional","affiliation":[]},{"given":"Ra\u00fal","family":"Gir\u00e1ldez","sequence":"additional","affiliation":[]},{"given":"Jes\u00fas S.","family":"Aguilar\u2010Ruiz","sequence":"additional","affiliation":[]}],"member":"140","reference":[{"key":"key2022021520032579200_b1","unstructured":"Aguilar\u2010Ruiz, J.S., Rodriguez, D.S. and Simovici, D.A. (2006), \u201cBiclustering of gene expression data based on local nearness\u201d, Proceedings of EGC 2006, pp. 681\u201092."},{"key":"key2022021520032579200_b2","doi-asserted-by":"crossref","unstructured":"Alizadeh, A.A., Eisen, M.B., Davis, R.E., Ma, C., Lossos, I.S., Rosenwald, A., Boldrick, J.C., Sabet, H., Tran, T., Yu, X., Powell, J.I., Yang, L., Marti, G.E., Moore, T., Hudson, J. 
Jr, Lu, L., Lewis, D.B., Tibshirani, R., Sherlock, G., Chan, W.C., Greiner, T.C., Weisenburger, D.D., Armitage, J.O., Warnke, R., Levy, R., Wilson, W., Grever, M.R., Byrd, J.C., Botstein, D., Brown, P.O. and Staudt, L.M. (2000), \u201cDistinct types of diffuse large B\u2010cell lymphoma identified by gene expression profiling\u201d, Nature, Vol. 403, pp. 503\u201011.","DOI":"10.1038\/35000501"},{"key":"key2022021520032579200_b3","doi-asserted-by":"crossref","unstructured":"Baldi, P. (2002), DNA Microarrays and Gene Expression: from Experiments to Data Analysis and Modeling, Cambridge University Press, Cambridge.","DOI":"10.1017\/CBO9780511541773"},{"key":"key2022021520032579200_b4","doi-asserted-by":"crossref","unstructured":"Ben\u2010Dor, A., Shamir, R. and Yakhini, Z. (1999), \u201cClustering gene expression patterns\u201d, Journal of Computational Biology, Vol. 6 Nos 3\u20104, pp. 281\u201097.","DOI":"10.1089\/106652799318274"},{"key":"key2022021520032579200_b5","doi-asserted-by":"crossref","unstructured":"Bryan, K. and Cunningham, P. (2007), \u201cBALBOA: extending bicluster analysis to classify ORFs using expression data\u201d, Proceedings of the 7th IEEE International Conference on Bioinformatics and Bioengineering (BIBE), Boston, MA, 14\u201017 October, pp. 995\u20101002.","DOI":"10.1109\/BIBE.2007.4375679"},{"key":"key2022021520032579200_b6","unstructured":"Cheng, Y. and Church, G.M. (2000), \u201cBiclustering of expression data\u201d, Proceedings of the 8th International Conference on Intelligent Systems for Molecular Biology, pp. 93\u2010103."},{"key":"key2022021520032579200_b8","doi-asserted-by":"crossref","unstructured":"Cho, H., Dhillon, D., Guan, Y. and Sra, S. 
(2004), \u201cMinimum sum\u2010squared residue co\u2010clustering of gene expression data\u201d, Proceedings of the 4th SIAM International Conference on Data Mining.","DOI":"10.1137\/1.9781611972740.11"},{"key":"key2022021520032579200_b7","doi-asserted-by":"crossref","unstructured":"Cho, R.J., Campbell, M.J., Winzeler, E.A., Steinmetz, L., Conway, A., Wodicka, L., Wolfsberg, T.G., Gabrielian, A.E., Landsman, D., Lockhart, D.J. and Davis, R.W. (1998), \u201cA genome\u2010wide transcriptional analysis of the mitotic cell cycle\u201d, Molecular Cell, Vol. 2, pp. 65\u201073.","DOI":"10.1016\/S1097-2765(00)80114-8"},{"key":"key2022021520032579200_b9","doi-asserted-by":"crossref","unstructured":"Divina, F. and Aguilar\u2010Ruiz, J.S. (2006), \u201cBiclustering of expression data with evolutionary computation\u201d, IEEE Transactions on Knowledge & Data Engineering, Vol. 18 No. 5, pp. 590\u2010602.","DOI":"10.1109\/TKDE.2006.74"},{"key":"key2022021520032579200_b10","doi-asserted-by":"crossref","unstructured":"Harpaz, R. and Haralick, R. (2006), \u201cExploiting the geometry of gene expression patterns for unsupervised learning\u201d, The 18th International Conference on Pattern Recognition, pp. 670\u20104.","DOI":"10.1109\/ICPR.2006.518"},{"key":"key2022021520032579200_b11","doi-asserted-by":"crossref","unstructured":"Hartigan, J.A. (1972), \u201cDirect clustering of a data matrix\u201d, Journal of the American Statistical Association, Vol. 67, pp. 123\u20109.","DOI":"10.1080\/01621459.1972.10481214"},{"key":"key2022021520032579200_b12","doi-asserted-by":"crossref","unstructured":"Piatetsky\u2010Shapiro, G., Khabaza, T. and Ramaswamy, S. (2003), \u201cCapturing best practice for microarray gene expression data analysis\u201d, Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, pp. 
407\u201015.","DOI":"10.1145\/956750.956797"},{"key":"key2022021520032579200_b13","doi-asserted-by":"crossref","unstructured":"Tamayo, P., Slonim, D., Mesirov, J., Zhu, Q., Kitareewan, S., Dmitrovsky, E., Lander, E.S. and Golub, T.R. (1999), \u201cInterpreting patterns of gene expression with self\u2010organizing maps: methods and application to hematopoietic differentiation\u201d, Proceedings of the National Academy of Sciences of the United States of America, Vol. 96, pp. 2907\u201012.","DOI":"10.1073\/pnas.96.6.2907"},{"key":"key2022021520032579200_b14","doi-asserted-by":"crossref","unstructured":"Tilstone, C. (2003), \u201cDNA microarrays: vital statistics\u201d, Nature, Vol. 424, pp. 610\u20102.","DOI":"10.1038\/424610a"},{"key":"key2022021520032579200_b16","unstructured":"Yang, J., Wang, H., Wang, W. and Yu, P.S. (2002), \u201c\u03b4\u2010Clusters: capturing subspace correlation in a large data set\u201d, Proceedings of the 18th IEEE Conference on Data Engineering, IEEE Computer Society, Washington, DC, pp. 517\u201028."},{"key":"key2022021520032579200_b15","doi-asserted-by":"crossref","unstructured":"Yang, J., Wang, H., Wang, W. and Yu, P.S. (2005), \u201cAn improved biclustering method for analyzing gene expression profiles\u201d, International Journal on Artificial Intelligence Tools, Vol. 14, pp. 771\u201090.","DOI":"10.1142\/S0218213005002387"},{"key":"key2022021520032579200_b17","doi-asserted-by":"crossref","unstructured":"Yin, L., Huang, C.H. and Ni, J. (2006), \u201cClustering of gene expression data: performance and similarity analysis\u201d, BMC Bioinformatics, Vol. 7, p. 
S19.","DOI":"10.1186\/1471-2105-7-S4-S19"}],"container-title":["International Journal of Intelligent Computing and Cybernetics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.emeraldinsight.com\/doi\/full-xml\/10.1108\/17563780910982707","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/17563780910982707\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/17563780910982707\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,2,15]],"date-time":"2022-02-15T22:55:21Z","timestamp":1644965721000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/17563780910982707\/full\/html"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,8,21]]},"references-count":17,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2009,8,21]]}},"alternative-id":["10.1108\/17563780910982707"],"URL":"https:\/\/doi.org\/10.1108\/17563780910982707","relation":{},"ISSN":["1756-378X"],"issn-type":[{"value":"1756-378X","type":"print"}],"subject":[],"published":{"date-parts":[[2009,8,21]]}}}