{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,31]],"date-time":"2025-03-31T07:43:34Z","timestamp":1743407014614},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"24","license":[{"start":{"date-parts":[[2016,10,6]],"date-time":"2016-10-06T00:00:00Z","timestamp":1475712000000},"content-version":"vor","delay-in-days":3272,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,12,15]]},"abstract":"Abstract<\/jats:title>Motivation: Determining locations of protein expression is essential to understand protein function. Advances in green fluorescence protein (GFP) fusion proteins and automated fluorescence microscopy allow for rapid acquisition of large collections of protein localization images. Recognition of these cell images requires an automated image analysis system. Approaches taken by previous work concentrated on designing a set of optimal features and then applying standard machine-learning algorithms. In fact, trends of recent advances in machine learning and computer vision can be applied to improve the performance. One trend is the advances in multiclass learning with error-correcting output codes (ECOC). Another trend is the use of a large number of weak detectors with boosting for detecting objects in images of real-world scenes.<\/jats:p>Results: We take advantage of these advances to propose a new learning algorithm, AdaBoost.ERC, coupled with weak and strong detectors, to improve the performance of automatic recognition of protein subcellular locations in cell images. We prepared two image data sets of CHO and Vero cells and downloaded a HeLa cell image data set in the public domain to evaluate our new method. We show that AdaBoost.ERC outperforms other AdaBoost extensions. We demonstrate the benefit of weak detectors by showing significant performance improvements over classifiers using only strong detectors. We also empirically test our method's capability of generalizing to heterogeneous image collections. Compared with previous work, our method performs reasonably well for the HeLa cell images.<\/jats:p>Availability: CHO and Vero cell images, their corresponding feature sets (SSLF and WSLF), our new learning algorithm, AdaBoost.ERC, and Supplementary Material are available at http:\/\/aiia.iis.sinica.edu.tw\/<\/jats:p>Contact: \u00a0chunnan@iis.sinica.edu.tw<\/jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm497","type":"journal-article","created":{"date-parts":[[2007,10,24]],"date-time":"2007-10-24T00:20:21Z","timestamp":1193185221000},"page":"3374-3381","source":"Crossref","is-referenced-by-count":29,"title":["Boosting multiclass learning with repeating codes and weak detectors for protein subcellular localization"],"prefix":"10.1093","volume":"23","author":[{"given":"Chung-Chih","family":"Lin","sequence":"first","affiliation":[{"name":"1 Faculty of Life Sciences and Institute of Genomes, National Yang-Ming University, Taipei, 2Brain Research Center, University System of Taiwan, Hsin-Chu, 3Department of Biomedical Engineering, Chung Yuan Christian University, Jhongli, 4Institute of Information Science, Academia Sinica, Taipei, Taiwan and 5Department of Cell Biology\/Biophysics, EMBL Heidelberg, 69117 Heidelberg, Germany"},{"name":"1 Faculty of Life Sciences and Institute of Genomes, National Yang-Ming University, Taipei, 2Brain Research Center, University System of Taiwan, Hsin-Chu, 3Department of Biomedical Engineering, Chung Yuan Christian University, Jhongli, 4Institute of Information Science, Academia Sinica, Taipei, Taiwan and 5Department of Cell Biology\/Biophysics, EMBL Heidelberg, 69117 Heidelberg, Germany"}]},{"given":"Yuh-Show","family":"Tsai","sequence":"additional","affiliation":[{"name":"1 Faculty of Life Sciences and Institute of Genomes, National Yang-Ming University, Taipei, 2Brain Research Center, University System of Taiwan, Hsin-Chu, 3Department of Biomedical Engineering, Chung Yuan Christian University, Jhongli, 4Institute of Information Science, Academia Sinica, Taipei, Taiwan and 5Department of Cell Biology\/Biophysics, EMBL Heidelberg, 69117 Heidelberg, Germany"}]},{"given":"Yu-Shi","family":"Lin","sequence":"additional","affiliation":[{"name":"1 Faculty of Life Sciences and Institute of Genomes, National Yang-Ming University, Taipei, 2Brain Research Center, University System of Taiwan, Hsin-Chu, 3Department of Biomedical Engineering, Chung Yuan Christian University, Jhongli, 4Institute of Information Science, Academia Sinica, Taipei, Taiwan and 5Department of Cell Biology\/Biophysics, EMBL Heidelberg, 69117 Heidelberg, Germany"}]},{"given":"Tai-Yu","family":"Chiu","sequence":"additional","affiliation":[{"name":"1 Faculty of Life Sciences and Institute of Genomes, National Yang-Ming University, Taipei, 2Brain Research Center, University System of Taiwan, Hsin-Chu, 3Department of Biomedical Engineering, Chung Yuan Christian University, Jhongli, 4Institute of Information Science, Academia Sinica, Taipei, Taiwan and 5Department of Cell Biology\/Biophysics, EMBL Heidelberg, 69117 Heidelberg, Germany"}]},{"given":"Chia-Cheng","family":"Hsiung","sequence":"additional","affiliation":[{"name":"1 Faculty of Life Sciences and Institute of Genomes, National Yang-Ming University, Taipei, 2Brain Research Center, University System of Taiwan, Hsin-Chu, 3Department of Biomedical Engineering, Chung Yuan Christian University, Jhongli, 4Institute of Information Science, Academia Sinica, Taipei, Taiwan and 5Department of Cell Biology\/Biophysics, EMBL Heidelberg, 69117 Heidelberg, Germany"}]},{"given":"May-I.","family":"Lee","sequence":"additional","affiliation":[{"name":"1 Faculty of Life Sciences and Institute of Genomes, National Yang-Ming University, Taipei, 2Brain Research Center, University System of Taiwan, Hsin-Chu, 3Department of Biomedical Engineering, Chung Yuan Christian University, Jhongli, 4Institute of Information Science, Academia Sinica, Taipei, Taiwan and 5Department of Cell Biology\/Biophysics, EMBL Heidelberg, 69117 Heidelberg, Germany"}]},{"given":"Jeremy C.","family":"Simpson","sequence":"additional","affiliation":[{"name":"1 Faculty of Life Sciences and Institute of Genomes, National Yang-Ming University, Taipei, 2Brain Research Center, University System of Taiwan, Hsin-Chu, 3Department of Biomedical Engineering, Chung Yuan Christian University, Jhongli, 4Institute of Information Science, Academia Sinica, Taipei, Taiwan and 5Department of Cell Biology\/Biophysics, EMBL Heidelberg, 69117 Heidelberg, Germany"}]},{"given":"Chun-Nan","family":"Hsu","sequence":"additional","affiliation":[{"name":"1 Faculty of Life Sciences and Institute of Genomes, National Yang-Ming University, Taipei, 2Brain Research Center, University System of Taiwan, Hsin-Chu, 3Department of Biomedical Engineering, Chung Yuan Christian University, Jhongli, 4Institute of Information Science, Academia Sinica, Taipei, Taiwan and 5Department of Cell Biology\/Biophysics, EMBL Heidelberg, 69117 Heidelberg, Germany"}]}],"member":"286","published-online":{"date-parts":[[2007,10,22]]},"reference":[{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"D505","DOI":"10.1093\/nar\/gkh022","article-title":"LIFEdb: a database for functional genomics experiments integrating information from external sources, and serving as a sample tracking system","volume":"32","author":"Bannasch","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"1213","DOI":"10.1093\/bioinformatics\/17.12.1213","article-title":"A neural network classifier capable of recognizing the patterns of all major subcellular structures in fluorescence microscope images of HeLa cells","volume":"17","author":"Boland","year":"2001","journal-title":"Bioinformatics"},{"key":"2023041107442286700_","first-page":"594","article-title":"Automated classification of cellular protein localization patterns obtained via fluorescence microscopy","author":"Boland","year":"1997"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"366","DOI":"10.1002\/(SICI)1097-0320(19981101)33:3<366::AID-CYTO12>3.0.CO;2-R","article-title":"Automated recognition of patterns characteristic of subcellular structures in fluorescence microscopy images","volume":"33","author":"Boland","year":"1998","journal-title":"Cytometry"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1186\/1471-2105-8-210","article-title":"A multiresolution approach to automated classification of protein subcellular location images","volume":"8","author":"Chebira","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023041107442286700_","first-page":"328","article-title":"Interpretation of protein subcellular location patterns in 3d images across cell types and resolutions","volume":"4414","author":"Chen","year":"2007","journal-title":"Lect. Notes Bioinformatics"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"1130","DOI":"10.1101\/gr.2383804","article-title":"Automatic identification of subcellular phenotypes on human cell arrays","volume":"14","author":"Conrad","year":"2004","journal-title":"Genome Res"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1613\/jair.105","article-title":"Solving multiclass learning problems via error-correcting output codes","volume":"2","author":"Dietterich","year":"1995","journal-title":"J. Artif. Intell. Res"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1016\/S0962-8924(98)01226-4","article-title":"Wanted: subcellular localization of proteins based on sequence","volume":"8","author":"Eisenhaber","year":"1998","journal-title":"Trends Cell Biol"},{"key":"2023041107442286700_","first-page":"23","article-title":"A decision-theoretic generalization of on-line learning and an application to boosting","author":"Freund","year":"1995"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1093\/nar\/24.1.46","article-title":"YPD-A database for the proteins of Saccharomyces cerevisiae","volume":"24","author":"Garrels","year":"1996","journal-title":"Nucleic Acids Res"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1016\/j.devcel.2006.12.007","article-title":"Automated subcellular location determination and high throughput microscopy","volume":"12","author":"Glory","year":"2007","journal-title":"Dev. Cell"},{"key":"2023041107442286700_","first-page":"145","article-title":"Multiclass learning, boosting, and error-correcting codes","author":"Guruswami","year":"1999"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1093\/nar\/30.1.80","article-title":"YPL.db: the yeast protein localization database","volume":"30","author":"Habeler","year":"2002","journal-title":"Nucleic Acids Res"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"786","DOI":"10.1109\/PROC.1979.11328","article-title":"Statistical and structural approaches to texture","volume":"67","author":"Haralick","year":"1979","journal-title":"Proc. IEEE"},{"key":"2023041107442286700_","first-page":"1139","article-title":"Automated classification of subcellular patterns in multicell images without segmentation into single cells","author":"Huang","year":"2004"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1186\/1471-2105-5-78","article-title":"Boosting accuracy of automated classification of fluorescence microscope images for location proteomics","volume":"5","author":"Huang","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023041107442286700_","article-title":"Majority vote classifiers: theory and applications","volume-title":"Ph.D. Thesis","author":"James","year":"1998"},{"key":"2023041107442286700_","article-title":"Boosting multiclass learning with repeating codes","volume-title":"Technical report TR-IIS-07-014","author":"Lin","year":"2007"},{"key":"2023041107442286700_","first-page":"569","article-title":"Multiclass boosting with repartitioning","author":"Ling","year":"2006"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"1845","DOI":"10.1109\/NSSMIC.2004.1462603","article-title":"Quality improvement of digitized radiographs by filtering technique development based on morphological transformations","volume-title":"2004 IEEE Nuclear Science Symposium Conference Record","author":"Movafeghi","year":"2004"},{"key":"2023041107442286700_","first-page":"393","article-title":"Object detection and localization using local and global features","volume-title":"Toward Category-Level Object Recognition, volume 4170 of Lecture Notes in Computer Science","author":"Murphy","year":"2006"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1016\/S0968-0004(98)01336-X","article-title":"Psort: a program for detecting sorting signals in proteins and predicting their subcellular localization","volume":"24","author":"Nakai","year":"1999","journal-title":"Trends in Biochem. Sci"},{"key":"2023041107442286700_","first-page":"313","article-title":"Using output codes to boost multiclass learning problems","author":"Schapire","year":"1997"},{"key":"2023041107442286700_","first-page":"1401","article-title":"A brief introduction to boosting","author":"Schapire","year":"1999"},{"key":"2023041107442286700_","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1093\/embo-reports\/kvd058","article-title":"Systematic subcellular localization of novel proteins identified by large-scale cDNA sequencing","volume":"1","author":"Simpson","year":"2000","journal-title":"EMBO Rep"},{"key":"2023041107442286700_","first-page":"1331","article-title":"Learning hierarchical models of scenes, objects, and parts","author":"Sudderth","year":"2005"},{"key":"2023041107442286700_","first-page":"872","article-title":"Unifying the error-correcting and output-code adaboost within the margin framework","author":"Sun","year":"2005"},{"key":"2023041107442286700_","first-page":"762","article-title":"Sharing features: efficient boosting procedures for multiclass object detection","author":"Torralba","year":"2004"},{"key":"2023041107442286700_","first-page":"280","article-title":"Adaptive quartile sigmoid function operator for color image contrast enhancement","author":"Wen","year":"2001"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/24\/3374\/49822569\/bioinformatics_23_24_3374.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/24\/3374\/49822569\/bioinformatics_23_24_3374.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,14]],"date-time":"2023-05-14T13:42:51Z","timestamp":1684071771000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/24\/3374\/263446"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,10,22]]},"references-count":30,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2007,12,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm497","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,12,15]]},"published":{"date-parts":[[2007,10,22]]}}}