{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,11,19]],"date-time":"2024-11-19T17:34:20Z","timestamp":1732037660735},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"23","license":[{"start":{"date-parts":[[2018,6,1]],"date-time":"2018-06-01T00:00:00Z","timestamp":1527811200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61701340","61702361"],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100011408","name":"State Key Laboratory of Medicinal Chemical Biology in China","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100011408","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100000923","name":"Australian Research Council","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100000923","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000163","name":"ARC","doi-asserted-by":"publisher","award":["LP110200333","DP120104460"],"id":[{"id":"10.13039\/100000163","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000060","name":"National Institute of Allergy and Infectious Diseases","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000060","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01 AI111965"],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Major Inter-Disciplinary Research"},{"name":"IDR"},{"DOI":"10.13039\/501100001779","name":"Monash 
University","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001779","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,12,1]]},"abstract":"Motivation: Anti-cancer peptides (ACPs) have recently emerged as promising therapeutic agents for cancer treatment. Due to the avalanche of protein sequence data in the post-genomic era, there is an urgent need to develop automated computational methods to enable fast and accurate identification of novel ACPs within the vast number of candidate proteins and peptides.\nResults: To address this, we propose a novel predictor named Anti-Cancer peptide Predictor with Feature representation Learning (ACPred-FL) for accurate prediction of ACPs based on sequence information. More specifically, we develop an effective feature representation learning model, with which we can extract and learn a set of informative features from a pool of support vector machine-based models trained using sequence-based feature descriptors. By doing so, the class label information of data samples is fully utilized. To improve the feature representation, we further employ a two-step feature selection technique, resulting in a most informative five-dimensional feature vector for the final peptide representation. Experimental results show that such five features provide the most discriminative power for identifying ACPs than currently available feature descriptors, highlighting the effectiveness of the proposed feature representation learning approach. 
The developed ACPred-FL method significantly outperforms state-of-the-art methods.\nAvailability and implementation: The web-server of ACPred-FL is available at http:\/\/server.malab.cn\/ACPred-FL.\nSupplementary information: Supplementary data are available at Bioinformatics online.","DOI":"10.1093\/bioinformatics\/bty451","type":"journal-article","created":{"date-parts":[[2018,5,29]],"date-time":"2018-05-29T11:13:56Z","timestamp":1527592436000},"page":"4007-4016","source":"Crossref","is-referenced-by-count":351,"title":["ACPred-FL: a sequence-based predictor using effective feature representation to improve the prediction of anti-cancer peptides"],"prefix":"10.1093","volume":"34","author":[{"ORCID":"http:\/\/orcid.org\/0000-0003-1444-190X","authenticated-orcid":false,"given":"Leyi","family":"Wei","sequence":"first","affiliation":[{"name":"School of Computer Science and Technology, Tianjin University, Tianjin, China"},{"name":"State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China"}]},{"given":"Chen","family":"Zhou","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Tianjin University, Tianjin, China"}]},{"given":"Huangrong","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Tianjin University, Tianjin, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0001-8031-9086","authenticated-orcid":false,"given":"Jiangning","family":"Song","sequence":"additional","affiliation":[{"name":"Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology"},{"name":"Monash Centre for Data Science, Faculty of Information Technology, Monash University, Clayton, VIC 3800, Australia"}]},{"given":"Ran","family":"Su","sequence":"additional","affiliation":[{"name":"School of Computer Software, Tianjin University, Tianjin, 
China"},{"name":"State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China"}]}],"member":"286","published-online":{"date-parts":[[2018,6,1]]},"reference":[{"key":"2023012712291408500_bty451-B1","doi-asserted-by":"crossref","first-page":"1153","DOI":"10.2174\/138920111796117337","article-title":"Promises of apoptosis-inducing peptides in cancer therapeutics","volume":"12","author":"Barras","year":"2011","journal-title":"Curr. Pharm. Biotechnol."},{"key":"2023012712291408500_bty451-B2","doi-asserted-by":"crossref","first-page":"3794","DOI":"10.2174\/092986712801661004","article-title":"The use of therapeutic peptides to target and to kill cancer cells","volume":"19","author":"Boohaker","year":"2012","journal-title":"Curr. Med. Chem."},{"key":"2023012712291408500_bty451-B3","doi-asserted-by":"crossref","first-page":"16895","DOI":"10.18632\/oncotarget.7815","article-title":"iACP: a sequence-based tool for identifying anticancer peptides","volume":"7","author":"Chen","year":"2016","journal-title":"Oncotarget"},{"key":"2023012712291408500_bty451-B4","first-page":"294","article-title":"From antimicrobial to anticancer peptides","volume":"4","author":"Diana","year":"2013","journal-title":"A review. Front. Microbiol."},{"key":"2023012712291408500_bty451-B5","first-page":"185","article-title":"Minimum redundancy feature selection from microarray gene expression data","volume-title":"J. Bioinform. Comput. 
Biol.","author":"Ding","year":"2003"},{"key":"2023012712291408500_bty451-B6","doi-asserted-by":"crossref","first-page":"1459","DOI":"10.1007\/s00726-014-1711-5","article-title":"PhosphoSVM: prediction of phosphorylation sites by integrating various protein sequence attributes with a support vector machine","volume":"46","author":"Dou","year":"2014","journal-title":"Amino Acids"},{"key":"2023012712291408500_bty451-B7","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1002\/(SICI)1097-0134(19990601)35:4<401::AID-PROT3>3.0.CO;2-K","article-title":"Recognition of a protein fold in the context of the SCOP classification","volume":"35","author":"Dubchak","year":"1999","journal-title":"Prot. Struct. Funct. Bioinform."},{"key":"2023012712291408500_bty451-B8","doi-asserted-by":"crossref","first-page":"2893","DOI":"10.1002\/ijc.25516","article-title":"Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008","volume":"127","author":"Ferlay","year":"2010","journal-title":"Int. J. Cancer"},{"key":"2023012712291408500_bty451-B9","doi-asserted-by":"crossref","first-page":"906","DOI":"10.1093\/bioinformatics\/16.10.906","article-title":"Support vector machine classification and validation of cancer tissue samples using microarray expression data","volume":"16","author":"Furey","year":"2000","journal-title":"Bioinformatics"},{"key":"2023012712291408500_bty451-B10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/INDCON.2011.6139332","article-title":"Composition, Transition and Distribution (CTD)\u2014a dynamic feature for predictions based on hierarchical structure of cellular sorting","volume-title":"IEEE 2011 Annual IEEE India Conference","author":"Govindan","year":"2011"},{"key":"2023012712291408500_bty451-B11","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1016\/j.jtbi.2013.08.037","article-title":"Predicting anticancer peptides with Chou\u2019s pseudo amino acid composition and investigating their mutagenicity via Ames 
test","volume":"341","author":"Hajisharifi","year":"2014","journal-title":"J. Theor. Biol."},{"key":"2023012712291408500_bty451-B12","doi-asserted-by":"crossref","first-page":"714","DOI":"10.1038\/nrc3599","article-title":"Cancer drug resistance: an evolving paradigm","volume":"13","author":"Holohan","year":"2013","journal-title":"Nat. Rev. Cancer"},{"key":"2023012712291408500_bty451-B13","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1214\/aos\/1033066197","article-title":"Nonparametric and semiparametric estimation of the receiver operating characteristic curve","volume":"24","author":"Hsieh","year":"1996","journal-title":"Ann. Stat."},{"key":"2023012712291408500_bty451-B14","doi-asserted-by":"crossref","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","article-title":"Cd-Hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences","volume":"22","author":"Li","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012712291408500_bty451-B15","doi-asserted-by":"crossref","first-page":"W385","DOI":"10.1093\/nar\/gkr284","article-title":"PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence","volume":"39","author":"Li","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023012712291408500_bty451-B16","doi-asserted-by":"crossref","first-page":"933","DOI":"10.1517\/13543784.15.8.933","article-title":"Cationic antimicrobial peptides as novel cytotoxic agents for cancer treatment","volume":"15","author":"Mader","year":"2006","journal-title":"Expert Opin. Investig. Drugs"},{"key":"2023012712291408500_bty451-B17","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/978-1-59745-419-3_1","article-title":"Peptide-based drug design: here and now","volume":"494","author":"Otvos","year":"2008","journal-title":"Methods Mol. 
Biol."},{"key":"2023012712291408500_bty451-B18","doi-asserted-by":"crossref","first-page":"1226","DOI":"10.1109\/TPAMI.2005.159","article-title":"Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy","volume":"27","author":"Peng","year":"2005","journal-title":"IEEE Trans. Pattern Anal. Mach. Intel."},{"key":"2023012712291408500_bty451-B19","doi-asserted-by":"crossref","first-page":"277","DOI":"10.3322\/caac.20073","article-title":"Cancer statistics, 2013","volume":"60","author":"Jemal","year":"2010","journal-title":"CA Cancer J. Clin."},{"key":"2023012712291408500_bty451-B20","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1038\/srep02984","article-title":"In silico models for designing and discovering novel anticancer peptides","volume":"3","author":"Tyagi","year":"2013","journal-title":"Sci. Rep."},{"key":"2023012712291408500_bty451-B21","doi-asserted-by":"crossref","first-page":"D837","DOI":"10.1093\/nar\/gku892","article-title":"CancerPPD: a database of anticancer peptides and proteins","volume":"43","author":"Tyagi","year":"2015","journal-title":"Nucleic Acids Res."},{"key":"2023012712291408500_bty451-B22","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1007\/s10989-014-9435-7","article-title":"ACPP: a web server for prediction and design of anti-cancer peptides","volume":"21","author":"Vijayakumar","year":"2015","journal-title":"Int. J. Pept. Res. Ther."},{"key":"2023012712291408500_bty451-B23","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12864-017-4128-1","article-title":"SkipCPP-Pred: an improved and promising sequence-based predictor for predicting cell-penetrating peptides","volume":"18","author":"Wei","year":"2017","journal-title":"BMC Genomics"},{"key":"2023012712291408500_bty451-B24","article-title":"Fast prediction of methylation sites using sequence-based feature selection technique","author":"Wei","year":"2017","journal-title":"IEEE\/ACM Trans. Comput. Biol. 
Bioinform"},{"key":"2023012712291408500_bty451-B25","doi-asserted-by":"crossref","first-page":"2044","DOI":"10.1021\/acs.jproteome.7b00019","article-title":"CPPred-RF: a sequence-based predictor for identifying cell-penetrating peptides and their uptake efficiency","volume":"16","author":"Wei","year":"2017","journal-title":"J. Proteome Res."},{"key":"2023012712291408500_bty451-B26","first-page":"1100","article-title":"A direct method of nonparametric measurement selection","volume":"20","author":"Whitney","year":"2006","journal-title":"IEEE Trans. Computers"},{"key":"2023012712291408500_bty451-B27","doi-asserted-by":"crossref","first-page":"1375","DOI":"10.3390\/e15041375","article-title":"Classification of knee joint vibration signals using bivariate feature distribution estimation and maximal posterior probability decision criterion","volume":"15","author":"Wu","year":"2013","journal-title":"Entropy"},{"key":"2023012712291408500_bty451-B28","doi-asserted-by":"crossref","first-page":"46757","DOI":"10.1038\/srep46757","article-title":"Identifying N6-methyladenosine sites using multi-interval nucleotide pair position specificity and support vector machine","volume":"7","author":"Xing","year":"2017","journal-title":"Sci. 
Rep."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/23\/4007\/48919948\/bioinformatics_34_23_4007.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/23\/4007\/48919948\/bioinformatics_34_23_4007.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T13:16:32Z","timestamp":1674825392000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/23\/4007\/5026665"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,6,1]]},"references-count":28,"journal-issue":{"issue":"23","published-print":{"date-parts":[[2018,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty451","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,12,1]]},"published":{"date-parts":[[2018,6,1]]}}}