{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,13]],"date-time":"2024-09-13T05:56:02Z","timestamp":1726206962058},"reference-count":88,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2021,3,12]],"date-time":"2021-03-12T00:00:00Z","timestamp":1615507200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61772115"],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,2]]},"abstract":"Discovering drug\u2013target (protein) interactions (DTIs) is of great significance for researching and developing novel drugs, having a tremendous advantage to pharmaceutical industries and patients. However, the prediction of DTIs using wet-lab experimental methods is generally expensive and time-consuming. Therefore, different machine learning-based methods have been developed for this purpose, but there are still substantial unknown interactions needed to discover. Furthermore, data imbalance and feature dimensionality problems are a critical challenge in drug-target datasets, which can decrease the classifier performances that have not been significantly addressed yet. This paper proposed a novel drug\u2013target interaction prediction method called PreDTIs. First, the feature vectors of the protein sequence are extracted by the pseudo-position-specific scoring matrix (PsePSSM), dipeptide composition (DC) and pseudo amino acid composition (PseAAC); and the drug is encoded with MACCS substructure fingerprints. 
Besides, we propose a FastUS algorithm to handle the class imbalance problem and also develop a MoIFS algorithm to remove the irrelevant and redundant features for getting the best optimal features. Finally, balanced and optimal features are provided to the LightGBM Classifier to identify DTIs, and the 5-fold cross-validation (CV) test method was applied to evaluate the prediction ability of the proposed method. Prediction results indicate that the proposed model PreDTIs is significantly superior to other existing methods in predicting DTIs, and our model could be used to discover new drugs for unknown disorders or infections, such as for the coronavirus disease 2019 using existing drug compounds and severe acute respiratory syndrome coronavirus 2 protein sequences.","DOI":"10.1093\/bib\/bbab046","type":"journal-article","created":{"date-parts":[[2021,1,29]],"date-time":"2021-01-29T12:11:57Z","timestamp":1611922317000},"source":"Crossref","is-referenced-by-count":43,"title":["PreDTIs: prediction of drug\u2013target interactions based on multiple feature information using gradient boosting framework with data balancing and feature selection techniques"],"prefix":"10.1093","volume":"22","author":[{"given":"S M Hasan","family":"Mahmud","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China"}]},{"given":"Wenyu","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China"}]},{"given":"Yongsheng","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China"}]},{"given":"Md Abdul","family":"Awal","sequence":"additional","affiliation":[{"name":"Electronics and Communication Engineering Discipline, Khulna 
University, Khulna 9208, Bangladesh"}]},{"ORCID":"http:\/\/orcid.org\/0000-0002-4034-9819","authenticated-orcid":false,"given":"Kawsar","family":"Ahmed","sequence":"additional","affiliation":[{"name":"Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University, Santosh, Tangail-1902, Bangladesh"}]},{"given":"Md Habibur","family":"Rahman","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Islamic University, Kushtia-7003, Bangladesh"}]},{"ORCID":"http:\/\/orcid.org\/0000-0003-0756-1006","authenticated-orcid":false,"given":"Mohammad Ali","family":"Moni","sequence":"additional","affiliation":[{"name":"UNSW Digital Health, WHO Center for eHealth, School of Public Health and Community Medicine, Faculty of Medicine, The University of New South Wales, Sydney, Australia"}]}],"member":"286","published-online":{"date-parts":[[2021,3,12]]},"reference":[{"key":"2021090815141358700_ref1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0009603","article-title":"Predicting drug-target interaction networks based on functional groups and biological features","volume":"5","author":"He","year":"2010","journal-title":"PLoS One"},{"key":"2021090815141358700_ref2","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1038\/nrd986","article-title":"Target selection in drug discovery","volume":"2","author":"Knowles","year":"2003","journal-title":"Nat Rev Drug Discov"},{"key":"2021090815141358700_ref3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/bib\/bbaa205","article-title":"DTI-MLCD: predicting drug-target interactions using multi-label learning with community detection method","author":"Chen","year":"2020","journal-title":"Brief Bioinform"},{"key":"2021090815141358700_ref4","doi-asserted-by":"publisher","first-page":"192","DOI":"10.1038\/nrd1032","article-title":"ADMET in silico modelling: towards prediction 
paradise?","volume":"2","author":"","year":"2003","journal-title":"Nat Rev Drug Discov"},{"key":"2021090815141358700_ref5","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1016\/S1359-6446(00)01559-2","article-title":"Predicting human safety\u00a0: screening and computational approaches","volume":"5","author":"Johnson","year":"2000","journal-title":"Drug Discov Today"},{"key":"2021090815141358700_ref6","doi-asserted-by":"crossref","first-page":"775","DOI":"10.1109\/TCBB.2014.2325031","article-title":"Network-based drug-target interaction prediction with probabilistic soft logic","volume":"11","author":"Fakhraei","year":"2014","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2021090815141358700_ref7","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1038\/462167a","article-title":"Predicting promiscuity","volume":"462","author":"Hopkins","year":"2009","journal-title":"Nature"},{"key":"2021090815141358700_ref8","doi-asserted-by":"publisher","first-page":"696","DOI":"10.1093\/bib\/bbv066","article-title":"Drug \u2013 target interaction prediction\u00a0: databases, web servers and computational models","volume":"17","author":"Chen","year":"2016","journal-title":"Brief Bioinform"},{"key":"2021090815141358700_ref9","doi-asserted-by":"publisher","first-page":"D109","DOI":"10.1093\/nar\/gkr988","article-title":"KEGG for integration and interpretation of large-scale molecular data sets","volume":"40","author":"Kanehisa","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2021090815141358700_ref10","doi-asserted-by":"publisher","first-page":"D1083","DOI":"10.1093\/nar\/gkt1031","article-title":"Overington, the ChEMBL bioactivity database: an update","volume":"42","author":"Bento","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2021090815141358700_ref11","doi-asserted-by":"publisher","first-page":"D1035","DOI":"10.1093\/nar\/gkq1126","article-title":"DrugBank 3.0: a comprehensive resource for \u201comics\u201d research on 
drugs","volume":"39","author":"Knox","year":"2011","journal-title":"Nucleic Acids Res"},{"key":"2021090815141358700_ref12","doi-asserted-by":"crossref","first-page":"412","DOI":"10.1093\/nar\/30.1.412","article-title":"TTD: therapeutic target database","volume":"30","author":"Chen","year":"2002","journal-title":"Nucleic Acids Res"},{"key":"2021090815141358700_ref13","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkp1014","article-title":"Update of TTD: therapeutic target database","volume":"38","author":"Zhu","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2021090815141358700_ref14","doi-asserted-by":"publisher","first-page":"D380","DOI":"10.1093\/nar\/gkv1277","article-title":"STITCH 5: augmenting protein-chemical interaction networks with tissue and affinity data","volume":"44","author":"Szklarczyk","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2021090815141358700_ref15","doi-asserted-by":"publisher","first-page":"637","DOI":"10.1016\/j.drudis.2013.11.005","article-title":"Toward better drug repositioning\u00a0: prioritizing and integrating existing methods into efficient pipelines","volume":"19","author":"Jin","year":"2014","journal-title":"Drug Discov Today"},{"key":"2021090815141358700_ref16","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/bib\/bbz157","article-title":"Machine learning approaches and databases for prediction of drug \u2013 target interaction\u00a0: a survey paper","volume":"00","author":"Bagherian","year":"2019","journal-title":"Brief Bioinform"},{"key":"2021090815141358700_ref17","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1038\/nbt1284","article-title":"Relating protein pharmacology by ligand chemistry","volume":"25","author":"Keiser","year":"2007","journal-title":"Nat Biotechnol"},{"key":"2021090815141358700_ref18","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0063730","article-title":"Insights into an original pocket-ligand pair classification\u00a0: a promising tool for 
ligand profile prediction","volume":"8","author":"Regad","year":"2013","journal-title":"PLoS One"},{"key":"2021090815141358700_ref19","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1038\/nbt1273","article-title":"Structure-based maximal affinity model predicts small-molecule druggability","volume":"25","author":"Cheng","year":"2007","journal-title":"Nat Biotechnol"},{"key":"2021090815141358700_ref20","doi-asserted-by":"publisher","first-page":"1277","DOI":"10.1038\/nprot.2013.074","article-title":"Small-molecule ligand docking into comparative models with Rosetta","volume":"8","author":"Combs","year":"2013","journal-title":"Nat Protoc"},{"key":"2021090815141358700_ref21","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1093\/bioinformatics\/bti1141","article-title":"A probabilistic model for mining implicit \u2018chemical compound \u2013 gene\u2019 relations from literature","volume":"21","author":"Zhu","year":"2005","journal-title":"Bioinformatics"},{"key":"2021090815141358700_ref22","doi-asserted-by":"publisher","first-page":"1273","DOI":"10.1517\/17425255.2014.950222","article-title":"Drug\u2013target interaction prediction via chemogenomic space: learning-based methods","volume":"10","author":"Mousavian","year":"2014","journal-title":"Expert Opin Drug Metab Toxicol"},{"key":"2021090815141358700_ref23","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1093\/bib\/bbw012","article-title":"SDTNBI: An integrated network and chemoinformatics tool for systematic prediction of drug-target interactions and drug repositioning","volume":"18","author":"Wu","year":"2017","journal-title":"Brief Bioinform"},{"key":"2021090815141358700_ref24","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/bib\/bby061","article-title":"Recent applications of deep learning and machine intelligence on in silico drug discovery\u00a0: methods, tools and databases","author":"Rifaioglu","year":"2018","journal-title":"Brief 
Bioinform"},{"key":"2021090815141358700_ref25","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0037608","article-title":"A systematic prediction of multiple drug-target interactions from chemical, genomic and pharmacological data","volume":"7","author":"Yu","year":"2012","journal-title":"PLoS One"},{"key":"2021090815141358700_ref26","doi-asserted-by":"publisher","first-page":"48699","DOI":"10.1109\/ACCESS.2019.2910277","article-title":"iDTi-CSsmoteB\u00a0: identification of drug\u2013target interaction based on drug chemical structure and protein sequence using XGBoost with over-sampling technique SMOTE","volume":"7","author":"Mahmud","year":"2019","journal-title":"IEEE Access"},{"key":"2021090815141358700_ref27","doi-asserted-by":"publisher","first-page":"2304","DOI":"10.1093\/bioinformatics\/bts360","article-title":"Predicting drug-target interactions from chemical and genomic kernels using Bayesian matrix factorization","volume":"28","author":"G\u00f6nen","year":"2012","journal-title":"Bioinformatics"},{"key":"2021090815141358700_ref28","first-page":"1025","article-title":"Collaborative matrix factorization with multiple similarities for predicting drug-target interactions categories and subject descriptors, in: 19th ACM SIGKDD Int","author":"Zheng"},{"key":"2021090815141358700_ref29","doi-asserted-by":"publisher","DOI":"10.1109\/TCBB.2016.2530062","article-title":"Drug-target interaction prediction with graph regularized matrix factorization","author":"Ezzat","year":"2016","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2021090815141358700_ref30","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/bib\/bbaa025","article-title":"Coupled matrix \u2013 matrix and coupled tensor \u2013 matrix completion methods for predicting drug \u2013 target interactions","volume":"00","author":"Bagherian","year":"2020","journal-title":"Brief 
Bioinform"},{"key":"2021090815141358700_ref31","doi-asserted-by":"publisher","first-page":"1970","DOI":"10.1039\/c2mb00002d","article-title":"Drug-target interaction prediction by random walk on the heterogeneous network","volume":"8","author":"Chen","year":"2012","journal-title":"Mol Biosyst"},{"key":"2021090815141358700_ref32","doi-asserted-by":"crossref","DOI":"10.1007\/s10916-018-1003-9","article-title":"A survey of data mining and deep learning in bioinformatics","volume":"42","author":"Lan","year":"2018","journal-title":"J Med Syst"},{"key":"2021090815141358700_ref33","doi-asserted-by":"publisher","first-page":"785","DOI":"10.1145\/2939672.2939785","author":"Chen","year":"2016"},{"key":"2021090815141358700_ref34","doi-asserted-by":"publisher","DOI":"10.1109\/TCBB.2019.2940187","article-title":"A convolutional neural network system to discriminate drug-target interactions","author":"Hu","year":"2019"},{"key":"2021090815141358700_ref35","doi-asserted-by":"publisher","first-page":"42","DOI":"10.1016\/j.vascn.2015.11.002","article-title":"Drug-target interaction prediction from PSSM based evolutionary information","volume":"78","author":"Mousavian","year":"2016","journal-title":"J Pharmacol Toxicol Methods"},{"key":"2021090815141358700_ref36","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1016\/j.jtbi.2013.08.013","article-title":"ICDI-PseFpt: identify the channel-drug interaction in cellular networking with PseAAC and molecular fingerprints","volume":"337","author":"Xiao","year":"2013","journal-title":"J Theor Biol"},{"key":"2021090815141358700_ref37","doi-asserted-by":"publisher","first-page":"80","DOI":"10.4018\/IJACI.2020040105","article-title":"Behavioural intention of customers towards smartwatches in an ambient environment using soft computing: An integrated SEM-PLS and fuzzy rough set approach","volume":"11","author":"Kiruba B","year":"2020","journal-title":"Int J Ambient Comput 
Intell"},{"key":"2021090815141358700_ref38","doi-asserted-by":"publisher","first-page":"232","DOI":"10.1093\/bioinformatics\/btn162","article-title":"Prediction of drug-target interaction networks from the integration of chemical and genomic spaces","volume":"24","author":"Yamanishi","year":"2008","journal-title":"Bioinformatics"},{"key":"2021090815141358700_ref39","doi-asserted-by":"publisher","first-page":"246","DOI":"10.1093\/bioinformatics\/btq176","article-title":"Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework","volume":"26","author":"Yamanishi","year":"2010","journal-title":"Bioinformatics"},{"key":"2021090815141358700_ref40","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1016\/j.aca.2016.01.014","article-title":"Improved prediction of drug-target interactions using regularized least squares integrating with kernel fusion technique","volume":"909","author":"Hao","year":"2016","journal-title":"Anal Chim Acta"},{"key":"2021090815141358700_ref41","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-017-10724-0","article-title":"In silico prediction of drug-target interaction networks based on drug chemical structure and protein sequences","volume":"7","author":"Li","year":"2017","journal-title":"Sci Rep"},{"key":"2021090815141358700_ref42","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-017-18025-2","article-title":"IDTI-ESBoost: identification of drug target interaction using evolutionary and structural features with boosting","volume":"7","author":"Rayhan","year":"2017","journal-title":"Sci Rep"},{"key":"2021090815141358700_ref43","doi-asserted-by":"publisher","first-page":"445","DOI":"10.2174\/1389203718666161114111656","article-title":"RFDT: a rotation Forest-based predictor for predicting drug-target interactions using drug structure and protein sequence information","volume":"19","author":"Wang","year":"2018","journal-title":"Curr Protein Pept 
Sci"},{"key":"2021090815141358700_ref44","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1016\/j.compbiolchem.2019.03.016","article-title":"Predicting drug-target interaction network using deep learning model","volume":"80","author":"You","year":"2019","journal-title":"Comput Biol Chem"},{"key":"2021090815141358700_ref45","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.ygeno.2018.12.007","article-title":"Predicting drug-target interactions using Lasso with random forest based on evolutionary information and chemical structure","author":"Shi","year":"2018","journal-title":"Genomics"},{"key":"2021090815141358700_ref46","doi-asserted-by":"publisher","first-page":"256","DOI":"10.1016\/j.neucom.2016.10.039","article-title":"DrugRPE\u00a0: random projection ensemble approach to drug-target interaction prediction","volume":"228","author":"Zhang","year":"2017","journal-title":"Neurocomputing"},{"key":"2021090815141358700_ref47","doi-asserted-by":"publisher","DOI":"10.1016\/j.ab.2019.113507","article-title":"Prediction of drug-target interaction based on protein features using undersampling and feature selection techniques with boosting","volume":"589","author":"Mahmud","year":"2020","journal-title":"Anal Biochem"},{"key":"2021090815141358700_ref48","doi-asserted-by":"crossref","DOI":"10.1111\/j.1469-1809.1936.tb02137.x","article-title":"The use of multiple measurements in taxonomic problems","volume":"7","author":"FISHER","year":"1936","journal-title":"Ann Eugen"},{"key":"2021090815141358700_ref49","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1037\/h0071325","article-title":"Analysis of a complex of statistical variables into principal components","volume":"24","author":"Hotelling","year":"1993","journal-title":"J Educ Psychol"},{"key":"2021090815141358700_ref50","doi-asserted-by":"crossref","DOI":"10.1038\/scientificamerican0792-66","article-title":"Genetic 
algorithms","volume":"267","author":"Holland","year":"1992","journal-title":"Sci Am"},{"key":"2021090815141358700_ref51","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1023\/A:1025667309714","article-title":"M., Kononenko, theoretical and empirical analysis of ReliefF and RReliefF","volume":"53","author":"Robnik-\u0160ikonja","year":"2003","journal-title":"Mach Learn"},{"key":"2021090815141358700_ref52","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-020-00447-2","article-title":"DTiGEMS +\u2009: drug \u2013 target interaction prediction using graph embedding , graph mining, and similarity - based techniques, J","volume":"12","author":"Thafar","year":"2020","journal-title":"Chem"},{"key":"2021090815141358700_ref53","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12859-020-3518-6","article-title":"Drug-target interaction prediction using semi-bipartite graph model and deep learning","volume":"21","author":"Manoochehri","year":"2020","journal-title":"BMC Bioinformatics"},{"key":"2021090815141358700_ref54","doi-asserted-by":"publisher","first-page":"1","DOI":"10.3389\/fbioe.2020.00338","article-title":"Prediction of drug\u2013target interactions from multi-molecular network based on deep walk embedding model, front","volume":"8","author":"Chen","year":"2020","journal-title":"Bioeng Biotechnol"},{"key":"2021090815141358700_ref55","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkx1037","article-title":"DrugBank 5.0\u00a0: a major update to the DrugBank database for 2018","author":"Wishart","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2021090815141358700_ref56","doi-asserted-by":"publisher","first-page":"919","DOI":"10.1093\/nar\/gkm862","article-title":"SuperTarget and matador: resources for exploring drug-target relationships","volume":"36","author":"G\u00fcnther","year":"2008","journal-title":"Nucleic Acids 
Res"},{"key":"2021090815141358700_ref57","doi-asserted-by":"publisher","first-page":"480","DOI":"10.1093\/nar\/gkm882","article-title":"KEGG for linking genomes to life and the environment","volume":"36","author":"Kanehisa","year":"2008","journal-title":"Nucleic Acids Res"},{"key":"2021090815141358700_ref58","doi-asserted-by":"publisher","first-page":"431D","DOI":"10.1093\/nar\/gkh081","article-title":"BRENDA, the enzyme database: updates and major new developments","volume":"32","author":"Schomburg","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2021090815141358700_ref59","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1089\/cmb.2017.0135","article-title":"Based method for predicting drug\u2013target interactions by using stacked autoencoder deep neural","volume":"24","author":"Wang","year":"2017","journal-title":"Network"},{"key":"2021090815141358700_ref60","doi-asserted-by":"publisher","first-page":"546","DOI":"10.1016\/j.ins.2017.08.045","article-title":"Identification of drug-target interactions via multiple information integration","volume":"418\u2013419","author":"Ding","year":"2017","journal-title":"Inf Sci (Ny)"},{"key":"2021090815141358700_ref61","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.aca.2012.09.021","article-title":"Large-scale prediction of drug-target interactions using protein sequences and drug topological structures","volume":"752","author":"Cao","year":"2012","journal-title":"Anal Chim Acta"},{"key":"2021090815141358700_ref62","doi-asserted-by":"publisher","first-page":"1092","DOI":"10.1093\/bioinformatics\/btt105","article-title":"ChemoPy\u00a0: freely available python package for computational biology and chemoinformatics","volume":"29","author":"Cao","year":"2013","journal-title":"Bioinformatics"},{"key":"2021090815141358700_ref63","doi-asserted-by":"publisher","DOI":"10.1093\/protein\/gzm057","article-title":"Nuc-PLoc\u00a0: a new web-server for predicting protein subnuclear localization by fusing 
PseAA composition and PsePSSM","author":"Shen","year":"2013","journal-title":"Protein Eng Des Sel"},{"key":"2021090815141358700_ref64","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1006\/jmbi.1999.3091","article-title":"Protein secondary structure prediction based on position-specific scoring matrices","volume":"292","author":"Jones","year":"1999","journal-title":"J Mol Biol"},{"key":"2021090815141358700_ref65","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res"},{"key":"2021090815141358700_ref66","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1093\/bioinformatics\/bth466","article-title":"Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes","volume":"21","author":"Chou","year":"2005","journal-title":"Bioinformatics"},{"key":"2021090815141358700_ref67","doi-asserted-by":"publisher","DOI":"10.1080\/07391102.2015.1095116","article-title":"Identification of protein-protein binding sites by incorporating the physicochemical properties and stationary wavelet transforms into pseudo amino acid composition","author":"Jia","journal-title":"J Biomol Struct Dyn"},{"key":"2021090815141358700_ref68","doi-asserted-by":"publisher","first-page":"80","DOI":"10.1016\/j.jtbi.2017.08.009","article-title":"Highly accurate prediction of protein self-interactions by incorporating the average block and PSSM information into the general PseAAC","volume":"432","author":"Zhai","year":"2017","journal-title":"J Theor Biol"},{"key":"2021090815141358700_ref69","doi-asserted-by":"publisher","first-page":"558","DOI":"10.1039\/C4MB00645C","article-title":"Molecular BioSystems predicting the subcellular localization of mycobacterial proteins by incorporating the optimal tripeptides into the general form of pseudo amino acid 
composition","volume":"11","author":"Zhu","year":"2015","journal-title":"Mol Biosyst"},{"key":"2021090815141358700_ref70","doi-asserted-by":"publisher","first-page":"218","DOI":"10.1016\/j.compbiolchem.2011.05.003","article-title":"CE-PLoc\u00a0: An ensemble classifier for predicting protein subcellular locations by fusing different modes of pseudo amino acid composition","volume":"35","author":"Khan","year":"2011","journal-title":"Comput Biol Chem"},{"key":"2021090815141358700_ref71","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-018-0270-2","article-title":"PyBioMed: a python library for various molecular representations of chemicals, proteins and DNAs and their interactions","volume":"10","author":"Dong","year":"2018","journal-title":"J Chem"},{"key":"2021090815141358700_ref72","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.ygeno.2018.12.007","article-title":"Predicting drug-target interactions using Lasso with random forest based on evolutionary information and chemical structure","author":"Shi","year":"2018","journal-title":"Genomics"},{"key":"2021090815141358700_ref73","doi-asserted-by":"publisher","first-page":"5718","DOI":"10.1016\/j.eswa.2008.06.108","article-title":"Cluster-based under-sampling approaches for imbalanced data distributions","volume":"36","author":"Yen","year":"2009","journal-title":"Expert Syst Appl"},{"key":"2021090815141358700_ref74","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-10-7242-0_3","article-title":"Rare event prediction using similarity majority under-sampling technique","author":"Li","year":"2017","journal-title":"Soft Comput Data Sci"},{"key":"2021090815141358700_ref75","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TSMC.2020.3016283","article-title":"Neural network-based undersampling techniques, IEEE Transactions on Systems, Man, and 
Cybernetics","author":"Arefeen","year":"2020"},{"key":"2021090815141358700_ref76","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-017-14945-1","article-title":"iDNAProt-ES\u00a0: identification of DNA-binding proteins using evolutionary and structural features","author":"Chowdhury","year":"2017","journal-title":"Sci Rep"},{"key":"2021090815141358700_ref77","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1023\/A:1008363719778","article-title":"Incremental feature selection","volume":"9","author":"Liu","year":"1998","journal-title":"Appl Intell"},{"key":"2021090815141358700_ref78","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-017-13259-6","article-title":"RIFS: a randomly restarted incremental feature selection algorithm","author":"Ye","year":"2017","journal-title":"Sci Rep"},{"key":"2021090815141358700_ref79","doi-asserted-by":"publisher","first-page":"1189","DOI":"10.2307\/2699986","article-title":"Greedy function approximation: a gradient boosting machine","volume":"29","author":"Friedman","year":"..","journal-title":"Ann Stat"},{"key":"2021090815141358700_ref80","first-page":"3146","article-title":"LightGBM: a highly efficient gradient boosting decision tree","volume-title":"31st Conference on Neural Information Processing Systems (NIPS)","author":"Ke","year":"2017"},{"key":"2021090815141358700_ref81","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1080\/14786440109462720","article-title":"On lines and planes of closest fit to systems of points in space","volume":"2","author":"Karl Pearson","year":"2010","journal-title":"Philos Mag"},{"key":"2021090815141358700_ref82","doi-asserted-by":"publisher","first-page":"45","DOI":"10.4018\/IJACI.2017100104","article-title":"CA as dimensionality reduction for large-scale image retrieval systems","volume":"8","author":"Belarbi","year":"2017","journal-title":"Int J Ambient Comput 
Intell"},{"key":"2021090815141358700_ref83","doi-asserted-by":"publisher","first-page":"832","DOI":"10.1109\/34.709601","article-title":"The random subspace method for constructing decision forests","volume":"20","author":"Ho","year":"1998","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2021090815141358700_ref84","doi-asserted-by":"publisher","first-page":"273","DOI":"10.1111\/j.1747-0285.2009.00840.x","article-title":"Support-vector networks","volume":"297","author":"Cortes","year":"1995","journal-title":"Mach Learn"},{"key":"2021090815141358700_ref85","doi-asserted-by":"publisher","first-page":"468","DOI":"10.2174\/1389203718666161122103057","article-title":"A systematic prediction of drug-target interactions using molecular fingerprints and protein sequences","volume":"19","author":"Huang","year":"2018","journal-title":"Curr Protein Pept Sci"},{"key":"2021090815141358700_ref86","doi-asserted-by":"publisher","DOI":"10.3390\/molecules22071119","article-title":"Prediction of drug\u2013target interaction networks from the integration of protein sequences and drug chemical structures","volume":"22","author":"Meng","year":"2017","journal-title":"Molecules"},{"key":"2021090815141358700_ref87","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1093\/nar\/27.1.29","article-title":"KEGG: Kyoto encyclopedia of genes and genomes","volume":"27","author":"Ogata","year":"1999","journal-title":"Nucleic Acids Res"},{"key":"2021090815141358700_ref88","doi-asserted-by":"publisher","first-page":"80","DOI":"10.4018\/IJACI.2020070105","article-title":"Statistical study of machine learning algorithms using parametric and non-parametric tests: a comparative analysis and recommendations","author":"Khadse","year":"2020","journal-title":"Int J Ambient Comput Intell"}],"container-title":["Briefings in 
Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/5\/bbab046\/40259701\/bbab046.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/5\/bbab046\/40259701\/bbab046.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,8]],"date-time":"2021-09-08T15:21:29Z","timestamp":1631114489000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbab046\/6168499"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,12]]},"references-count":88,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2021,9,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbab046","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,9]]},"published":{"date-parts":[[2021,3,12]]}}}