{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,23]],"date-time":"2024-09-23T04:31:10Z","timestamp":1727065870276},"reference-count":101,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T00:00:00Z","timestamp":1684195200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T00:00:00Z","timestamp":1684195200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100002790","name":"Canadian Network for Research and Innovation in Machining Technology, Natural Sciences and Engineering Research Council of Canada","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100002790","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100011958","name":"Danmarks Frie Forskningsfond","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100011958","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Data Min Knowl Disc"],"published-print":{"date-parts":[[2023,7]]},"abstract":"Abstract<\/jats:title>It has been shown that unsupervised outlier detection methods can be adapted to the one-class classification problem (Janssens and Postma, in: Proceedings of the 18th annual Belgian-Dutch on machine learning, pp 56\u201364, 2009; Janssens et al. in: Proceedings of the 2009 ICMLA international conference on machine learning and applications, IEEE Computer Society, pp 147\u2013153, 2009. https:\/\/doi.org\/10.1109\/ICMLA.2009.16<\/jats:ext-link>). In this paper, we focus on the comparison of one-class classification algorithms with such adapted unsupervised outlier detection methods, improving on previous comparison studies in several important aspects. We study a number of one-class classification and unsupervised outlier detection methods in a rigorous experimental setup, comparing them on a large number of datasets with different characteristics, using different performance measures. In contrast to previous comparison studies, where the models (algorithms, parameters) are selected by using examples from both classes (outlier and inlier), here we also study and compare different approaches for model selection in the absence of examples from the outlier class, which is more realistic for practical applications since labeled outliers are rarely available. Our results showed that, overall, SVDD and GMM are top-performers, regardless of whether the ground truth is used for parameter selection or not. However, in specific application scenarios, other methods exhibited better performance. Combining one-class classifiers into ensembles showed better performance than individual methods in terms of accuracy, as long as the ensemble members are properly selected.<\/jats:p>","DOI":"10.1007\/s10618-023-00931-x","type":"journal-article","created":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T13:02:32Z","timestamp":1684242152000},"page":"1473-1517","update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["On the evaluation of outlier detection and one-class classification: a comparative study of algorithms, model selection, and ensembles"],"prefix":"10.1007","volume":"37","author":[{"ORCID":"http:\/\/orcid.org\/0000-0002-8273-5814","authenticated-orcid":false,"given":"Henrique O.","family":"Marques","sequence":"first","affiliation":[]},{"given":"Lorne","family":"Swersky","sequence":"additional","affiliation":[]},{"given":"J\u00f6rg","family":"Sander","sequence":"additional","affiliation":[]},{"given":"Ricardo J. G. B.","family":"Campello","sequence":"additional","affiliation":[]},{"given":"Arthur","family":"Zimek","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,5,16]]},"reference":[{"issue":"2s","key":"931_CR1","doi-asserted-by":"publisher","first-page":"937","DOI":"10.1007\/s13198-016-0551-y","volume":"8","author":"AO Adewumi","year":"2017","unstructured":"Adewumi AO, Akinyelu AA (2017) A survey of machine-learning and nature-inspired based credit card fraud detection techniques. Int J Syst Assur Eng Manag 8(2s):937\u2013953. https:\/\/doi.org\/10.1007\/s13198-016-0551-y","journal-title":"Int J Syst Assur Eng Manag"},{"key":"931_CR2","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4614-6396-2","volume-title":"Outlier analysis","author":"CC Aggarwal","year":"2013","unstructured":"Aggarwal CC (2013) Outlier analysis. Springer. https:\/\/doi.org\/10.1007\/978-1-4614-6396-2"},{"key":"931_CR3","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-54765-7","volume-title":"Outlier ensembles\u2014an introduction","author":"CC Aggarwal","year":"2017","unstructured":"Aggarwal CC, Sathe S (2017) Outlier ensembles\u2014an introduction. Springer. https:\/\/doi.org\/10.1007\/978-3-319-54765-7"},{"key":"931_CR4","doi-asserted-by":"publisher","DOI":"10.1016\/j.media.2019.101618","author":"Z Alaverdyan","year":"2020","unstructured":"Alaverdyan Z, Jung J, Bouet R et al (2020) Regularized siamese neural network for unsupervised outlier detection on brain multiparametric magnetic resonance imaging: application to epilepsy lesion screening. Medical Image Anal. https:\/\/doi.org\/10.1016\/j.media.2019.101618","journal-title":"Medical Image Anal"},{"key":"931_CR5","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1016\/j.neucom.2017.01.103","volume":"268","author":"ME Azami","year":"2017","unstructured":"Azami ME, Lartizien C, Canu S (2017) Converting SVDD scores into probability estimates: application to outlier detection. Neurocomputing 268:64\u201375. https:\/\/doi.org\/10.1016\/j.neucom.2017.01.103","journal-title":"Neurocomputing"},{"issue":"5","key":"931_CR6","doi-asserted-by":"publisher","first-page":"412","DOI":"10.1093\/bioinformatics\/16.5.412","volume":"16","author":"P Baldi","year":"2000","unstructured":"Baldi P, Brunak S, Chauvin Y et al (2000) Assessing the accuracy of prediction algorithms for classification: an overview. Bioinform 16(5):412\u2013424. https:\/\/doi.org\/10.1093\/bioinformatics\/16.5.412","journal-title":"Bioinform"},{"key":"931_CR7","volume-title":"Outliers in statistical data","author":"V Barnett","year":"1994","unstructured":"Barnett V, Lewis T (1994) Outliers in statistical data, 3rd edn. Wiley","edition":"3"},{"issue":"14","key":"931_CR8","doi-asserted-by":"publisher","first-page":"3188","DOI":"10.3390\/s19143188","volume":"19","author":"VH Bezerra","year":"2019","unstructured":"Bezerra VH, da Costa VGT, Junior SB et al (2019) IoTDS: a one-class classification approach to detect botnets in internet of things devices. Sensors 19(14):3188. https:\/\/doi.org\/10.3390\/s19143188","journal-title":"Sensors"},{"key":"931_CR9","volume-title":"Pattern recognition and machine learning","author":"CM Bishop","year":"2007","unstructured":"Bishop CM (2007) Pattern recognition and machine learning, 5th edn. Springer","edition":"5"},{"key":"931_CR10","doi-asserted-by":"publisher","unstructured":"Breunig MM, Kriegel H, Ng RT et al (2000) LOF: identifying density-based local outliers. In: Proceedings of the 2000 SIGMOD international conference on management of data. ACM, pp 93\u2013104. https:\/\/doi.org\/10.1145\/342009.335388","DOI":"10.1145\/342009.335388"},{"issue":"1","key":"931_CR11","doi-asserted-by":"publisher","first-page":"5:1","DOI":"10.1145\/2733381","volume":"10","author":"RJGB Campello","year":"2015","unstructured":"Campello RJGB, Moulavi D, Zimek A et al (2015) Hierarchical density estimates for data clustering, visualization, and outlier detection. ACM Trans Knowl Discov Data 10(1):5:1-5:51. https:\/\/doi.org\/10.1145\/2733381","journal-title":"ACM Trans Knowl Discov Data"},{"issue":"4","key":"931_CR12","doi-asserted-by":"publisher","first-page":"891","DOI":"10.1007\/s10618-015-0444-8","volume":"30","author":"GO Campos","year":"2016","unstructured":"Campos GO, Zimek A, Sander J et al (2016) On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study. Data Min Knowl Discov 30(4):891\u2013927. https:\/\/doi.org\/10.1007\/s10618-015-0444-8","journal-title":"Data Min Knowl Discov"},{"key":"931_CR13","doi-asserted-by":"crossref","unstructured":"Chalapathy R, Chawla S (2019) Deep learning for anomaly detection: a survey. CoRR arXiv:1901.03407","DOI":"10.1145\/3394486.3406704"},{"issue":"3","key":"931_CR14","doi-asserted-by":"publisher","first-page":"15:1","DOI":"10.1145\/1541880.1541882","volume":"41","author":"V Chandola","year":"2009","unstructured":"Chandola V, Banerjee A, Kumar V (2009) Anomaly detection: a survey. ACM Comput Surv 41(3):15:1-15:58. https:\/\/doi.org\/10.1145\/1541880.1541882","journal-title":"ACM Comput Surv"},{"key":"931_CR15","doi-asserted-by":"publisher","unstructured":"Chong YS, Tay YH (2017) Abnormal event detection in videos using spatiotemporal autoencoder. In: Proceedings of the 14th ISNN international symposium on neural networks, advances in neural networks. Springer, pp 189\u2013196. https:\/\/doi.org\/10.1007\/978-3-319-59081-3_23","DOI":"10.1007\/978-3-319-59081-3_23"},{"key":"931_CR16","doi-asserted-by":"publisher","unstructured":"Cormack GV, Clarke CLA, B\u00fcttcher S (2009) Reciprocal rank fusion outperforms condorcet and individual rank learning methods. In: Proceedings of the 32nd SIGIR international conference on research and development in information retrieval. ACM, pp 758\u201375., https:\/\/doi.org\/10.1145\/1571941.1572114","DOI":"10.1145\/1571941.1572114"},{"key":"931_CR17","unstructured":"de\u00a0Ridder D, Tax DMJ, Duin RPW (1998) An experimental comparison of one-class classification methods. In: Proceedings of the 4th ASCI advanced school for computing and imaging, pp 213\u2013218"},{"key":"931_CR18","first-page":"1","volume":"7","author":"J Demsar","year":"2006","unstructured":"Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1\u201330","journal-title":"J Mach Learn Res"},{"issue":"12","key":"931_CR19","doi-asserted-by":"publisher","first-page":"3490","DOI":"10.1016\/j.patcog.2013.05.022","volume":"46","author":"C D\u00e9sir","year":"2013","unstructured":"D\u00e9sir C, Bernard S, Petitjean C et al (2013) One class random forests. Pattern Recognit 46(12):3490\u20133506. https:\/\/doi.org\/10.1016\/j.patcog.2013.05.022","journal-title":"Pattern Recognit"},{"key":"931_CR20","unstructured":"Dheeru D, Karra\u00a0Taniskidou E (2017) UCI machine learning repository. http:\/\/archive.ics.uci.edu\/ml"},{"issue":"11","key":"931_CR21","doi-asserted-by":"publisher","first-page":"1175","DOI":"10.1109\/TC.1976.1674577","volume":"25","author":"RPW Duin","year":"1976","unstructured":"Duin RPW (1976) On the choice of smoothing parameters for Parzen estimators of probability density functions. IEEE Trans Comput 25(11):1175\u20131179. https:\/\/doi.org\/10.1109\/TC.1976.1674577","journal-title":"IEEE Trans Comput"},{"key":"931_CR22","doi-asserted-by":"crossref","unstructured":"Erfani SM, Baktashmotlagh M, Rajasegarar S et al (2015) R1SVM: a randomised nonlinear approach to large-scale anomaly detection. In: Proceedings of the 29th AAAI conference on artificial intelligence. AAAI Press, pp 432\u2013438","DOI":"10.1609\/aaai.v29i1.9208"},{"issue":"200","key":"931_CR23","doi-asserted-by":"publisher","first-page":"675","DOI":"10.1080\/01621459.1937.10503522","volume":"32","author":"M Friedman","year":"1937","unstructured":"Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc 32(200):675\u2013701. https:\/\/doi.org\/10.1080\/01621459.1937.10503522","journal-title":"J Am Stat Assoc"},{"issue":"4","key":"931_CR24","doi-asserted-by":"publisher","first-page":"463","DOI":"10.1109\/TSMCC.2011.2161285","volume":"42","author":"M Galar","year":"2012","unstructured":"Galar M, Fern\u00e1ndez A, Tartas EB et al (2012) A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Trans Syst Man Cybern 42(4):463\u2013484. https:\/\/doi.org\/10.1109\/TSMCC.2011.2161285","journal-title":"IEEE Trans Syst Man Cybern"},{"key":"931_CR25","doi-asserted-by":"publisher","unstructured":"Gao J, Tan P (2006) Converting output scores from outlier detection algorithms into probability estimates. In: Proceedings of the 6th ICDM international conference on data mining. IEEE Computer Society, pp 212\u2013221. https:\/\/doi.org\/10.1109\/ICDM.2006.43","DOI":"10.1109\/ICDM.2006.43"},{"key":"931_CR26","doi-asserted-by":"publisher","unstructured":"Ghafoori Z, Rajasegarar S, Erfani SM et al (2016) Unsupervised parameter estimation for one-class support vector machines. In: Proceedings of the 20th PAKDD Pacific-Asia conference on knowledge discovery and data mining, advances in knowledge discovery and data mining. Springer, pp 183\u2013195. https:\/\/doi.org\/10.1007\/978-3-319-31750-2_15","DOI":"10.1007\/978-3-319-31750-2_15"},{"key":"931_CR27","unstructured":"Gonz\u00e1lez F, Dasgupta D (2002) Neuro-immune and self-organizing map approaches to anomaly detection: a comparison. In: Proceedings of the 1st ICARIS international conference on artificial immune system, pp 203\u2013211"},{"key":"931_CR28","unstructured":"Goodfellow IJ, Pouget-Abadie J, Mirza M et al (2014) Generative adversarial nets. In: Proceedings of the 27th NIPS international conference on neural information processing systems, advances in neural information processing systems, pp 2672\u20132680"},{"key":"931_CR29","volume-title":"Deep learning","author":"IJ Goodfellow","year":"2016","unstructured":"Goodfellow IJ, Bengio Y, Courville AC (2016) Deep learning. MIT Press"},{"key":"931_CR30","volume-title":"Data mining: concepts and techniques","author":"J Han","year":"2011","unstructured":"Han J, Kamber M, Pei J (2011) Data mining: concepts and techniques, 3rd edn. Morgan Kaufmann","edition":"3"},{"key":"931_CR31","doi-asserted-by":"publisher","DOI":"10.1007\/978-94-015-3994-4","volume-title":"Identification of outliers","author":"DM Hawkins","year":"1980","unstructured":"Hawkins DM (1980) Identification of outliers. Chapman & Hall"},{"key":"931_CR32","doi-asserted-by":"publisher","unstructured":"Hempstalk K, Frank E, Witten IH (2008) One-class classification by combining density and class probability estimation. In: Proceedings of the ECML\/PKDD Joint European conference on machine learning and knowledge discovery in databases. Springer, pp 505\u2013519. https:\/\/doi.org\/10.1007\/978-3-540-87479-9_51","DOI":"10.1007\/978-3-540-87479-9_51"},{"key":"931_CR33","doi-asserted-by":"publisher","unstructured":"Hido S, Tsuboi Y, Kashima H et al (2008) Inlier-based outlier detection via direct density ratio estimation. In: Proceedings of the 8th ICDM international conference on data mining. IEEE Computer Society, pp 223\u2013232. https:\/\/doi.org\/10.1109\/ICDM.2008.49","DOI":"10.1109\/ICDM.2008.49"},{"issue":"5786","key":"931_CR34","doi-asserted-by":"publisher","first-page":"504","DOI":"10.1126\/science.1127647","volume":"313","author":"GE Hinton","year":"2006","unstructured":"Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504\u2013507. https:\/\/doi.org\/10.1126\/science.1127647","journal-title":"Science"},{"key":"931_CR35","doi-asserted-by":"publisher","DOI":"10.1002\/9780470434697","volume-title":"Robust statistics","author":"PJ Huber","year":"2009","unstructured":"Huber PJ, Ronchetti EM (2009) Robust statistics, 2nd edn. Wiley","edition":"2"},{"key":"931_CR36","unstructured":"Janssens JHM, Postma EO (2009) One-class classification with LOF and LOCI: an empirical comparison. In: Proceedings of the 18th annual Belgian-Dutch on machine learning, pp 56\u201364"},{"key":"931_CR37","doi-asserted-by":"publisher","unstructured":"Janssens JHM, Flesch I, Postma EO (2009) Outlier detection with one-class classifiers from ML and KDD. In: Proceedings of the 2009 ICMLA international conference on machine learning and applications. IEEE Computer Society, pp 147\u2013153. https:\/\/doi.org\/10.1109\/ICMLA.2009.16","DOI":"10.1109\/ICMLA.2009.16"},{"issue":"2","key":"931_CR38","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1007\/s10115-015-0851-6","volume":"47","author":"PA Jaskowiak","year":"2016","unstructured":"Jaskowiak PA, Moulavi D, Furtado ACS et al (2016) On strategies for building effective ensembles of relative clustering validity criteria. Knowl Inf Syst 47(2):329\u2013354. https:\/\/doi.org\/10.1007\/s10115-015-0851-6","journal-title":"Knowl Inf Syst"},{"key":"931_CR39","doi-asserted-by":"publisher","unstructured":"Joachims T (2006) Training linear SVMs in linear time. In: Proceedings of the 12th SIGKDD international conference on knowledge discovery and data mining. ACM, pp 217\u2013226. https:\/\/doi.org\/10.1145\/1150402.1150429","DOI":"10.1145\/1150402.1150429"},{"key":"931_CR40","unstructured":"Juszczak P (2006) Learning to recognise: a study on one-class classification and active learning. Ph.D. thesis, Delft University of Technology"},{"issue":"3","key":"931_CR41","doi-asserted-by":"publisher","first-page":"345","DOI":"10.1017\/S026988891300043X","volume":"29","author":"SS Khan","year":"2014","unstructured":"Khan SS, Madden MG (2014) One-class classification: taxonomy of study and review of techniques. Knowl Eng Rev 29(3):345\u2013374. https:\/\/doi.org\/10.1017\/S026988891300043X","journal-title":"Knowl Eng Rev"},{"key":"931_CR42","doi-asserted-by":"publisher","first-page":"1090","DOI":"10.1007\/978-1-4020-5614-7_2569","volume-title":"Encyclopedia of public health","author":"W Kirch","year":"2008","unstructured":"Kirch W (2008) Pearson\u2019s correlation coefficient. In: Kirch W (ed) Encyclopedia of public health. Springer, pp 1090\u20131091. https:\/\/doi.org\/10.1007\/978-1-4020-5614-7_2569"},{"issue":"2","key":"931_CR43","doi-asserted-by":"publisher","first-page":"427","DOI":"10.1007\/s10044-015-0505-z","volume":"20","author":"B Krawczyk","year":"2017","unstructured":"Krawczyk B, Cyganek B (2017) Selecting locally specialised classifiers for one-class classification ensembles. Pattern Anal Appl 20(2):427\u2013439. https:\/\/doi.org\/10.1007\/s10044-015-0505-z","journal-title":"Pattern Anal Appl"},{"key":"931_CR44","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1016\/j.neucom.2013.01.053","volume":"126","author":"B Krawczyk","year":"2014","unstructured":"Krawczyk B, Wo\u017aniak M (2014) Diversity measures for one-class classifier ensembles. Neurocomputing 126:36\u201344. https:\/\/doi.org\/10.1016\/j.neucom.2013.01.053","journal-title":"Neurocomputing"},{"key":"931_CR45","doi-asserted-by":"publisher","unstructured":"Krawczyk B, Schaefer G, Wozniak M (2013) Combining one-class classifiers for imbalanced classification of breast thermogram features. In: Proceedings of the 2013 4th CIMI international workshop on computational intelligence in medical imaging. IEEE, pp 36\u201341. https:\/\/doi.org\/10.1109\/CIMI.2013.6583855","DOI":"10.1109\/CIMI.2013.6583855"},{"key":"931_CR46","doi-asserted-by":"publisher","unstructured":"Kriegel H, Schubert M, Zimek A (2008) Angle-based outlier detection in high-dimensional data. In: Proceedings of the 14th SIGKDD international conference on knowledge discovery and data mining. ACM, pp 444\u2013452. https:\/\/doi.org\/10.1145\/1401890.1401946","DOI":"10.1145\/1401890.1401946"},{"key":"931_CR47","doi-asserted-by":"publisher","unstructured":"Kriegel H, Kr\u00f6ger P, Schubert E et al (2009) Outlier detection in axis-parallel subspaces of high dimensional data. In: Proceedings of the 13th PAKDD Pacific-Asia conference on knowledge discovery and data mining. Springer, pp 831\u2013838. https:\/\/doi.org\/10.1007\/978-3-642-01307-2_86","DOI":"10.1007\/978-3-642-01307-2_86"},{"key":"931_CR48","doi-asserted-by":"publisher","unstructured":"Kriegel H, Kr\u00f6ger P, Schubert E et al (2011) Interpreting and unifying outlier scores. In: Proceedings of the 11th SDM international conference on data mining. SIAM, pp 13\u201324. https:\/\/doi.org\/10.1137\/1.9781611972818.2","DOI":"10.1137\/1.9781611972818.2"},{"issue":"6","key":"931_CR49","doi-asserted-by":"publisher","first-page":"84","DOI":"10.1145\/3065386","volume":"60","author":"A Krizhevsky","year":"2017","unstructured":"Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84\u201390. https:\/\/doi.org\/10.1145\/3065386","journal-title":"Commun ACM"},{"issue":"Suppl 1","key":"931_CR50","doi-asserted-by":"publisher","first-page":"949","DOI":"10.1007\/s10586-017-1117-8","volume":"22","author":"D Kwon","year":"2019","unstructured":"Kwon D, Kim H, Kim J et al (2019) A survey of deep learning-based network anomaly detection. Clust Comput 22(Suppl 1):949\u2013961. https:\/\/doi.org\/10.1007\/s10586-017-1117-8","journal-title":"Clust Comput"},{"issue":"7553","key":"931_CR51","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y LeCun","year":"2015","unstructured":"LeCun Y, Bengio Y, Hinton GE (2015) Deep learning. Nature 521(7553):436\u2013444. https:\/\/doi.org\/10.1038\/nature14539","journal-title":"Nature"},{"key":"931_CR52","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1016\/j.media.2017.07.005","volume":"42","author":"G Litjens","year":"2017","unstructured":"Litjens G, Kooi T, Bejnordi BE et al (2017) A survey on deep learning in medical image analysis. Med Image Anal 42:60\u201388. https:\/\/doi.org\/10.1016\/j.media.2017.07.005","journal-title":"Med Image Anal"},{"key":"931_CR53","doi-asserted-by":"publisher","unstructured":"Liu FT, Ting KM, Zhou Z (2008) Isolation forest. In: Proceedings of the 8th ICDM international conference on data mining. IEEE Computer Society, pp 413\u2013422. https:\/\/doi.org\/10.1109\/ICDM.2008.17","DOI":"10.1109\/ICDM.2008.17"},{"issue":"1","key":"931_CR54","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1145\/2133360.2133363","volume":"6","author":"FT Liu","year":"2012","unstructured":"Liu FT, Ting KM, Zhou Z (2012) Isolation-based anomaly detection. ACM Trans Knowl Discov Data 6(1):31\u2013339. https:\/\/doi.org\/10.1145\/2133360.2133363","journal-title":"ACM Trans Knowl Discov Data"},{"issue":"8","key":"931_CR55","doi-asserted-by":"publisher","first-page":"1517","DOI":"10.1109\/TKDE.2019.2905606","volume":"32","author":"Y Liu","year":"2020","unstructured":"Liu Y, Li Z, Zhou C et al (2020) Generative adversarial active learning for unsupervised outlier detection. IEEE Trans Knowl Data Eng 32(8):1517\u20131528. https:\/\/doi.org\/10.1109\/TKDE.2019.2905606","journal-title":"IEEE Trans Knowl Data Eng"},{"issue":"12","key":"931_CR56","doi-asserted-by":"publisher","first-page":"2481","DOI":"10.1016\/j.sigpro.2003.07.018","volume":"83","author":"M Markou","year":"2003","unstructured":"Markou M, Singh S (2003) Novelty detection: a review\u2014part 1: statistical approaches. Signal Process 83(12):2481\u20132497. https:\/\/doi.org\/10.1016\/j.sigpro.2003.07.018","journal-title":"Signal Process"},{"key":"931_CR57","unstructured":"Marques HO (2019) Evaluation and model selection for unsupervised outlier detection and one-class classification. Ph.D. thesis, University of S\u00e3o Paulo. http:\/\/www.teses.usp.br\/teses\/disponiveis\/55\/55134\/tde-07012020-105601"},{"issue":"2","key":"931_CR58","doi-asserted-by":"publisher","first-page":"442","DOI":"10.1016\/0005-2795(75)90109-9","volume":"405","author":"BW Matthews","year":"1975","unstructured":"Matthews BW (1975) Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta (BBA) Protein Struct 405(2):442\u2013451. https:\/\/doi.org\/10.1016\/0005-2795(75)90109-9","journal-title":"Biochim Biophys Acta (BBA) Protein Struct"},{"issue":"1","key":"931_CR59","doi-asserted-by":"publisher","first-page":"10:1","DOI":"10.1145\/2379776.2379786","volume":"45","author":"J Mendes-Moreira","year":"2012","unstructured":"Mendes-Moreira J, Soares C, Jorge AM et al (2012) Ensemble approaches for regression: a survey. ACM Comput Surv 45(1):10:1-10:40. https:\/\/doi.org\/10.1145\/2379776.2379786","journal-title":"ACM Comput Surv"},{"issue":"4","key":"931_CR60","doi-asserted-by":"publisher","first-page":"525","DOI":"10.1016\/S0893-6080(05)80056-5","volume":"6","author":"MF M\u00f8ller","year":"1993","unstructured":"M\u00f8ller MF (1993) A scaled conjugate gradient algorithm for fast supervised learning. Neural Netw 6(4):525\u2013533. https:\/\/doi.org\/10.1016\/S0893-6080(05)80056-5","journal-title":"Neural Netw"},{"key":"931_CR61","unstructured":"Nem\u00e9nyi P (1963) Distribution-free multiple comparisons. Ph.D. thesis, Princeton University"},{"issue":"2","key":"931_CR62","doi-asserted-by":"publisher","first-page":"38:1","DOI":"10.1145\/3439950","volume":"54","author":"G Pang","year":"2022","unstructured":"Pang G, Shen C, Cao L et al (2022) Deep learning for anomaly detection: a review. ACM Comput Surv 54(2):38:1-38:38. https:\/\/doi.org\/10.1145\/3439950","journal-title":"ACM Comput Surv"},{"key":"931_CR63","doi-asserted-by":"publisher","unstructured":"Papadimitriou S, Kitagawa H, Gibbons PB et al (2003) LOCI: fast outlier detection using the local correlation integral. In: Proceedings of the 19th ICDE international conference on data engineering. IEEE Computer Society, pp 315\u2013326. https:\/\/doi.org\/10.1109\/ICDE.2003.1260802","DOI":"10.1109\/ICDE.2003.1260802"},{"issue":"3","key":"931_CR64","doi-asserted-by":"publisher","first-page":"1065","DOI":"10.1214\/aoms\/1177704472","volume":"33","author":"E Parzen","year":"1962","unstructured":"Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33(3):1065\u20131076","journal-title":"Ann Math Stat"},{"key":"931_CR65","unstructured":"Pekalska E, Tax DMJ, Duin RPW (2002) One-class LP classifiers for dissimilarity representations. In: Proceedings of the 15th NIPS international conference on neural information processing systems, advances in neural information processing systems. MIT Press, pp 761\u2013768"},{"issue":"11","key":"931_CR66","doi-asserted-by":"publisher","first-page":"5450","DOI":"10.1109\/TIP.2019.2917862","volume":"28","author":"P Perera","year":"2019","unstructured":"Perera P, Patel VM (2019) Learning deep features for one-class classification. IEEE Trans Image Process 28(11):5450\u20135463. https:\/\/doi.org\/10.1109\/TIP.2019.2917862","journal-title":"IEEE Trans Image Process"},{"key":"931_CR67","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1016\/j.sigpro.2013.12.026","volume":"99","author":"MAF Pimentel","year":"2014","unstructured":"Pimentel MAF, Clifton DA, Clifton LA et al (2014) A review of novelty detection. Signal Process 99:215\u2013249. https:\/\/doi.org\/10.1016\/j.sigpro.2013.12.026","journal-title":"Signal Process"},{"key":"931_CR68","doi-asserted-by":"publisher","first-page":"61","DOI":"10.7551\/mitpress\/1113.003.0008","volume-title":"Advances in large-margin classifiers","author":"JC Platt","year":"2000","unstructured":"Platt JC (2000) Probabilities for SV machines. In: Smola AJ, Bartlett P, Sch\u00f6lkopf B et al (eds) Advances in large-margin classifiers. MIT Press, pp 61\u201374. https:\/\/doi.org\/10.7551\/mitpress\/1113.003.0008"},{"key":"931_CR69","doi-asserted-by":"publisher","unstructured":"Ramaswamy S, Rastogi R, Shim K (2000) Efficient algorithms for mining outliers from large data sets. In: Proceedings of the 2000 SIGMOD international conference on management of data. ACM, pp 427\u2013438. https:\/\/doi.org\/10.1145\/342009.335437","DOI":"10.1145\/342009.335437"},{"issue":"8","key":"931_CR70","doi-asserted-by":"publisher","first-page":"2491","DOI":"10.3390\/s18082491","volume":"18","author":"DT Ramotsoela","year":"2018","unstructured":"Ramotsoela DT, Abu-Mahfouz AM, Hancke GP (2018) A survey of anomaly detection in industrial wireless sensor networks with critical water system infrastructure as a case study. Sensors 18(8):2491. https:\/\/doi.org\/10.3390\/s18082491","journal-title":"Sensors"},{"issue":"101","key":"931_CR71","doi-asserted-by":"publisher","first-page":"715","DOI":"10.1016\/j.cose.2020.101715","volume":"91","author":"J Rodr\u00edguez-Ruiz","year":"2020","unstructured":"Rodr\u00edguez-Ruiz J, Mata-S\u00e1nchez JI, Monroy R et al (2020) A one-class classification approach for bot detection on twitter. Comput Secur 91(101):715. https:\/\/doi.org\/10.1016\/j.cose.2020.101715","journal-title":"Comput Secur"},{"issue":"1\u20132","key":"931_CR72","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10462-009-9124-7","volume":"33","author":"L Rokach","year":"2010","unstructured":"Rokach L (2010) Ensemble-based classifiers. Artif Intell Rev 33(1\u20132):1\u201339. https:\/\/doi.org\/10.1007\/s10462-009-9124-7","journal-title":"Artif Intell Rev"},{"key":"931_CR73","unstructured":"Ruff L, G\u00f6rnitz N, Deecke L et al (2018) Deep one-class classification. In: Proceedings of the 35th ICML international conference on machine learning. PMLR, pp 4390\u20134399"},{"key":"931_CR74","unstructured":"Ruff L, Vandermeulen RA, G\u00f6rnitz N et al (2020) Deep semi-supervised anomaly detection. In: Proceedings of the 8th ICLR international conference on learning representations. OpenReview.net"},{"key":"931_CR75","doi-asserted-by":"publisher","unstructured":"Schlegl T, Seeb\u00f6ck P, Waldstein SM et al (2017) Unsupervised anomaly detection with generative adversarial networks to guide marker discovery. In: Proceedings of the 25th IPMI international conference on information processing in medical imaging. Springer, pp 146\u2013157. https:\/\/doi.org\/10.1007\/978-3-319-59050-9_12","DOI":"10.1007\/978-3-319-59050-9_12"},{"issue":"7","key":"931_CR76","doi-asserted-by":"publisher","first-page":"1443","DOI":"10.1162\/089976601750264965","volume":"13","author":"B Sch\u00f6lkopf","year":"2001","unstructured":"Sch\u00f6lkopf B, Platt JC, Shawe-Taylor J et al (2001) Estimating the support of a high-dimensional distribution. Neural Comput 13(7):1443\u20131471. https:\/\/doi.org\/10.1162\/089976601750264965","journal-title":"Neural Comput"},{"issue":"1","key":"931_CR77","doi-asserted-by":"publisher","first-page":"190","DOI":"10.1007\/s10618-012-0300-z","volume":"28","author":"E Schubert","year":"2014","unstructured":"Schubert E, Zimek A, Kriegel H (2014) Local outlier detection reconsidered: a generalized view on locality with applications to spatial, video, and network outlier detection. Data Min Knowl Discov 28(1):190\u2013237. https:\/\/doi.org\/10.1007\/s10618-012-0300-z","journal-title":"Data Min Knowl Discov"},{"key":"931_CR78","doi-asserted-by":"publisher","unstructured":"Schubert E, Weiler M, Zimek A (2015) Outlier detection and trend detection: two sides of the same coin. In: ICDMW international conference on data mining workshop. IEEE Computer Society, pp 40\u201346. https:\/\/doi.org\/10.1109\/ICDMW.2015.79","DOI":"10.1109\/ICDMW.2015.79"},{"key":"931_CR79","unstructured":"Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: Proceedings of the 3rd ICLR international conference on learning representations"},{"issue":"1","key":"931_CR80","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1023\/A:1008940618127","volume":"10","author":"P Smyth","year":"2000","unstructured":"Smyth P (2000) Model selection for probabilistic clustering using cross-validated likelihood. Stat Comput 10(1):63\u201372. https:\/\/doi.org\/10.1023\/A:1008940618127","journal-title":"Stat Comput"},{"key":"931_CR81","doi-asserted-by":"publisher","unstructured":"Spinosa EJ, de\u00a0Leon Ferreira\u00a0de Carvalho ACP (2005) Combining one-class classifiers for robust novelty detection in gene expression data. In: Proceedings of the 2005 BSB Brazilian symposium on bioinformatics, advances in bioinformatics and computational biology. Springer, pp 54\u201364. https:\/\/doi.org\/10.1007\/11532323_7","DOI":"10.1007\/11532323_7"},{"key":"931_CR82","first-page":"583","volume":"3","author":"A Strehl","year":"2002","unstructured":"Strehl A, Ghosh J (2002) Cluster ensembles\u2014a knowledge reuse framework for combining multiple partitions. J Mach Learn Res 3:583\u2013617","journal-title":"J Mach Learn Res"},{"key":"931_CR83","doi-asserted-by":"publisher","unstructured":"Swersky L, Marques HO, Sander J et al (2016) On the evaluation of outlier detection and one-class classification methods. In: Proceedings of the 2016 DSAA international conference on data science and advanced analytics. IEEE, pp 1\u201310. https:\/\/doi.org\/10.1109\/DSAA.2016.8","DOI":"10.1109\/DSAA.2016.8"},{"key":"931_CR84","volume-title":"Introduction to data mining","author":"PN Tan","year":"2005","unstructured":"Tan PN, Steinbach M, Kumar V (2005) Introduction to data mining. Addison Wesley"},{"key":"931_CR85","unstructured":"Tax DMJ (2001) One-class classification. Ph.D. thesis, Delft University of Technology"},{"key":"931_CR86","doi-asserted-by":"publisher","unstructured":"Tax DMJ, Duin RPW (2001a) Combining one-class classifiers. In: Proceedings of the 2nd MCS international workshop on multiple classifier systems. Springer, pp 299\u2013308. https:\/\/doi.org\/10.1007\/3-540-48219-9_30","DOI":"10.1007\/3-540-48219-9_30"},{"key":"931_CR87","first-page":"155","volume":"2","author":"DMJ Tax","year":"2001","unstructured":"Tax DMJ, Duin RPW (2001b) Uniform object generation for optimizing one-class classifiers. J Mach Learn Res 2:155\u2013173","journal-title":"J Mach Learn Res"},{"issue":"1","key":"931_CR88","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1023\/B:MACH.0000008084.60811.49","volume":"54","author":"DMJ Tax","year":"2004","unstructured":"Tax DMJ, Duin RPW (2004) Support vector data description. Mach Learn 54(1):45\u201366. https:\/\/doi.org\/10.1023\/B:MACH.0000008084.60811.49","journal-title":"Mach Learn"},{"key":"931_CR89","doi-asserted-by":"publisher","unstructured":"Tax DMJ, M\u00fcller K (2004) A consistency-based model selection for one-class classification. In: Proceedings of the 17th ICPR international conference on pattern recognition. IEEE Computer Society, pp 363\u2013366. https:\/\/doi.org\/10.1109\/ICPR.2004.1334542","DOI":"10.1109\/ICPR.2004.1334542"},{"key":"931_CR90","doi-asserted-by":"publisher","first-page":"3371","DOI":"10.5555\/1756006.1953039","volume":"11","author":"P Vincent","year":"2010","unstructured":"Vincent P, Larochelle H, Lajoie I et al (2010) Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J Mach Learn Res 11:3371\u20133408. https:\/\/doi.org\/10.5555\/1756006.1953039","journal-title":"J Mach Learn Res"},{"key":"931_CR91","doi-asserted-by":"publisher","first-page":"198","DOI":"10.1016\/j.patcog.2017.09.012","volume":"74","author":"S Wang","year":"2018","unstructured":"Wang S, Liu Q, Zhu E et al (2018) Hyperparameter selection of one-class support vector machine by self-adaptive data shifting. Pattern Recognit 74:198\u2013211. https:\/\/doi.org\/10.1016\/j.patcog.2017.09.012","journal-title":"Pattern Recognit"},{"issue":"5","key":"931_CR92","doi-asserted-by":"publisher","first-page":"927","DOI":"10.1109\/TCYB.2014.2340433","volume":"45","author":"Y Xiao","year":"2015","unstructured":"Xiao Y, Wang H, Xu W (2015) Parameter selection of Gaussian kernel for one-class SVM. IEEE Trans Cybern 45(5):927\u2013939. https:\/\/doi.org\/10.1109\/TCYB.2014.2340433","journal-title":"IEEE Trans Cybern"},{"issue":"10","key":"931_CR93","doi-asserted-by":"publisher","first-page":"977","DOI":"10.1093\/bioinformatics\/17.10.977","volume":"17","author":"KY Yeung","year":"2001","unstructured":"Yeung KY, Fraley C, Murua A et al (2001) Model-based clustering and data transformations for gene expression data. Bioinformatics 17(10):977\u2013987. https:\/\/doi.org\/10.1093\/bioinformatics\/17.10.977","journal-title":"Bioinformatics"},{"issue":"5","key":"931_CR94","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/gb-2003-4-5-r34","volume":"4","author":"KY Yeung","year":"2003","unstructured":"Yeung KY, Medvedovic M, Bumgarner RE (2003) Clustering gene-expression data with repeated measurements. Genome Biol 4(5):1\u201317. https:\/\/doi.org\/10.1186\/gb-2003-4-5-r34","journal-title":"Genome Biol"},{"key":"931_CR95","doi-asserted-by":"publisher","unstructured":"Zhang J, Lu J, Zhang G (2011) Combining one class classification models for avian influenza outbreaks. In: Proceedings of the 2011 MCDM symposium on computational intelligence in multicriteria decision-making. IEEE, pp 190\u2013196. https:\/\/doi.org\/10.1109\/SMDCM.2011.5949278","DOI":"10.1109\/SMDCM.2011.5949278"},{"key":"931_CR96","doi-asserted-by":"publisher","DOI":"10.1201\/b12207","volume-title":"Ensemble methods: foundations and algorithms","author":"ZH Zhou","year":"2012","unstructured":"Zhou ZH (2012) Ensemble methods: foundations and algorithms. Chapman & Hall"},{"key":"931_CR97","doi-asserted-by":"publisher","DOI":"10.1002\/widm.1280","author":"A Zimek","year":"2018","unstructured":"Zimek A, Filzmoser P (2018) There and back again: outlier detection between statistical reasoning and data mining algorithms. WIREs Data Mining Knowl Discov. https:\/\/doi.org\/10.1002\/widm.1280","journal-title":"WIREs Data Mining Knowl Discov"},{"issue":"5","key":"931_CR98","doi-asserted-by":"publisher","first-page":"363","DOI":"10.1002\/sam.11161","volume":"5","author":"A Zimek","year":"2012","unstructured":"Zimek A, Schubert E, Kriegel H (2012) A survey on unsupervised outlier detection in high-dimensional numerical data. Stat Anal Data Min 5(5):363\u2013387. https:\/\/doi.org\/10.1002\/sam.11161","journal-title":"Stat Anal Data Min"},{"issue":"1","key":"931_CR99","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1145\/2594473.2594476","volume":"15","author":"A Zimek","year":"2013","unstructured":"Zimek A, Campello RJGB, Sander J (2013a) Ensembles for unsupervised outlier detection: challenges and research questions a position paper. SIGKDD Explor 15(1):11\u201322. https:\/\/doi.org\/10.1145\/2594473.2594476","journal-title":"SIGKDD Explor"},{"key":"931_CR100","doi-asserted-by":"publisher","unstructured":"Zimek A, Gaudet M, Campello RJGB et al (2013b) Subsampling for efficient and effective unsupervised outlier detection ensembles. In: Proceedings of the 19th SIGKDD international conference on knowledge discovery and data mining. ACM, pp 428\u2013436. https:\/\/doi.org\/10.1145\/2487575.2487676","DOI":"10.1145\/2487575.2487676"},{"key":"931_CR101","doi-asserted-by":"publisher","unstructured":"Zimek A, Campello RJGB, Sander J (2014) Data perturbation for outlier detection ensembles. In: Proceedings of the 2014 SSDBM conference on scientific and statistical database management. ACM, pp 13:1\u201313:12. https:\/\/doi.org\/10.1145\/2618243.2618257","DOI":"10.1145\/2618243.2618257"}],"container-title":["Data Mining and Knowledge Discovery"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10618-023-00931-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10618-023-00931-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10618-023-00931-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,7,6]],"date-time":"2023-07-06T17:16:22Z","timestamp":1688663782000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10618-023-00931-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,16]]},"references-count":101,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2023,7]]}},"alternative-id":["931"],"URL":"https:\/\/doi.org\/10.1007\/s10618-023-00931-x","relation":{},"ISSN":["1384-5810","1573-756X"],"issn-type":[{"value":"1384-5810","type":"print"},{"value":"1573-756X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,5,16]]},"assertion":[{"value":"23 February 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 February 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 May 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have no conflict of interest to declare that are relevant to the content of this article.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}