Abstract
Neuroblastoma is the most common cancer in young children accounting for over 15% of deaths in children due to cancer. Identification of the class of neuroblastoma is dependent on histopathological classification performed by pathologists which are considered the gold standard. However, due to the heterogeneous nature of neuroblast tumours, the human eye can miss critical visual features in histopathology. Hence, the use of computer-based models can assist pathologists in classification through mathematical analysis. There is no publicly available dataset containing neuroblastoma histopathological images. So, this study uses dataset gathered from The Tumour Bank at Kids Research at The Children’s Hospital at Westmead, which has been used in previous research. Previous work on this dataset has shown maximum accuracy of 84%. One main issue that previous research fails to address is the class imbalance problem that exists in the dataset as one class represents over 50% of the samples. This study explores a range of feature extraction and data undersampling and over-sampling techniques to improve classification accuracy. Using these methods, this study was able to achieve accuracy of over 90% in the dataset. Moreover, significant improvements observed in this study were in the minority classes where previous work failed to achieve high level of classification accuracy. In doing so, this study shows importance of effective management of available data for any application of machine learning.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Park, J.R., Eggert, A., Caron, H.: Neuroblastoma: biology, prognosis, and treatment. Pediatr. Clin. North Am. 55(1), 97–120 (2008)
Maris, J.M.: Recent advances in neuroblastoma. N. Engl. J. Med. 362(23), 2202–2211 (2010)
Shimada, H., et al.: The international neuroblastoma pathology classification (the Shimada system). Cancer Interdisc. Int. J. Am. Cancer Soc. 86(2), 364–372 (1999)
Maris, J.M., Matthay, K.K.: Molecular biology of neuroblastoma. J. Clin. Oncol. 17(7), 2264 (1999)
Gheisari, S., Catchpoole, D.R., Charlton, A., Kennedy, P.J.: Patched completed local binary pattern is an effective method for neuroblastoma histological image classification. In: Boo, Y.L., Stirling, D., Chi, L., Liu, L., Ong, K.-L., Williams, G. (eds.) AusDM 2017. CCIS, vol. 845, pp. 57–71. Springer, Singapore (2018). https://doi.org/10.1007/978-981-13-0292-3_4
Gheisari, S., Catchpoole, D.R., Charlton, A., Kennedy, P.J.: Convolutional deep belief network with feature encoding for classification of neuroblastoma histological images. J. Pathol. Inf. 9, 17 (2018)
Gheisari, S., Catchpoole, D., Charlton, A., Melegh, Z., Gradhand, E., Kennedy, P.: Computer aided classification of neuroblastoma histological images using scale invariant feature transform with feature encoding. Diagnostics 8(3), 56 (2018)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)
Yang, J., Jiang, Y.-G., Hauptmann, A.G., Ngo, C.-W.: Evaluating bag-of-visual-words representations in scene classification. In: Proceedings of the International Workshop on Workshop on Multimedia Information Retrieval, pp. 197–206. ACM (2007)
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006). https://doi.org/10.1007/11744023_32
Panchal, P., Panchal, S., Shah, S.: A comparison of SIFT and SURF. Int. J. Innov. Res. Comput. Commun. Eng. 1(2), 323–327 (2013)
Lenc, L., Král, P.: A combined SIFT/SURF descriptor for automatic face recognition. In: Sixth International Conference on Machine Vision (ICMV 2013), vol. 9067. International Society for Optics and Photonics, p. 90672C (2013)
Chawla, N.V., Japkowicz, N., Kotcz, A.: Special issue on learning from imbalanced data sets. ACM SIGKDD Explor. Newsl. 6(1), 1–6 (2004)
Ramyachitra, D., Manikandan, P.: Imbalanced dataset classification and solutions: a review. Int. J. Comput. Bus. Res. (IJCBR) 5(4) (2014)
Kotsiantis, S., Kanellopoulos, D., Pintelas, P.: Handling imbalanced datasets: a review. GESTS Int. Trans. Comput. Sci. Eng. 30(1), 25–36 (2006)
Barlow, H., Mao, S., Khushi, M.: Predicting high-risk prostate cancer using machine learning methods. Data 4, 129 (2019)
Khushi, M., Clarke, C.L., Graham, J.D.: Bioinformatic analysis of cis-regulatory interactions between progesterone and estrogen receptors in breast cancer. PeerJ 2, e654 (2014). https://doi.org/10.7717/peerj.654
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Panta, A., Khushi, M., Naseem, U., Kennedy, P., Catchpoole, D. (2020). Classification of Neuroblastoma Histopathological Images Using Machine Learning. In: Yang, H., Pasupa, K., Leung, A.CS., Kwok, J.T., Chan, J.H., King, I. (eds) Neural Information Processing. ICONIP 2020. Lecture Notes in Computer Science(), vol 12534. Springer, Cham. https://doi.org/10.1007/978-3-030-63836-8_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-63836-8_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63835-1
Online ISBN: 978-3-030-63836-8
eBook Packages: Computer ScienceComputer Science (R0)