Abstract
Leaf disease classification assigns diseased leaves to classes based on visual properties such as texture, shape, and color. Although deep learning (DL) features perform very well on this task, several authors have obtained comparable accuracy with handcrafted features. In this paper, we likewise combine handcrafted features with shallow machine learning (ML) classifiers, aiming for accuracy comparable to that of DL models. The approach is applied to leaf disease detection and classification for three species: tomato, bell pepper, and potato. Image features are extracted with a 3-level 2D discrete wavelet transform (2D-DWT), and their dimensionality is reduced with principal component analysis (PCA). We use stratified K-fold validation because the dataset is small and the class ratio must be preserved across folds. Random Forest (RF) and XGBoost are used for classification. The proposed method consists of four steps: image pre-processing, feature extraction, feature reduction, and classification. We compare the classification accuracy of the proposed model with that reported in several published works. On Datasets 1, 2, 3, and 4, the RF classifier achieves accuracies of 98.45%, 100%, 98.33%, and 98.55%, respectively, while XGBoost achieves 99.11%, 98.72%, 98.23%, and 97.73%.
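To make two of the steps named in the abstract concrete, the following is a minimal sketch of a 3-level 2D DWT feature extractor and a stratified K-fold splitter, written in plain Python. It is not the authors' implementation. The Haar wavelet, the use of the level-3 approximation band as the feature vector, the round-robin fold assignment, and all function names (`haar_step`, `haar_dwt2`, `dwt_features`, `stratified_kfold`) are assumptions made for illustration; the abstract does not specify any of them. PCA and the RF/XGBoost classifiers are omitted.

```python
import math
from collections import defaultdict


def haar_step(signal):
    """One level of the 1-D orthonormal Haar transform (assumed wavelet).

    Returns (approximation, detail). The signal length must be even.
    """
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def transpose(matrix):
    return [list(col) for col in zip(*matrix)]


def split_columns(matrix):
    """Apply haar_step to every column; return (low, high) matrices."""
    low_cols, high_cols = [], []
    for col in transpose(matrix):
        a, d = haar_step(col)
        low_cols.append(a)
        high_cols.append(d)
    return transpose(low_cols), transpose(high_cols)


def haar_dwt2(image):
    """Single-level 2-D Haar DWT: filter rows first, then columns.

    Returns the approximation band followed by three detail bands.
    """
    row_low, row_high = [], []
    for row in image:
        a, d = haar_step(row)
        row_low.append(a)
        row_high.append(d)
    approx, detail_1 = split_columns(row_low)
    detail_2, detail_3 = split_columns(row_high)
    return approx, detail_1, detail_2, detail_3


def dwt_features(image, levels=3):
    """Decompose `levels` times and flatten the final approximation band.

    Both image sides must be divisible by 2**levels.
    """
    approx = image
    for _ in range(levels):
        approx, _d1, _d2, _d3 = haar_dwt2(approx)
    return [value for row in approx for value in row]


def stratified_kfold(labels, k):
    """Yield (train_idx, test_idx) pairs that keep the class ratio per fold.

    Samples of each class are dealt round-robin across the k folds
    (no shuffling, an assumption made to keep the sketch deterministic).
    """
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        for pos, idx in enumerate(indices):
            folds[pos % k].append(idx)
    for i in range(k):
        test = sorted(folds[i])
        train = sorted(idx for j, fold in enumerate(folds) if j != i for idx in fold)
        yield train, test
```

In this sketch, an 8x8 grayscale image reduces to a single approximation coefficient after three levels, and a label list with a 2:1 class ratio yields test folds with the same 2:1 ratio. A real pipeline would feed the flattened coefficients into PCA before training the classifiers on each training fold.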
Data availability
The datasets used in the current study are available in the GitHub repository https://github.com/MonuBhagat11/Leafdisease_data.
Abbreviations
- CNN: Convolutional Neural Network
- SVM: Support Vector Machine
- KNN: K-Nearest Neighbor
- ROC: Receiver Operating Characteristic
- DL: Deep Learning
- HIS: Hyperspectral Imaging
- MCC: Moving Center Classifier
- LDC: Linear Discriminant Classifier
- FNN: Fuzzy Neural Network
- RF: Random Forest
- EM: Expectation-Maximization
- CCD: Centroid Contour Distance Curve
- ML: Machine Learning
- CA: Classification Accuracy
- HSV: Hue Saturation Value
- RGB: Red Green Blue
- LBPs: Local Binary Patterns
- GLCM: Grey-Level Co-occurrence Matrix
- SGDM: Spatial Gray Level Dependence Matrix
- SIFT: Scale-Invariant Feature Transform
- MSF: Multiscale Fusion
- GWO: Grey Wolf Optimizer
- GSA: Gravitational Search Algorithm
- RGA: Real Coded Genetic Algorithm
- CHT: Circular Hough Transform
- DRNN: Deep Residual Neural Network
- ABC: Artificial Bee Colony Optimization
- IGA: Improved Genetic Algorithm
- PCA: Principal Component Analysis
- ANN: Artificial Neural Networks
- GA: Genetic Algorithm
- PNN: Probabilistic Neural Network
- SURF: Speeded Up Robust Features
- ACH: Angle Code Histogram
- BoWs: Bag-of-Words
- CSM: Chaotic Spider Monkey
- DWT: Discrete Wavelet Transform
- PSO: Particle Swarm Optimization
- YOP: Year of Publication
- FRT: Feature Reduction Technique
- DiffN: Difference of Normal Orientations
- BRBFNN: Bacteria Foraging Algorithm Radial Basis Function Neural Network
- CFO: Central Force Optimization
- MDM: Multiscale Distance Matrix
- DNM: Decomposed Newton’s Method
- MMC: Maximum Margin Criterion
- MTS: Mahalanobis Taguchi System
Acknowledgements
The authors would like to express their gratitude to the esteemed reviewers for their insightful and useful suggestions.
Funding
No public, private, or charitable organizations provided any direct funding for this investigation.
Ethics declarations
Conflict of interest
The authors have stated that there are no competing interests.
About this article
Cite this article
Bhagat, M., Kumar, D. Performance evaluation of PCA based reduced features of leaf images extracted by DWT using random Forest and XGBoost classifier. Multimed Tools Appl 82, 26225–26254 (2023). https://doi.org/10.1007/s11042-023-14370-9
DOI: https://doi.org/10.1007/s11042-023-14370-9