Does feature selection improve classification accuracy? Impact of sample size and feature selection on classification using anatomical magnetic resonance images

doi:10.1016/j.neuroimage.2011.11.066

Comparative Study

. 2012 Mar;60(1):59-70.

doi: 10.1016/j.neuroimage.2011.11.066. Epub 2011 Dec 1.

Does feature selection improve classification accuracy? Impact of sample size and feature selection on classification using anatomical magnetic resonance images

Carlton Chu¹, Ai-Ling Hsu, Kun-Hsien Chou, Peter Bandettini, Chingpo Lin; Alzheimer's Disease Neuroimaging Initiative

Collaborators, Affiliations

Collaborators

Alzheimer's Disease Neuroimaging Initiative:
Michael Weiner, Paul Aisen, Michael Weiner, Paul Aisen, Ronald Petersen, Clifford R Jack Jr, William Jagust, John Q Trojanowki, Laurel Beckett, Robert C Green, Andrew J Saykin, John Morris, Enchi Liu, Robert C Green, Tom Montine, Ronald Petersen, Paul Aisen, Anthony Gamst, Ronald G Thomas, Michael Donohue, Sarah Walter, Devon Gessert, Tamie Sather, Laurel Beckett, Danielle Harvey, Anthony Gamst, Michael Donohue, John Kornak, Clifford R Jack Jr, Anders Dale, Matthew Bernstein, Joel Felmlee, Nick Fox, Paul Thompson, Norbert Schuff, Gene Alexander, Charles DeCarli, William Jagust, Dan Bandy, Robert A Koeppe, Norm Foster, Eric M Reiman, Kewei Chen, Chet Mathis, John Morris, Nigel J Cairns, Lisa Taylor-Reinwald, J Q Trojanowki, Les Shaw, Virginia M Y Lee, Magdalena Korecka, Arthur W Toga, Karen Crawford, Scott Neu, Andrew J Saykin, Tatiana M Foroud, Steven Potkin, Li Shen, Zaven Kachaturian, Richard Frank, Peter J Snyder, Susan Molchan, Jeffrey Kaye, Joseph Quinn, Betty Lind, Sara Dolen, Lon S Schneider, Sonia Pawluczyk, Bryan M Spann, James Brewer, Helen Vanderswag, Judith L Heidebrink, Joanne L Lord, Ronald Petersen, Kris Johnson, Rachelle S Doody, Javier Villanueva-Meyer, Munir Chowdhury, Yaakov Stern, Lawrence S Honig, Karen L Bell, John C Morris, Beau Ances, Maria Carroll, Sue Leon, Mark A Mintun, Stacy Schneider, Daniel Marson, Randall Griffith, David Clark, Hillel Grossman, Effie Mitsis, Aliza Romirowsky, Leyla deToledo-Morrell, Raj C Shah, Ranjan Duara, Daniel Varon, Peggy Roberts, Marilyn Albert, Stephanie Kielb, Henry Rusinek, Mony J de Leon, Lidia Glodzik, P Murali Doraiswamy, Jeffrey R Petrella, R Edward Coleman, Steven E Arnold, Jason H Karlawish, David Wolk, Charles D Smith, Greg Jicha, Peter Hardy, Oscar L Lopez, MaryAnn Oakley, Donna M Simpson, Anton P Porsteinsson, Bonnie S Goldstein, Kim Martin, Kelly M Makino, M Saleem Ismail, Connie Brand, Ruth A Mulnard, Gaby Thai, Catherine Mc-Adams-Ortiz, Ramon Diaz-Arrastia, Kristen Martin-Cook, Michael DeVous, Allan I Levey, James J Lah, Janet S Cellar, Jeffrey M Burns, Heather S Anderson, Russell H Swerdlow, Liana Apostolova, Po H Lu, George Bartzokis, Daniel H S Silverman, Neill R Graff-Radford, Francine Parfitt, Heather Johnson, Martin Farlow, Scott Herring, Ann M Hake, Christopher H van Dyck, Richard E Carson, Martha G MacAvoy, Howard Chertkow, Howard Bergman, Chris Hosein, Sandra Black, Bojana Stefanovic, Curtis Caldwell, Ging-Yuek Robin Hsiung, Howard Feldman, Michele Assaly, Andrew Kertesz, John Rogers, Dick Trost, Charles Bernick, Donna Munic, Diana Kerwin, Marek-Marsel Mesulam, Kristina Lipowski, Chuang-Kuo Wu, Nancy Johnson, Carl Sadowsky, Walter Martinez, Teresa Villena, Raymond Scott Turner, Kathleen Johnson, Brigid Reynolds, Reisa A Sperling, Keith A Johnson, Gad Marshall, Meghan Frey, Allyson Rosen, Jared Tinklenberg, Marwan Sabbagh, Christine Belden, Sandra Jacobson, Neil Kowall, Ronald Killiany, Andrew E Budson, Alexander Norbash, Patricia Lynn Johnson, Thomas O Obisesan, Saba Wolday, Salome K Bwayo, Alan Lerner, Leon Hudson, Paula Ogrocki, Evan Fletcher, Owen Carmichael, John Olichney, Charles DeCarli, Smita Kittur, Michael Borrie, T-Y Lee, Rob Bartha, Sterling Johnson, Sanjay Asthana, Cynthia M Carlsson, Steven G Potkin, Adrian Preda, Dana Nguyen, Pierre Tariot, Adam Fleisher, Stephanie Reeder, Vernice Bates, Horacio Capote, Michelle Rainka, Barry A Hendin, Douglas W Scharre, Maria Kataki, Earl A Zimmerman, Dzintra Celmins, Alice D Brown, Godfrey D Pearlson, Karen Blank, Karen Anderson, Andrew J Saykin, Robert B Santulli, Eben S Schwartz, Kaycee M Sink, Jeff D Williamson, Pradeep Garg, Franklin Watkins, Brian R Ott, Henry Querfurth, Geoffrey Tremont, Stephen Salloway, Paul Malloy, Stephen Correia, Howard J Rosen, Bruce L Miller, Jacobo Mintzer, Kenneth Spicer

Affiliation

¹ Section on Functional Imaging Methods, Laboratory of Brain and Cognition, NIMH, NIH, Bethesda, USA.

PMID: 22166797
DOI: 10.1016/j.neuroimage.2011.11.066

Comparative Study

Does feature selection improve classification accuracy? Impact of sample size and feature selection on classification using anatomical magnetic resonance images

Carlton Chu et al. Neuroimage. 2012 Mar.

. 2012 Mar;60(1):59-70.

doi: 10.1016/j.neuroimage.2011.11.066. Epub 2011 Dec 1.

Authors

Carlton Chu¹, Ai-Ling Hsu, Kun-Hsien Chou, Peter Bandettini, Chingpo Lin; Alzheimer's Disease Neuroimaging Initiative

Collaborators

Alzheimer's Disease Neuroimaging Initiative:
Michael Weiner, Paul Aisen, Michael Weiner, Paul Aisen, Ronald Petersen, Clifford R Jack Jr, William Jagust, John Q Trojanowki, Laurel Beckett, Robert C Green, Andrew J Saykin, John Morris, Enchi Liu, Robert C Green, Tom Montine, Ronald Petersen, Paul Aisen, Anthony Gamst, Ronald G Thomas, Michael Donohue, Sarah Walter, Devon Gessert, Tamie Sather, Laurel Beckett, Danielle Harvey, Anthony Gamst, Michael Donohue, John Kornak, Clifford R Jack Jr, Anders Dale, Matthew Bernstein, Joel Felmlee, Nick Fox, Paul Thompson, Norbert Schuff, Gene Alexander, Charles DeCarli, William Jagust, Dan Bandy, Robert A Koeppe, Norm Foster, Eric M Reiman, Kewei Chen, Chet Mathis, John Morris, Nigel J Cairns, Lisa Taylor-Reinwald, J Q Trojanowki, Les Shaw, Virginia M Y Lee, Magdalena Korecka, Arthur W Toga, Karen Crawford, Scott Neu, Andrew J Saykin, Tatiana M Foroud, Steven Potkin, Li Shen, Zaven Kachaturian, Richard Frank, Peter J Snyder, Susan Molchan, Jeffrey Kaye, Joseph Quinn, Betty Lind, Sara Dolen, Lon S Schneider, Sonia Pawluczyk, Bryan M Spann, James Brewer, Helen Vanderswag, Judith L Heidebrink, Joanne L Lord, Ronald Petersen, Kris Johnson, Rachelle S Doody, Javier Villanueva-Meyer, Munir Chowdhury, Yaakov Stern, Lawrence S Honig, Karen L Bell, John C Morris, Beau Ances, Maria Carroll, Sue Leon, Mark A Mintun, Stacy Schneider, Daniel Marson, Randall Griffith, David Clark, Hillel Grossman, Effie Mitsis, Aliza Romirowsky, Leyla deToledo-Morrell, Raj C Shah, Ranjan Duara, Daniel Varon, Peggy Roberts, Marilyn Albert, Stephanie Kielb, Henry Rusinek, Mony J de Leon, Lidia Glodzik, P Murali Doraiswamy, Jeffrey R Petrella, R Edward Coleman, Steven E Arnold, Jason H Karlawish, David Wolk, Charles D Smith, Greg Jicha, Peter Hardy, Oscar L Lopez, MaryAnn Oakley, Donna M Simpson, Anton P Porsteinsson, Bonnie S Goldstein, Kim Martin, Kelly M Makino, M Saleem Ismail, Connie Brand, Ruth A Mulnard, Gaby Thai, Catherine Mc-Adams-Ortiz, Ramon Diaz-Arrastia, Kristen Martin-Cook, Michael DeVous, Allan I Levey, James J Lah, Janet S Cellar, Jeffrey M Burns, Heather S Anderson, Russell H Swerdlow, Liana Apostolova, Po H Lu, George Bartzokis, Daniel H S Silverman, Neill R Graff-Radford, Francine Parfitt, Heather Johnson, Martin Farlow, Scott Herring, Ann M Hake, Christopher H van Dyck, Richard E Carson, Martha G MacAvoy, Howard Chertkow, Howard Bergman, Chris Hosein, Sandra Black, Bojana Stefanovic, Curtis Caldwell, Ging-Yuek Robin Hsiung, Howard Feldman, Michele Assaly, Andrew Kertesz, John Rogers, Dick Trost, Charles Bernick, Donna Munic, Diana Kerwin, Marek-Marsel Mesulam, Kristina Lipowski, Chuang-Kuo Wu, Nancy Johnson, Carl Sadowsky, Walter Martinez, Teresa Villena, Raymond Scott Turner, Kathleen Johnson, Brigid Reynolds, Reisa A Sperling, Keith A Johnson, Gad Marshall, Meghan Frey, Allyson Rosen, Jared Tinklenberg, Marwan Sabbagh, Christine Belden, Sandra Jacobson, Neil Kowall, Ronald Killiany, Andrew E Budson, Alexander Norbash, Patricia Lynn Johnson, Thomas O Obisesan, Saba Wolday, Salome K Bwayo, Alan Lerner, Leon Hudson, Paula Ogrocki, Evan Fletcher, Owen Carmichael, John Olichney, Charles DeCarli, Smita Kittur, Michael Borrie, T-Y Lee, Rob Bartha, Sterling Johnson, Sanjay Asthana, Cynthia M Carlsson, Steven G Potkin, Adrian Preda, Dana Nguyen, Pierre Tariot, Adam Fleisher, Stephanie Reeder, Vernice Bates, Horacio Capote, Michelle Rainka, Barry A Hendin, Douglas W Scharre, Maria Kataki, Earl A Zimmerman, Dzintra Celmins, Alice D Brown, Godfrey D Pearlson, Karen Blank, Karen Anderson, Andrew J Saykin, Robert B Santulli, Eben S Schwartz, Kaycee M Sink, Jeff D Williamson, Pradeep Garg, Franklin Watkins, Brian R Ott, Henry Querfurth, Geoffrey Tremont, Stephen Salloway, Paul Malloy, Stephen Correia, Howard J Rosen, Bruce L Miller, Jacobo Mintzer, Kenneth Spicer

Affiliation

¹ Section on Functional Imaging Methods, Laboratory of Brain and Cognition, NIMH, NIH, Bethesda, USA.

PMID: 22166797
DOI: 10.1016/j.neuroimage.2011.11.066

Abstract

There are growing numbers of studies using machine learning approaches to characterize patterns of anatomical difference discernible from neuroimaging data. The high-dimensionality of image data often raises a concern that feature selection is needed to obtain optimal accuracy. Among previous studies, mostly using fixed sample sizes, some show greater predictive accuracies with feature selection, whereas others do not. In this study, we compared four common feature selection methods. 1) Pre-selected region of interests (ROIs) that are based on prior knowledge. 2) Univariate t-test filtering. 3) Recursive feature elimination (RFE), and 4) t-test filtering constrained by ROIs. The predictive accuracies achieved from different sample sizes, with and without feature selection, were compared statistically. To demonstrate the effect, we used grey matter segmented from the T1-weighted anatomical scans collected by the Alzheimer's disease Neuroimaging Initiative (ADNI) as the input features to a linear support vector machine classifier. The objective was to characterize the patterns of difference between Alzheimer's disease (AD) patients and cognitively normal subjects, and also to characterize the difference between mild cognitive impairment (MCI) patients and normal subjects. In addition, we also compared the classification accuracies between MCI patients who converted to AD and MCI patients who did not convert within the period of 12 months. Predictive accuracies from two data-driven feature selection methods (t-test filtering and RFE) were no better than those achieved using whole brain data. We showed that we could achieve the most accurate characterizations by using prior knowledge of where to expect neurodegeneration (hippocampus and parahippocampal gyrus). Therefore, feature selection does improve the classification accuracies, but it depends on the method adopted. In general, larger sample sizes yielded higher accuracies with less advantage obtained by using knowledge from the existing literature.

PubMed Disclaimer

Comment in

The utility of data-driven feature selection: re: Chu et al. 2012.
Kerr WT, Douglas PK, Anderson A, Cohen MS. Kerr WT, et al. Neuroimage. 2014 Jan 1;84:1107-10. doi: 10.1016/j.neuroimage.2013.07.050. Epub 2013 Jul 25. Neuroimage. 2014. PMID: 23891886 Free PMC article.

Cited by

PRoNTo: pattern recognition for neuroimaging toolbox.
Schrouff J, Rosa MJ, Rondina JM, Marquand AF, Chu C, Ashburner J, Phillips C, Richiardi J, Mourão-Miranda J. Schrouff J, et al. Neuroinformatics. 2013 Jul;11(3):319-37. doi: 10.1007/s12021-013-9178-1. Neuroinformatics. 2013. PMID: 23417655 Free PMC article.
Locally linear embedding (LLE) for MRI based Alzheimer's disease classification.
Liu X, Tosun D, Weiner MW, Schuff N; Alzheimer's Disease Neuroimaging Initiative. Liu X, et al. Neuroimage. 2013 Dec;83:148-57. doi: 10.1016/j.neuroimage.2013.06.033. Epub 2013 Jun 21. Neuroimage. 2013. PMID: 23792982 Free PMC article.
Machine Learning for Predicting Cognitive Diseases: Methods, Data Sources and Risk Factors.
Bratić B, Kurbalija V, Ivanović M, Oder I, Bosnić Z. Bratić B, et al. J Med Syst. 2018 Oct 27;42(12):243. doi: 10.1007/s10916-018-1071-x. J Med Syst. 2018. PMID: 30368611 Review.
Trends in Heart-Rate Variability Signal Analysis.
Ishaque S, Khan N, Krishnan S. Ishaque S, et al. Front Digit Health. 2021 Feb 25;3:639444. doi: 10.3389/fdgth.2021.639444. eCollection 2021. Front Digit Health. 2021. PMID: 34713110 Free PMC article. Review.
The Added Value of Diffusion-Weighted MRI-Derived Structural Connectome in Evaluating Mild Cognitive Impairment: A Multi-Cohort Validation1.
Wang Q, Guo L, Thompson PM, Jack CR, Dodge H, Zhan L, Zhou J; Alzheimer’s Disease Neuroimaging Initiative and National Alzheimer’s Coordinating Center. Wang Q, et al. J Alzheimers Dis. 2018;64(1):149-169. doi: 10.3233/JAD-171048. J Alzheimers Dis. 2018. PMID: 29865049 Free PMC article.

See all "Cited by" articles

Publication types

Actions
Actions
Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- ClinicalKey
- Elsevier Science
Other Literature Sources
- The Lens - Patent Citations Database
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Does feature selection improve classification accuracy? Impact of sample size and feature selection on classification using anatomical magnetic resonance images

Collaborators

Affiliation

Does feature selection improve classification accuracy? Impact of sample size and feature selection on classification using anatomical magnetic resonance images

Authors

Collaborators

Affiliation

Abstract

Comment in

Similar articles

Cited by

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical