Quantitative analysis of human-model agreement in visual saliency modeling: a comparative study
- PMID: 22868572
- DOI: 10.1109/TIP.2012.2210727
Quantitative analysis of human-model agreement in visual saliency modeling: a comparative study
Abstract
Visual attention is a process that enables biological and machine vision systems to select the most relevant regions from a scene. Relevance is determined by two components: 1) top-down factors driven by task and 2) bottom-up factors that highlight image regions that are different from their surroundings. The latter are often referred to as "visual saliency." Modeling bottom-up visual saliency has been the subject of numerous research efforts during the past 20 years, with many successful applications in computer vision and robotics. Available models have been tested with different datasets (e.g., synthetic psychological search arrays, natural images or videos) using different evaluation scores (e.g., search slopes, comparison to human eye tracking) and parameter settings. This has made direct comparison of models difficult. Here, we perform an exhaustive comparison of 35 state-of-the-art saliency models over 54 challenging synthetic patterns, three natural image datasets, and two video datasets, using three evaluation scores. We find that although model rankings vary, some models consistently perform better. Analysis of datasets reveals that existing datasets are highly center-biased, which influences some of the evaluation scores. Computational complexity analysis shows that some models are very fast, yet yield competitive eye movement prediction accuracy. Different models often have common easy/difficult stimuli. Furthermore, several concerns in visual saliency modeling, eye movement datasets, and evaluation scores are discussed and insights for future work are provided. Our study allows one to assess the state-of-the-art, helps to organizing this rapidly growing field, and sets a unified comparison framework for gauging future efforts, similar to the PASCAL VOC challenge in the object recognition and detection domains.
Similar articles
-
Computational model of stereoscopic 3D visual saliency.IEEE Trans Image Process. 2013 Jun;22(6):2151-65. doi: 10.1109/TIP.2013.2246176. Epub 2013 Feb 11. IEEE Trans Image Process. 2013. PMID: 23412612
-
State-of-the-art in visual attention modeling.IEEE Trans Pattern Anal Mach Intell. 2013 Jan;35(1):185-207. doi: 10.1109/TPAMI.2012.89. IEEE Trans Pattern Anal Mach Intell. 2013. PMID: 22487985
-
What stands out in a scene? A study of human explicit saliency judgment.Vision Res. 2013 Oct 18;91:62-77. doi: 10.1016/j.visres.2013.07.016. Epub 2013 Aug 15. Vision Res. 2013. PMID: 23954536
-
Computational modelling of visual attention.Nat Rev Neurosci. 2001 Mar;2(3):194-203. doi: 10.1038/35058500. Nat Rev Neurosci. 2001. PMID: 11256080 Review.
-
The role of context in object recognition.Trends Cogn Sci. 2007 Dec;11(12):520-7. doi: 10.1016/j.tics.2007.09.009. Epub 2007 Nov 19. Trends Cogn Sci. 2007. PMID: 18024143 Review.
Cited by
-
Clutter perception is invariant to image size.Vision Res. 2015 Nov;116(Pt B):142-51. doi: 10.1016/j.visres.2015.04.017. Epub 2015 May 14. Vision Res. 2015. PMID: 25982717 Free PMC article.
-
Theoretical perspectives on active sensing.Curr Opin Behav Sci. 2018 Oct;11:100-108. doi: 10.1016/j.cobeha.2016.06.009. Curr Opin Behav Sci. 2018. PMID: 30175197 Free PMC article.
-
Combining segmentation and attention: a new foveal attention model.Front Comput Neurosci. 2014 Aug 14;8:96. doi: 10.3389/fncom.2014.00096. eCollection 2014. Front Comput Neurosci. 2014. PMID: 25177289 Free PMC article.
-
Information-theoretic model comparison unifies saliency metrics.Proc Natl Acad Sci U S A. 2015 Dec 29;112(52):16054-9. doi: 10.1073/pnas.1510393112. Epub 2015 Dec 10. Proc Natl Acad Sci U S A. 2015. PMID: 26655340 Free PMC article.
-
Change Blindness Is Influenced by Both Contrast Energy and Subjective Importance within Local Regions of the Image.Front Psychol. 2017 Oct 4;8:1718. doi: 10.3389/fpsyg.2017.01718. eCollection 2017. Front Psychol. 2017. PMID: 29046655 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous