{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,19]],"date-time":"2024-09-19T15:34:31Z","timestamp":1726760071633},"reference-count":3,"publisher":"World Scientific Pub Co Pte Lt","issue":"03","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Image Grap."],"published-print":{"date-parts":[[2014,7]]},"abstract":" Automatic image annotation is an important and challenging task for image analysis and understanding, with applications such as content-based image retrieval (CBIR). The relationship between keywords and visual features is complicated by the semantic gap. We present an approach to automatic image annotation based on scene analysis. Under the constraint of scene semantics, the correlation between keywords and visual features becomes simpler and clearer. Our model has two stages. The first stage is training, which groups the training image data set into semantic scenes using the extracted semantic feature, and into visual scenes constructed from the distances between visual features computed for every pair of training images using the Earth mover's distance (EMD). Each pair of semantic and visual scenes is then combined, and a Gaussian mixture model (GMM) is applied to all scenes. The second stage tests the model and annotates the test image data set with keywords. Using the visual features provided by Duygulu, experimental results show that our model outperforms the probabilistic latent semantic analysis (PLSA) & GMM (PLSA&GMM) model on the Corel5K database. 
","DOI":"10.1142\/s0219467814500120","type":"journal-article","created":{"date-parts":[[2014,8,25]],"date-time":"2014-08-25T09:38:50Z","timestamp":1408959530000},"page":"1450012","source":"Crossref","is-referenced-by-count":1,"title":["Automatic Image Annotation Based on Scene Analysis"],"prefix":"10.1142","volume":"14","author":[{"given":"Yongmei","family":"Liu","sequence":"first","affiliation":[{"name":"College of Computer Science and Technology, Harbin Engineering University, Harbin, Heilongjiang 150001, China"}]},{"given":"Tanakrit","family":"Wongwitit","sequence":"additional","affiliation":[{"name":"College of Computer Science and Technology, Harbin Engineering University, Harbin, Heilongjiang 150001, China"}]},{"given":"Linsen","family":"Yu","sequence":"additional","affiliation":[{"name":"College of Computer Science and Technology, Harbin University of Science and Technology, Harbin, Heilongjiang 150080, China"}]}],"member":"219","published-online":{"date-parts":[[2014,8,25]]},"reference":[{"key":"rf3","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2007.61"},{"key":"rf7","first-page":"993","author":"Blei D.","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"rf13","first-page":"1605","volume":"2","author":"Russell B. 
C.","year":"2006","journal-title":"Computer Vision and Pattern Recognition"}],"container-title":["International Journal of Image and Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219467814500120","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,6]],"date-time":"2019-08-06T19:12:36Z","timestamp":1565118756000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0219467814500120"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,7]]},"references-count":3,"journal-issue":{"issue":"03","published-online":{"date-parts":[[2014,8,25]]},"published-print":{"date-parts":[[2014,7]]}},"alternative-id":["10.1142\/S0219467814500120"],"URL":"https:\/\/doi.org\/10.1142\/s0219467814500120","relation":{},"ISSN":["0219-4678","1793-6756"],"issn-type":[{"value":"0219-4678","type":"print"},{"value":"1793-6756","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,7]]}}}