{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T00:48:46Z","timestamp":1740098926327,"version":"3.37.3"},"publisher-location":"New York, NY, USA","reference-count":15,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,6,19]],"date-time":"2017-06-19T00:00:00Z","timestamp":1497830400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61672523"],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,6,19]]},"DOI":"10.1145\/3095713.3095751","type":"proceedings-article","created":{"date-parts":[[2017,8,28]],"date-time":"2017-08-28T12:45:27Z","timestamp":1503924327000},"page":"1-5","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Harvesting Deep Models for Cross-Lingual Image Annotation"],"prefix":"10.1145","author":[{"given":"Qijie","family":"Wei","sequence":"first","affiliation":[{"name":"Key Lab of Data Engineering and Knowledge Engineering, Renmin University of China, Multimedia Computing Lab, School of Information, Renmin University of China"}]},{"given":"Xiaoxu","family":"Wang","sequence":"additional","affiliation":[{"name":"Key Lab of Data Engineering and Knowledge Engineering, Renmin University of China, Multimedia Computing Lab, School of Information, Renmin University of China"}]},{"given":"Xirong","family":"Li","sequence":"additional","affiliation":[{"name":"Key Lab of Data Engineering and Knowledge Engineering, Renmin University of China, Multimedia Computing Lab, School of Information, Renmin University of China"}]}],"member":"320","published-online":{"date-parts":[[2017,6,19]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Alexandre B\u00e9rard Christophe Servan Olivier Pietquin and Laurent Besacier. 2016. MultiVec: a multilingual and multilevel representation learning toolkit for nlp. In LREC. Alexandre B\u00e9rard Christophe Servan Olivier Pietquin and Laurent Besacier. 2016. MultiVec: a multilingual and multilevel representation learning toolkit for nlp. In LREC."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Jia Deng Wei Dong Richard Socher Li-Jia Li Kai Li and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In CVPR. Jia Deng Wei Dong Richard Socher Li-Jia Li Kai Li and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In CVPR.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/2566972.2566993"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Alireza Koochali Sebastian Kalkowski Andreas Dengel Damian Borth and Christian Schulze. 2016. Which Languages do People Speak on Flickr? A Language and Geo-Location Study of the YFCC100m Dataset. In MMCommons. Alireza Koochali Sebastian Kalkowski Andreas Dengel Damian Borth and Christian Schulze. 2016. Which Languages do People Speak on Flickr? A Language and Geo-Location Study of the YFCC100m Dataset. In MMCommons.","DOI":"10.1145\/2983554.2983560"},{"key":"e_1_3_2_1_6_1","unstructured":"I Krasin T Duerig N Alldrin A Veit S Abu-El-Haija S Belongie D Cai Z Feng V Ferrari V Gomes and others. 2016. OpenImages: A public dataset for large-scale multi-label and multiclass image classification. https:\/\/github.com\/openimages. (2016). I Krasin T Duerig N Alldrin A Veit S Abu-El-Haija S Belongie D Cai Z Feng V Ferrari V Gomes and others. 2016. OpenImages: A public dataset for large-scale multi-label and multiclass image classification. https:\/\/github.com\/openimages. (2016)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2911996.2912049"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2766462.2767773"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Xirong Li Tiberio Uricchio Lamberto Ballan Marco Bertini Cees G. M. Snoek and Alberto Del Bimbo. 2016. Socializing the Semantic Gap: A Comparative Survey on Image Tag Assignment Refinement and Retrieval. CSUR 49 1 (2016) 14:1--14:39. Xirong Li Tiberio Uricchio Lamberto Ballan Marco Bertini Cees G. M. Snoek and Alberto Del Bimbo. 2016. Socializing the Semantic Gap: A Comparative Survey on Image Tag Assignment Refinement and Retrieval. CSUR 49 1 (2016) 14:1--14:39.","DOI":"10.1145\/2906152"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W15-1521"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Pascal Mettes Dennis Koelma and Cees Snoek. 2016. The ImageNet Shuffle: Reorganized Pre-training for Video Event Detection. In ICMR. Pascal Mettes Dennis Koelma and Cees Snoek. 2016. The ImageNet Shuffle: Reorganized Pre-training for Video Event Detection. In ICMR.","DOI":"10.1145\/2911996.2912036"},{"key":"e_1_3_2_1_12_1","unstructured":"Tomas Mikolov Kai Chen Greg Corrado and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. In ICLR. Tomas Mikolov Kai Chen Greg Corrado and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. In ICLR."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Takashi Miyazaki and Nobuyuki Shimizu. 2015. Cross-lingual image caption generation. In ACL. Takashi Miyazaki and Nobuyuki Shimizu. 2015. Cross-lingual image caption generation. In ACL.","DOI":"10.18653\/v1\/P16-1168"},{"key":"e_1_3_2_1_14_1","unstructured":"Mohammad Norouzi Tomas Mikolov Samy Bengio Yoram Singer Jonathon Shlens Andrea Frome Greg Corrado and Jeffrey Dean. 2014. Zero-Shot Learning by Convex Combination of Semantic Embeddings. In ICLR. Mohammad Norouzi Tomas Mikolov Samy Bengio Yoram Singer Jonathon Shlens Andrea Frome Greg Corrado and Jeffrey Dean. 2014. Zero-Shot Learning by Convex Combination of Semantic Embeddings. In ICLR."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2009.5202826"}],"event":{"name":"CBMI '17: International Workshop on Content-Based Multimedia Indexing","acronym":"CBMI '17","location":"Florence Italy"},"container-title":["Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3095713.3095751","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,14]],"date-time":"2023-01-14T10:49:35Z","timestamp":1673693375000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3095713.3095751"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,6,19]]},"references-count":15,"alternative-id":["10.1145\/3095713.3095751","10.1145\/3095713"],"URL":"https:\/\/doi.org\/10.1145\/3095713.3095751","relation":{},"subject":[],"published":{"date-parts":[[2017,6,19]]},"assertion":[{"value":"2017-06-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}