{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,10,30]],"date-time":"2024-10-30T21:22:39Z","timestamp":1730323359998,"version":"3.28.0"},"publisher-location":"New York, NY, USA","reference-count":40,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,3,7]]},"DOI":"10.1145\/3444685.3446269","type":"proceedings-article","created":{"date-parts":[[2021,5,4]],"date-time":"2021-05-04T04:48:41Z","timestamp":1620103721000},"page":"1-7","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Self-supervised adversarial learning for cross-modal retrieval"],"prefix":"10.1145","author":[{"given":"Yangchao","family":"Wang","sequence":"first","affiliation":[{"name":"University of Electronic Science and Technology of China"}]},{"given":"Shiyuan","family":"He","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China"}]},{"given":"Xing","family":"Xu","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China"}]},{"given":"Yang","family":"Yang","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China"}]},{"given":"Jingjing","family":"Li","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China"}]},{"given":"Heng Tao","family":"Shen","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China"}]}],"member":"320","published-online":{"date-parts":[[2021,5,3]]},"reference":[{"key":"e_1_3_2_2_1_1","unstructured":"Galen Andrew Raman Arora Jeff A. Bilmes and Karen Livescu. 2013. Deep Canonical Correlation Analysis. 
In ICML. 1247--1255."},{"key":"e_1_3_2_2_2_1","unstructured":"Lucas Beyer Xiaohua Zhai Avital Oliver and Alexander Kolesnikov. 2019. S4L: Self-Supervised Semi-Supervised Learning. In ICCV. 1476--1485."},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"crossref","unstructured":"Yue Cao Mingsheng Long Jianmin Wang and Shichen Liu. 2017. Collective Deep Quantization for Efficient Cross-Modal Retrieval. In AAAI. 3974--3980.","DOI":"10.1609\/aaai.v31i1.11218"},{"key":"e_1_3_2_2_4_1","volume-title":"Efros","author":"Doersch Carl","year":"2015","unstructured":"Carl Doersch, Abhinav Gupta, and Alexei A. Efros. 2015. Unsupervised Visual Representation Learning by Context Prediction. In ICCV. 1422--1430."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2496141"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"crossref","unstructured":"Fangxiang Feng Xiaojie Wang and Ruifan Li. 2014. Cross-modal Retrieval with Correspondence Autoencoder. In ACM MM. 7--16.","DOI":"10.1145\/2647868.2654902"},{"key":"e_1_3_2_2_7_1","volume-title":"Lempitsky","author":"Ganin Yaroslav","year":"2015","unstructured":"Yaroslav Ganin and Victor S. Lempitsky. 2015. Unsupervised Domain Adaptation by Backpropagation. In ICML. 
1180--1189."},{"key":"e_1_3_2_2_8_1","unstructured":"Spyros Gidaris Praveer Singh and Nikos Komodakis. 2018. Unsupervised Representation Learning by Predicting Image Rotations. In ICLR."},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1162\/0899766042321814"},{"key":"e_1_3_2_2_10_1","unstructured":"Dan Hendrycks Mantas Mazeika Saurav Kadavath and Dawn Song. 2019. Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty. In NeurIPS. 15637--15648."},{"key":"e_1_3_2_2_11_1","unstructured":"Peng Hu Liangli Zhen Dezhong Peng and Pei Liu. 2019. Scalable Deep Multi-modal Learning for Cross-Modal Retrieval. In SIGIR. 635--644."},{"key":"e_1_3_2_2_12_1","volume-title":"Lew","author":"Huiskes Mark J.","year":"2008","unstructured":"Mark J. Huiskes and Michael S. Lew. 2008. The MIR flickr retrieval evaluation. In ACM MIR. 39--43."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"crossref","unstructured":"Meina Kan Shiguang Shan and Xilin Chen. 2016. Multi-view Deep Network for Cross-View Classification. In CVPR. 4847--4855.","DOI":"10.1109\/CVPR.2016.524"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2435740"},{"key":"e_1_3_2_2_15_1","volume-title":"Kingma and Jimmy Ba","author":"Diederik","year":"2015","unstructured":"Diederik P. Kingma and Jimmy Ba. 2015. 
Adam: A Method for Stochastic Optimization. In ICLR."},{"key":"e_1_3_2_2_16_1","unstructured":"Alexander Kolesnikov Xiaohua Zhai and Lucas Beyer. [n. d.]. Revisiting Self-Supervised Visual Representation Learning. In CVPR."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"crossref","unstructured":"Gustav Larsson Michael Maire and Gregory Shakhnarovich. 2016. Learning Representations for Automatic Colorization. In ECCV. 577--593.","DOI":"10.1007\/978-3-319-46493-0_35"},{"key":"e_1_3_2_2_18_1","unstructured":"Jey Han Lau and Timothy Baldwin. 2016. An Empirical Evaluation of doc2vec with Practical Insights into Document Embedding Generation. In Rep4NLP@ACL Workshop. 78--86."},{"key":"e_1_3_2_2_19_1","unstructured":"Chao Li Cheng Deng Ning Li Wei Liu Xinbo Gao and Dacheng Tao. 2018. Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval. In CVPR. 4242--4251."},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"crossref","unstructured":"Tsung-Yi Lin Michael Maire Serge J. Belongie James Hays Pietro Perona Deva Ramanan Piotr Doll\u00e1r and C. Lawrence Zitnick. 2014. 
Microsoft COCO: Common Objects in Context. In ECCV. 740--755.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"crossref","unstructured":"Mehdi Noroozi and Paolo Favaro. 2016. Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles. In ECCV. 69--84.","DOI":"10.1007\/978-3-319-46466-4_5"},{"key":"e_1_3_2_2_22_1","unstructured":"Yuxin Peng Xin Huang and Jinwei Qi. 2016. Cross-Media Shared Representation by Hierarchical Learning with Multiple Deep Networks. In IJCAI. 3846--3853."},{"key":"e_1_3_2_2_23_1","first-page":"405","article-title":"CCL: Cross-modal Correlation Learning With Multigrained Fusion by Hierarchical Network","volume":"20","author":"Peng Yuxin","year":"2018","unstructured":"Yuxin Peng, Jinwei Qi, Xin Huang, and Yuxin Yuan. 2018. CCL: Cross-modal Correlation Learning With Multigrained Fusion by Hierarchical Network. IEEE TMM 20, 2 (2018), 405--420.","journal-title":"IEEE TMM"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.142"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"crossref","unstructured":"Viresh Ranjan Nikhil Rasiwasia and C. V. Jawahar. 2015. Multi-label Cross-Modal Retrieval. In ICCV. 4094--4102.","DOI":"10.1109\/ICCV.2015.466"},{"key":"e_1_3_2_2_26_1","unstructured":"
Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In ICLR."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"crossref","unstructured":"Bokun Wang Yang Yang Xing Xu Alan Hanjalic and Heng Tao Shen. 2017. Adversarial Cross-Modal Retrieval. In ACM MM. 154--162.","DOI":"10.1145\/3123266.3123326"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"crossref","unstructured":"Kaiye Wang Ran He Wei Wang Liang Wang and Tieniu Tan. 2013. Learning Coupled Feature Spaces for Cross-Modal Matching. In ICCV. 2088--2095.","DOI":"10.1109\/ICCV.2013.261"},{"key":"e_1_3_2_2_29_1","unstructured":"Weiran Wang and Karen Livescu. 2016. Large-Scale Approximate Kernel Canonical Correlation Analysis. In ICLR."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2019.02.007"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2019.102130"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01302"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3338533.3366552"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"crossref","unstructured":"Fei Yan and Krystian Mikolajczyk. 2015. Deep correlation for matching images and text. In CVPR. 3441--3450.","DOI":"10.1109\/CVPR.2015.7298966"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"crossref","unstructured":"Ting Yao Tao Mei and Chong-Wah Ngo. 2015. Learning Query and Image Similarities with Ranking Canonical Correlation Analysis. 
In ICCV. 28--36.","DOI":"10.1109\/ICCV.2015.12"},{"key":"e_1_3_2_2_36_1","first-page":"965","article-title":"Learning Cross-Media Joint Representation With Sparse and Semisupervised Regularization","volume":"24","author":"Zhai Xiaohua","year":"2014","unstructured":"Xiaohua Zhai, Yuxin Peng, and Jianguo Xiao. 2014. Learning Cross-Media Joint Representation With Sparse and Semisupervised Regularization. IEEE TCSVT 24, 6 (2014), 965--978.","journal-title":"IEEE TCSVT"},{"key":"e_1_3_2_2_37_1","first-page":"128","article-title":"Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval","volume":"20","author":"Zhang Liang","year":"2018","unstructured":"Liang Zhang, Bingpeng Ma, Guorong Li, Qingming Huang, and Qi Tian. 2018. Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval. IEEE TMM 20, 1 (2018), 128--141.","journal-title":"IEEE TMM"},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"crossref","unstructured":"Liheng Zhang Guo-Jun Qi Liqiang Wang and Jiebo Luo. 2019. AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transformations Rather Than Data. In CVPR.","DOI":"10.1109\/CVPR.2019.00265"},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"crossref","unstructured":"Ying Zhang and Huchuan Lu. 2018. 
Deep Cross-Modal Projection Learning for Image-Text Matching. In ECCV. 707--723.","DOI":"10.1007\/978-3-030-01246-5_42"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"crossref","unstructured":"Liangli Zhen Peng Hu Xu Wang and Dezhong Peng. 2019. Deep Supervised Cross-Modal Retrieval. In CVPR. 10394--10403.","DOI":"10.1109\/CVPR.2019.01064"}],"event":{"name":"MMAsia '20: ACM Multimedia Asia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Virtual Event Singapore","acronym":"MMAsia '20"},"container-title":["Proceedings of the 2nd ACM International Conference on Multimedia in Asia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3444685.3446269","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,6]],"date-time":"2023-01-06T03:49:37Z","timestamp":1672976977000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3444685.3446269"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,7]]},"references-count":40,"alternative-id":["10.1145\/3444685.3446269","10.1145\/3444685"],"URL":"https:\/\/doi.org\/10.1145\/3444685.3446269","relation":{},"subject":[],"published":{"date-parts":[[2021,3,7]]},"assertion":[{"value":"2021-05-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}