{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,25]],"date-time":"2024-08-25T12:06:39Z","timestamp":1724587599178},"publisher-location":"New York, NY, USA","reference-count":36,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,3,7]]},"DOI":"10.1145\/3444685.3446302","type":"proceedings-article","created":{"date-parts":[[2021,5,4]],"date-time":"2021-05-04T04:48:41Z","timestamp":1620103721000},"update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["C3VQG"],"prefix":"10.1145","author":[{"given":"Shagun","family":"Uppal","sequence":"first","affiliation":[{"name":"IIIT-Delhi, India"}]},{"given":"Anish","family":"Madan","sequence":"additional","affiliation":[{"name":"IIIT-Delhi, India"}]},{"given":"Sarthak","family":"Bhagat","sequence":"additional","affiliation":[{"name":"IIIT-Delhi, India"}]},{"given":"Yi","family":"Yu","sequence":"additional","affiliation":[{"name":"NII, Japan"}]},{"given":"Rajiv Ratn","family":"Shah","sequence":"additional","affiliation":[{"name":"IIIT-Delhi, India"}]}],"member":"320","published-online":{"date-parts":[[2021,5,3]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-016-0966-6"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"crossref","unstructured":"Abdul Fatir Ansari and Harold Soh. 2018. Hyperprior Induced Unsupervised Disentanglement of Latent Representations. In AAAI. Abdul Fatir Ansari and Harold Soh. 2018. Hyperprior Induced Unsupervised Disentanglement of Latent Representations. In AAAI.","DOI":"10.1609\/aaai.v33i01.33013175"},{"key":"e_1_3_2_2_3_1","volume-title":"VQA: Visual Question Answering. In International Conference on Computer Vision (ICCV).","author":"Antol Stanislaw","year":"2015","unstructured":"Stanislaw Antol , Aishwarya Agrawal , Jiasen Lu , Margaret Mitchell , Dhruv Batra , C. Lawrence Zitnick , and Devi Parikh . 2015 . VQA: Visual Question Answering. In International Conference on Computer Vision (ICCV). Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, and Devi Parikh. 2015. VQA: Visual Question Answering. In International Conference on Computer Vision (ICCV)."},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"crossref","unstructured":"Sarthak Bhagat Shagun Uppal Vivian T. Yin and N. Lim. 2020. Disentangling Multiple Features in Video Sequences Using Gaussian Processes in Variational Autoencoders. In ECCV. Sarthak Bhagat Shagun Uppal Vivian T. Yin and N. Lim. 2020. Disentangling Multiple Features in Video Sequences Using Gaussian Processes in Variational Autoencoders. In ECCV.","DOI":"10.1007\/978-3-030-58592-1_7"},{"key":"e_1_3_2_2_5_1","unstructured":"Shaoxiang Chen Ting Yao and Yu-Gang Jiang. 2019. Deep Learning for Video Captioning: A Review. In IJCAI. Shaoxiang Chen Ting Yao and Yu-Gang Jiang. 2019. Deep Learning for Video Captioning: A Review. In IJCAI."},{"key":"e_1_3_2_2_6_1","volume-title":"Understanding Center Loss Based Network for Image Retrieval with Few Training Data. In ECCV Workshops.","author":"Ghosh Pallabi","unstructured":"Pallabi Ghosh and Larry S. Davis . 2018 . Understanding Center Loss Based Network for Image Retrieval with Few Training Data. In ECCV Workshops. Pallabi Ghosh and Larry S. Davis. 2018. Understanding Center Loss Based Network for Image Retrieval with Few Training Data. In ECCV Workshops."},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00208"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3295748"},{"key":"e_1_3_2_2_9_1","volume-title":"Creativity: Generating Diverse Questions Using Variational Autoencoders. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Jain Unnat","year":"2017","unstructured":"Unnat Jain , Ziyu Zhang , and Alexander G. Schwing . 2017 . Creativity: Generating Diverse Questions Using Variational Autoencoders. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) ( 2017 ), 5415--5424. Unnat Jain, Ziyu Zhang, and Alexander G. Schwing. 2017. Creativity: Generating Diverse Questions Using Variational Autoencoders. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017), 5415--5424."},{"key":"e_1_3_2_2_10_1","volume-title":"Attribute-Centered Loss for Soft-Biometrics Guided Face Sketch-Photo Recognition. 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)","author":"Kazemi Hadi","year":"2018","unstructured":"Hadi Kazemi , Sobhan Soleymani , Ali Dabouei , Seyed Mehdi Iranmanesh , and Nasser M. Nasrabadi . 2018 . Attribute-Centered Loss for Soft-Biometrics Guided Face Sketch-Photo Recognition. 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) ( 2018 ), 612--6128. Hadi Kazemi, Sobhan Soleymani, Ali Dabouei, Seyed Mehdi Iranmanesh, and Nasser M. Nasrabadi. 2018. Attribute-Centered Loss for Soft-Biometrics Guided Face Sketch-Photo Recognition. 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (2018), 612--6128."},{"key":"e_1_3_2_2_11_1","volume-title":"Bayes-Factor-VAE: Hierarchical Bayesian Deep Auto-Encoder Models for Factor Disentanglement. 2019 IEEE\/CVF International Conference on Computer Vision (ICCV)","author":"Kim Minyoung","year":"2019","unstructured":"Minyoung Kim , Yuting Wang , Pritish Sahu , and Vladimir Pavlovic . 2019 . Bayes-Factor-VAE: Hierarchical Bayesian Deep Auto-Encoder Models for Factor Disentanglement. 2019 IEEE\/CVF International Conference on Computer Vision (ICCV) (2019), 2979--2987. Minyoung Kim, Yuting Wang, Pritish Sahu, and Vladimir Pavlovic. 2019. Bayes-Factor-VAE: Hierarchical Bayesian Deep Auto-Encoder Models for Factor Disentanglement. 2019 IEEE\/CVF International Conference on Computer Vision (ICCV) (2019), 2979--2987."},{"key":"e_1_3_2_2_12_1","volume-title":"Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114","author":"Kingma Diederik P","year":"2013","unstructured":"Diederik P Kingma and Max Welling . 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 ( 2013 ). Diederik P Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013)."},{"key":"e_1_3_2_2_13_1","volume-title":"Zemel","author":"Klys Jack","year":"2018","unstructured":"Jack Klys , Jake Snell , and Richard S . Zemel . 2018 . Learning Latent Subspaces in Variational Autoencoders. ArXiv abs\/1812.06190 (2018). Jack Klys, Jake Snell, and Richard S. Zemel. 2018. Learning Latent Subspaces in Variational Autoencoders. ArXiv abs\/1812.06190 (2018)."},{"key":"e_1_3_2_2_14_1","volume-title":"Information Maximizing Visual Question Generation. 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2019)","author":"Krishna Ranjay","year":"2019","unstructured":"Ranjay Krishna , Michael Bernstein , and Li Fei-Fei . 2019 . Information Maximizing Visual Question Generation. 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2019) , 2008--2018. Ranjay Krishna, Michael Bernstein, and Li Fei-Fei. 2019. Information Maximizing Visual Question Generation. 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2019), 2008--2018."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1405"},{"key":"e_1_3_2_2_16_1","volume-title":"2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Li Yikang","year":"2017","unstructured":"Yikang Li , Nan Duan , Bolei Zhou , X. R. Chu , Wanli Ouyang , and Xiaogang Wang . 2017 . Visual Question Generation as Dual Task of Visual Question Answering . 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2017), 6116--6124. Yikang Li, Nan Duan, Bolei Zhou, X. R. Chu, Wanli Ouyang, and Xiaogang Wang. 2017. Visual Question Generation as Dual Task of Visual Question Answering. 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2017), 6116--6124."},{"key":"e_1_3_2_2_17_1","volume-title":"ROUGE: A Package for Automatic Evaluation of Summaries. In Text Summarization Branches Out","author":"Lin Chin-Yew","year":"2004","unstructured":"Chin-Yew Lin . 2004 . ROUGE: A Package for Automatic Evaluation of Summaries. In Text Summarization Branches Out . Association for Computational Linguistics , Barcelona, Spain , 74--81. https:\/\/www.aclweb.org\/anthology\/W04-1013 Chin-Yew Lin. 2004. ROUGE: A Package for Automatic Evaluation of Summaries. In Text Summarization Branches Out. Association for Computational Linguistics, Barcelona, Spain, 74--81. https:\/\/www.aclweb.org\/anthology\/W04-1013"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00898"},{"key":"e_1_3_2_2_19_1","unstructured":"Mateusz Malinowski and Mario Fritz. 2014. A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input. arXiv:1410.0210 [cs.AI] Mateusz Malinowski and Mario Fritz. 2014. A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input. arXiv:1410.0210 [cs.AI]"},{"key":"e_1_3_2_2_20_1","unstructured":"Nasrin Mostafazadeh Chris Brockett William B. Dolan Michel Galley Jianfeng Gao Georgios P. Spithourakis and Lucy Vanderwende. 2017. Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation. In IJCNLP. Nasrin Mostafazadeh Chris Brockett William B. Dolan Michel Galley Jianfeng Gao Georgios P. Spithourakis and Lucy Vanderwende. 2017. Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation. In IJCNLP."},{"key":"e_1_3_2_2_21_1","volume-title":"Generating Natural Questions About an Image. ArXiv abs\/1603.06059","author":"Mostafazadeh Nasrin","year":"2016","unstructured":"Nasrin Mostafazadeh , Ishan Misra , Jacob Devlin , Margaret Mitchell , Xiaodong He , and Lucy Vanderwende . 2016. Generating Natural Questions About an Image. ArXiv abs\/1603.06059 ( 2016 ). Nasrin Mostafazadeh, Ishan Misra, Jacob Devlin, Margaret Mitchell, Xiaodong He, and Lucy Vanderwende. 2016. Generating Natural Questions About an Image. ArXiv abs\/1603.06059 (2016)."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigMM.2019.00-42"},{"key":"e_1_3_2_2_23_1","volume-title":"User Input Based Style Transfer While Retaining Facial Attributes. 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM)","author":"Pai Sharan","year":"2019","unstructured":"Sharan Pai , Nikhil Sachdeva , R. Shah , and R. Zimmermann . 2019 . User Input Based Style Transfer While Retaining Facial Attributes. 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM) ( 2019 ), 68--76. Sharan Pai, Nikhil Sachdeva, R. Shah, and R. Zimmermann. 2019. User Input Based Style Transfer While Retaining Facial Attributes. 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM) (2019), 68--76."},{"key":"e_1_3_2_2_24_1","volume-title":"Multimodal Analysis of User-Generated Multimedia Content","author":"Shah Rajiv","unstructured":"Rajiv Shah and Roger Zimmermann . 2017. Multimodal Analysis of User-Generated Multimedia Content ( 1 st ed.). Springer Publishing Company, Inc orporated. Rajiv Shah and Roger Zimmermann. 2017. Multimodal Analysis of User-Generated Multimedia Content (1st ed.). Springer Publishing Company, Incorporated.","edition":"1"},{"key":"e_1_3_2_2_25_1","volume-title":"Turaga","author":"Shukla Ankita","year":"2019","unstructured":"Ankita Shukla , Sarthak Bhagat , Shagun Uppal , Saket Anand , and Pavan K . Turaga . 2019 . Product of Orthogonal Spheres Parameterization for Disentangled Representation Learning. In BMVC. Ankita Shukla, Sarthak Bhagat, Shagun Uppal, Saket Anand, and Pavan K. Turaga. 2019. Product of Orthogonal Spheres Parameterization for Disentangled Representation Learning. In BMVC."},{"key":"e_1_3_2_2_26_1","volume-title":"Emerging Trends of Multimodal Research in Vision and Language. ArXiv abs\/2010.09522","author":"Uppal Shagun","year":"2020","unstructured":"Shagun Uppal , Sarthak Bhagat , Devamanyu Hazarika , Navonil Majumdar , Soujanya Poria , R. Zimmermann , and Amir Zadeh . 2020. Emerging Trends of Multimodal Research in Vision and Language. ArXiv abs\/2010.09522 ( 2020 ). Shagun Uppal, Sarthak Bhagat, Devamanyu Hazarika, Navonil Majumdar, Soujanya Poria, R. Zimmermann, and Amir Zadeh. 2020. Emerging Trends of Multimodal Research in Vision and Language. ArXiv abs\/2010.09522 (2020)."},{"key":"e_1_3_2_2_27_1","volume-title":"Rajiv Ratn Shah, and Amanda Stent","author":"Uppal Shagun","year":"2020","unstructured":"Shagun Uppal , Vivek Gupta , Avinash Swaminathan , Haimin Zhang , Debanjan Mahata , Rakesh Gosangi , Rajiv Ratn Shah, and Amanda Stent . 2020 . Two-Step Classification using Recasted Data for Low Resource Settings. In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing. Association for Computational Linguistics, Suzhou, China, 706--719. https:\/\/www.aclweb.org\/anthology\/2020.aacl-main.71 Shagun Uppal, Vivek Gupta, Avinash Swaminathan, Haimin Zhang, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah, and Amanda Stent. 2020. Two-Step Classification using Recasted Data for Low Resource Settings. In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing. Association for Computational Linguistics, Suzhou, China, 706--719. https:\/\/www.aclweb.org\/anthology\/2020.aacl-main.71"},{"key":"e_1_3_2_2_28_1","volume-title":"2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Vedantam Ramakrishna","year":"2014","unstructured":"Ramakrishna Vedantam , C. Lawrence Zitnick , and Devi Parikh . 2014 . CIDEr: Consensus-based image description evaluation . 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2014), 4566--4575. Ramakrishna Vedantam, C. Lawrence Zitnick, and Devi Parikh. 2014. CIDEr: Consensus-based image description evaluation. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2014), 4566--4575."},{"key":"e_1_3_2_2_29_1","volume-title":"A Joint Model for Question Answering and Question Generation. ArXiv abs\/1706.01450","author":"Wang Tong","year":"2017","unstructured":"Tong Wang , Xingdi Yuan , and Adam Trischler . 2017. A Joint Model for Question Answering and Question Generation. ArXiv abs\/1706.01450 ( 2017 ). Tong Wang, Xingdi Yuan, and Adam Trischler. 2017. A Joint Model for Question Answering and Question Generation. ArXiv abs\/1706.01450 (2017)."},{"key":"e_1_3_2_2_30_1","unstructured":"Yandong Wen Kaipeng Zhang Zhifeng Li and Yu Qiao. 2016. A Discriminative Feature Learning Approach for Deep Face Recognition. In ECCV. Yandong Wen Kaipeng Zhang Zhifeng Li and Yu Qiao. 2016. A Discriminative Feature Learning Approach for Deep Face Recognition. In ECCV."},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-018-01142-4"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2018.8486475"},{"key":"e_1_3_2_2_33_1","volume-title":"Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition. In CoRL.","author":"Yang Jianwei","year":"2018","unstructured":"Jianwei Yang , Jiasen Lu , Stefan Lee , Dhruv Batra , and Devi Parikh . 2018 . Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition. In CoRL. Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, and Devi Parikh. 2018. Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition. In CoRL."},{"key":"e_1_3_2_2_34_1","volume-title":"Neural Self Talk: Image Understanding via Continuous Questioning and Answering. ArXiv abs\/1512.03460","author":"Yang Yezhou","year":"2015","unstructured":"Yezhou Yang , Yi Li , Cornelia Ferm\u00fcller , and Yiannis Aloimonos . 2015. Neural Self Talk: Image Understanding via Continuous Questioning and Answering. ArXiv abs\/1512.03460 ( 2015 ). Yezhou Yang, Yi Li, Cornelia Ferm\u00fcller, and Yiannis Aloimonos. 2015. Neural Self Talk: Image Understanding via Continuous Questioning and Answering. ArXiv abs\/1512.03460 (2015)."},{"key":"e_1_3_2_2_35_1","volume-title":"Automatic Generation of Grounded Visual Questions. ArXiv abs\/1612.06530","author":"Zhang Shijie","year":"2016","unstructured":"Shijie Zhang , Lizhen Qu , Shaodi You , Zhenglu Yang , and Jiawan Zhang . 2016. Automatic Generation of Grounded Visual Questions. ArXiv abs\/1612.06530 ( 2016 ). Shijie Zhang, Lizhen Qu, Shaodi You, Zhenglu Yang, and Jiawan Zhang. 2016. Automatic Generation of Grounded Visual Questions. ArXiv abs\/1612.06530 (2016)."},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"crossref","unstructured":"Yuke Zhu Oliver Groth Michael Bernstein and Li Fei-Fei. 2015. Visual7W: Grounded Question Answering in Images. arXiv:1511.03416 [cs.CV] Yuke Zhu Oliver Groth Michael Bernstein and Li Fei-Fei. 2015. Visual7W: Grounded Question Answering in Images. arXiv:1511.03416 [cs.CV]","DOI":"10.1109\/CVPR.2016.540"}],"event":{"name":"MMAsia '20: ACM Multimedia Asia","location":"Virtual Event Singapore","acronym":"MMAsia '20","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 2nd ACM International Conference on Multimedia in Asia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3444685.3446302","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,6]],"date-time":"2023-01-06T03:55:30Z","timestamp":1672977330000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3444685.3446302"}},"subtitle":["category consistent cyclic visual question generation"],"short-title":[],"issued":{"date-parts":[[2021,3,7]]},"references-count":36,"alternative-id":["10.1145\/3444685.3446302","10.1145\/3444685"],"URL":"https:\/\/doi.org\/10.1145\/3444685.3446302","relation":{},"subject":[],"published":{"date-parts":[[2021,3,7]]},"assertion":[{"value":"2021-05-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}