{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,8]],"date-time":"2024-09-08T04:50:37Z","timestamp":1725771037914},"publisher-location":"New York, NY, USA","reference-count":52,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,6,8]]},"DOI":"10.1145\/3372278.3390680","type":"proceedings-article","created":{"date-parts":[[2020,6,2]],"date-time":"2020-06-02T04:35:27Z","timestamp":1591072527000},"page":"108-116","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Interactivity Proposals for Surveillance Videos"],"prefix":"10.1145","author":[{"given":"Shuo","family":"Chen","sequence":"first","affiliation":[{"name":"University of Amsterdam, Amsterdam, Netherlands"}]},{"given":"Pascal","family":"Mettes","sequence":"additional","affiliation":[{"name":"University of Amsterdam, Amsterdam, Netherlands"}]},{"given":"Tao","family":"Hu","sequence":"additional","affiliation":[{"name":"University of Amsterdam, Amsterdam, Netherlands"}]},{"given":"Cees G.M.","family":"Snoek","sequence":"additional","affiliation":[{"name":"University of Amsterdam, Amsterdam, Netherlands"}]}],"member":"320","published-online":{"date-parts":[[2020,6,8]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Measuring the objectness of image windows. PAMI","author":"Alexe Bogdan","year":"2012","unstructured":"Bogdan Alexe , Thomas Deselaers , and Vittorio Ferrari . 2012. Measuring the objectness of image windows. PAMI ( 2012 ). Bogdan Alexe, Thomas Deselaers, and Vittorio Ferrari. 2012. Measuring the objectness of image windows. PAMI (2012)."},{"key":"e_1_3_2_1_2_1","volume-title":"TRECVID 2018: Benchmarking Video Activity Detection, Video Captioning and Matching, Video Storytelling Linking and Video Search. In TRECVID.","author":"Awad George","year":"2018","unstructured":"George Awad , Asad Butt , Keith Curtis , Yooyoung Lee , Jonathan Fiscus , Afzal Godil , David Joy , Andrew Delgado , Alan F. Smeaton , Yvette Graham , Wessel Kraaij , Georges Qu\u00e9not , Joao Magalhaes , David Semedo , and Saverio Blasi . 2018 . TRECVID 2018: Benchmarking Video Activity Detection, Video Captioning and Matching, Video Storytelling Linking and Video Search. In TRECVID. George Awad, Asad Butt, Keith Curtis, Yooyoung Lee, Jonathan Fiscus, Afzal Godil, David Joy, Andrew Delgado, Alan F. Smeaton, Yvette Graham, Wessel Kraaij, Georges Qu\u00e9not, Joao Magalhaes, David Semedo, and Saverio Blasi. 2018. TRECVID 2018: Benchmarking Video Activity Detection, Video Captioning and Matching, Video Storytelling Linking and Video Search. In TRECVID."},{"key":"e_1_3_2_1_3_1","volume-title":"Jamie Ryan Kiros, and Geoffrey E Hinton","author":"Ba Jimmy Lei","year":"2016","unstructured":"Jimmy Lei Ba , Jamie Ryan Kiros, and Geoffrey E Hinton . 2016 . Layer normalization. arXiv preprint arXiv:1607.06450 (2016). Jimmy Lei Ba, Jamie Ryan Kiros, and Geoffrey E Hinton. 2016. Layer normalization. arXiv preprint arXiv:1607.06450 (2016)."},{"key":"e_1_3_2_1_4_1","volume-title":"SST: Single-stream temporal action proposals. In CVPR.","author":"Buch Shyamal","year":"2017","unstructured":"Shyamal Buch , Victor Escorcia , Chuanqi Shen , Bernard Ghanem , and Juan Carlos Niebles . 2017 . SST: Single-stream temporal action proposals. In CVPR. Shyamal Buch, Victor Escorcia, Chuanqi Shen, Bernard Ghanem, and Juan Carlos Niebles. 2017. SST: Single-stream temporal action proposals. In CVPR."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Yu-Wei Chao Yunfan Liu Xieyang Liu Huayi Zeng and Jia Deng. 2018. Learning to detect human-object interactions. In WACV. Yu-Wei Chao Yunfan Liu Xieyang Liu Huayi Zeng and Jia Deng. 2018. Learning to detect human-object interactions. In WACV.","DOI":"10.1109\/WACV.2018.00048"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACVW.2019.00015"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACVW.2019.00015"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"crossref","unstructured":"Wei Chen Caiming Xiong Ran Xu and Jason J Corso. 2014. Actionness ranking with lattice conditional ordinal random fields. In CVPR. Wei Chen Caiming Xiong Ran Xu and Jason J Corso. 2014. Actionness ranking with lattice conditional ordinal random fields. In CVPR.","DOI":"10.1109\/CVPR.2014.101"},{"key":"e_1_3_2_1_9_1","volume-title":"Imagenet: A large-scale hierarchical image database. In CVPR.","author":"Deng Jia","year":"2009","unstructured":"Jia Deng , Wei Dong , Richard Socher , Li-Jia Li , Kai Li , and Li Fei-Fei . 2009 . Imagenet: A large-scale hierarchical image database. In CVPR. Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In CVPR."},{"key":"e_1_3_2_1_10_1","volume-title":"Juan Carlos Niebles, and Bernard Ghanem.","author":"Escorcia Victor","year":"2016","unstructured":"Victor Escorcia , Fabian Caba Heilbron , Juan Carlos Niebles, and Bernard Ghanem. 2016 . Daps : Deep action proposals for action understanding. In ECCV. Victor Escorcia, Fabian Caba Heilbron, Juan Carlos Niebles, and Bernard Ghanem. 2016. Daps: Deep action proposals for action understanding. In ECCV."},{"key":"e_1_3_2_1_11_1","volume-title":"ICAN: Instance-centric attention network for human-object interaction detection. In BMVC.","author":"Gao Chen","year":"2018","unstructured":"Chen Gao , Yuliang Zou , and Jia-Bin Huang . 2018 b. ICAN: Instance-centric attention network for human-object interaction detection. In BMVC. Chen Gao, Yuliang Zou, and Jia-Bin Huang. 2018b. ICAN: Instance-centric attention network for human-object interaction detection. In BMVC."},{"key":"e_1_3_2_1_12_1","volume-title":"CTAP: Complementary temporal action proposal generation. In ECCV.","author":"Gao Jiyang","year":"2018","unstructured":"Jiyang Gao , Kan Chen , and Ram Nevatia . 2018 a. CTAP: Complementary temporal action proposal generation. In ECCV. Jiyang Gao, Kan Chen, and Ram Nevatia. 2018a. CTAP: Complementary temporal action proposal generation. In ECCV."},{"key":"e_1_3_2_1_13_1","volume-title":"TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals.","author":"Gao Jiyang","year":"2017","unstructured":"Jiyang Gao , Zhenheng Yang , Chen Sun , Kan Chen , and Ramakant Nevatia . 2017 . TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals. (2017). Jiyang Gao, Zhenheng Yang, Chen Sun, Kan Chen, and Ramakant Nevatia. 2017. TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals. (2017)."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"crossref","unstructured":"Georgia Gkioxari Ross Girshick Piotr Doll\u00e1r and Kaiming He. 2018. Detecting and recognizing human-object interactions. In CVPR. Georgia Gkioxari Ross Girshick Piotr Doll\u00e1r and Kaiming He. 2018. Detecting and recognizing human-object interactions. In CVPR.","DOI":"10.1109\/CVPR.2018.00872"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"Joshua Gleason Rajeev Ranjan Steven Schwarcz Carlos Castillo Jun-Cheng Chen and Rama Chellappa. 2019. A Proposal-Based Solution to Spatio-Temporal Action Detection in Untrimmed Videos. In WACV. Joshua Gleason Rajeev Ranjan Steven Schwarcz Carlos Castillo Jun-Cheng Chen and Rama Chellappa. 2019. A Proposal-Based Solution to Spatio-Temporal Action Detection in Untrimmed Videos. In WACV.","DOI":"10.1109\/WACV.2019.00021"},{"key":"e_1_3_2_1_16_1","volume-title":"AVA: A video dataset of spatio-temporally localized atomic visual actions. In CVPR.","author":"Gu Chunhui","year":"2018","unstructured":"Chunhui Gu , Chen Sun , David A Ross , Carl Vondrick , Caroline Pantofaru , Yeqing Li , Sudheendra Vijayanarasimhan , George Toderici , Susanna Ricco , Rahul Sukthankar , 2018 . AVA: A video dataset of spatio-temporally localized atomic visual actions. In CVPR. Chunhui Gu, Chen Sun, David A Ross, Carl Vondrick, Caroline Pantofaru, Yeqing Li, Sudheendra Vijayanarasimhan, George Toderici, Susanna Ricco, Rahul Sukthankar, et al. 2018. AVA: A video dataset of spatio-temporally localized atomic visual actions. In CVPR."},{"key":"e_1_3_2_1_17_1","unstructured":"Jiawei He Zhiwei Deng Mostafa S Ibrahim and Greg Mori. 2018. Generic tubelet proposals for action localization. In WACV. Jiawei He Zhiwei Deng Mostafa S Ibrahim and Greg Mori. 2018. Generic tubelet proposals for action localization. In WACV."},{"key":"e_1_3_2_1_18_1","unstructured":"Kaiming He Georgia Gkioxari Piotr Doll\u00e1r and Ross Girshick. 2017. Mask r-cnn. In ICCV. Kaiming He Georgia Gkioxari Piotr Doll\u00e1r and Ross Girshick. 2017. Mask r-cnn. In ICCV."},{"key":"e_1_3_2_1_19_1","unstructured":"Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR. Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"crossref","unstructured":"Han Hu Jiayuan Gu Zheng Zhang Jifeng Dai and Yichen Wei. 2018. Relation networks for object detection. In CVPR. Han Hu Jiayuan Gu Zheng Zhang Jifeng Dai and Yichen Wei. 2018. Relation networks for object detection. In CVPR.","DOI":"10.1109\/CVPR.2018.00378"},{"key":"e_1_3_2_1_21_1","volume-title":"Herv\u00e9 J\u00e9gou, Patrick Bouthemy, and Cees G M Snoek.","author":"Jain Mihir","year":"2017","unstructured":"Mihir Jain , Jan Van Gemert , Herv\u00e9 J\u00e9gou, Patrick Bouthemy, and Cees G M Snoek. 2017 . Tubelets : Unsupervised action proposals from spatiotemporal super-voxels. IJCV ( 2017). Mihir Jain, Jan Van Gemert, Herv\u00e9 J\u00e9gou, Patrick Bouthemy, and Cees G M Snoek. 2017. Tubelets: Unsupervised action proposals from spatiotemporal super-voxels. IJCV (2017)."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"crossref","unstructured":"Hueihan Jhuang Juergen Gall Silvia Zuffi Cordelia Schmid and Michael J Black. 2013. Towards understanding action recognition. In ICCV. Hueihan Jhuang Juergen Gall Silvia Zuffi Cordelia Schmid and Michael J Black. 2013. Towards understanding action recognition. In ICCV.","DOI":"10.1109\/ICCV.2013.396"},{"key":"e_1_3_2_1_23_1","volume-title":"Synthesizing Attributes with Unreal Engine for Fine-grained Activity Analysis. In WACV Workshop.","author":"Kim Tae Soo","year":"2019","unstructured":"Tae Soo Kim , Mike Peven , Weichao Qiu , Alan Yuille , and Gregory D Hager . 2019 . Synthesizing Attributes with Unreal Engine for Fine-grained Activity Analysis. In WACV Workshop. Tae Soo Kim, Mike Peven, Weichao Qiu, Alan Yuille, and Gregory D Hager. 2019. Synthesizing Attributes with Unreal Engine for Fine-grained Activity Analysis. In WACV Workshop."},{"key":"e_1_3_2_1_24_1","volume-title":"Alexander G Hauptmann, and Li Fei-Fei.","author":"Liang Junwei","year":"2019","unstructured":"Junwei Liang , Lu Jiang , Juan Carlos Niebles , Alexander G Hauptmann, and Li Fei-Fei. 2019 . Peeking into the future: Predicting future person activities and locations in videos. In CVPR. Junwei Liang, Lu Jiang, Juan Carlos Niebles, Alexander G Hauptmann, and Li Fei-Fei. 2019. Peeking into the future: Predicting future person activities and locations in videos. In CVPR."},{"key":"e_1_3_2_1_25_1","volume-title":"Bsn: Boundary sensitive network for temporal action proposal generation. In ECCV.","author":"Lin Tianwei","year":"2018","unstructured":"Tianwei Lin , Xu Zhao , Haisheng Su , Chongjing Wang , and Ming Yang . 2018 . Bsn: Boundary sensitive network for temporal action proposal generation. In ECCV. Tianwei Lin, Xu Zhao, Haisheng Su, Chongjing Wang, and Ming Yang. 2018. Bsn: Boundary sensitive network for temporal action proposal generation. In ECCV."},{"key":"e_1_3_2_1_26_1","unstructured":"Tsung-Yi Lin Piotr Doll\u00e1r Ross Girshick Kaiming He Bharath Hariharan and Serge Belongie. 2017. Feature pyramid networks for object detection. In ICCV. Tsung-Yi Lin Piotr Doll\u00e1r Ross Girshick Kaiming He Bharath Hariharan and Serge Belongie. 2017. Feature pyramid networks for object detection. In ICCV."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"crossref","unstructured":"Yuan Liu Lin Ma Yifeng Zhang Wei Liu and Shih-Fu Chang. 2019. Multi-granularity Generator for Temporal Action Proposal. In CVPR. Yuan Liu Lin Ma Yifeng Zhang Wei Liu and Shih-Fu Chang. 2019. Multi-granularity Generator for Temporal Action Proposal. In CVPR.","DOI":"10.1109\/CVPR.2019.00372"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"crossref","unstructured":"Ishan Misra Abhinav Shrivastava and Martial Hebert. 2015. Watch and learn: Semi-supervised learning for object detectors from video. In CVPR. Ishan Misra Abhinav Shrivastava and Martial Hebert. 2015. Watch and learn: Semi-supervised learning for object detectors from video. In CVPR.","DOI":"10.1109\/CVPR.2015.7298982"},{"key":"e_1_3_2_1_29_1","volume-title":"Saurajit Mukherjee, JK Aggarwal, Hyungtae Lee, Larry Davis, et al.","author":"Oh Sangmin","year":"2011","unstructured":"Sangmin Oh , Anthony Hoogs , Amitha Perera , Naresh Cuntoor , Chia-Chih Chen , Jong Taek Lee , Saurajit Mukherjee, JK Aggarwal, Hyungtae Lee, Larry Davis, et al. 2011 . A large-scale benchmark dataset for event recognition in surveillance video. In CVPR. Sangmin Oh, Anthony Hoogs, Amitha Perera, Naresh Cuntoor, Chia-Chih Chen, Jong Taek Lee, Saurajit Mukherjee, JK Aggarwal, Hyungtae Lee, Larry Davis, et al. 2011. A large-scale benchmark dataset for event recognition in surveillance video. In CVPR."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"crossref","unstructured":"Dan Oneata J\u00e9r\u00f4me Revaud Jakob Verbeek and Cordelia Schmid. 2014. Spatio-temporal object detection proposals. In ECCV. Dan Oneata J\u00e9r\u00f4me Revaud Jakob Verbeek and Cordelia Schmid. 2014. Spatio-temporal object detection proposals. In ECCV.","DOI":"10.1007\/978-3-319-10578-9_48"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2012.175"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"crossref","unstructured":"Haonan Qiu Yingbin Zheng Hao Ye Yao Lu Feng Wang and Liang He. 2018. Precise temporal action localization by evolving temporal proposals. In ICMR. Haonan Qiu Yingbin Zheng Hao Ye Yao Lu Feng Wang and Liang He. 2018. Precise temporal action localization by evolving temporal proposals. In ICMR.","DOI":"10.1145\/3206025.3206029"},{"key":"e_1_3_2_1_33_1","unstructured":"Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In NIPS. Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In NIPS."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"crossref","unstructured":"Mikel D Rodriguez Javed Ahmed and Mubarak Shah. 2008. Action MACH a spatio-temporal Maximum Average Correlation Height filter for action recognition.. In CVPR. Mikel D Rodriguez Javed Ahmed and Mubarak Shah. 2008. Action MACH a spatio-temporal Maximum Average Correlation Height filter for action recognition.. In CVPR.","DOI":"10.1109\/CVPR.2008.4587727"},{"key":"e_1_3_2_1_35_1","volume-title":"The watershed transform: Definitions, algorithms and parallelization strategies. Fundamenta Informaticae","author":"Roerdink Jos BTM","year":"2000","unstructured":"Jos BTM Roerdink and Arnold Meijster . 2000. The watershed transform: Definitions, algorithms and parallelization strategies. Fundamenta Informaticae ( 2000 ). Jos BTM Roerdink and Arnold Meijster. 2000. The watershed transform: Definitions, algorithms and parallelization strategies. Fundamenta Informaticae (2000)."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"crossref","unstructured":"Maguell LTL Sandifort Jianquan Liu Shoji Nishimura and Wolfgang H\u00fcrst. 2018a. An entropy model for loiterer retrieval across multiple surveillance cameras. In ICMR. Maguell LTL Sandifort Jianquan Liu Shoji Nishimura and Wolfgang H\u00fcrst. 2018a. An entropy model for loiterer retrieval across multiple surveillance cameras. In ICMR.","DOI":"10.1145\/3206025.3206049"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"crossref","unstructured":"Maguell LTL Sandifort Jianquan Liu Shoji Nishimura and Wolfgang H\u00fcrst. 2018b. VisLoiter+: An entropy model-based loiterer retrieval system with user-friendly interfaces. In ICMR. Maguell LTL Sandifort Jianquan Liu Shoji Nishimura and Wolfgang H\u00fcrst. 2018b. VisLoiter+: An entropy model-based loiterer retrieval system with user-friendly interfaces. In ICMR.","DOI":"10.1145\/3206025.3206091"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"crossref","unstructured":"Xindi Shang Donglin Di Junbin Xiao Yu Cao Xun Yang and Tat-Seng Chua. 2019. Annotating Objects and Relations in User-Generated Videos. In ICMR. Xindi Shang Donglin Di Junbin Xiao Yu Cao Xun Yang and Tat-Seng Chua. 2019. Annotating Objects and Relations in User-Generated Videos. In ICMR.","DOI":"10.1145\/3323873.3325056"},{"key":"e_1_3_2_1_39_1","volume-title":"Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research","author":"Srivastava Nitish","year":"2014","unstructured":"Nitish Srivastava , Geoffrey Hinton , Alex Krizhevsky , Ilya Sutskever , and Ruslan Salakhutdinov . 2014. Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research ( 2014 ). Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research (2014)."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"crossref","unstructured":"Eran Swears Anthony Hoogs Qiang Ji and Kim Boyer. 2014. Complex activity recognition using granger constrained dbn (gcdbn) in sports and surveillance video. In CVPR. Eran Swears Anthony Hoogs Qiang Ji and Kim Boyer. 2014. Complex activity recognition using granger constrained dbn (gcdbn) in sports and surveillance video. In CVPR.","DOI":"10.1109\/CVPR.2014.106"},{"key":"e_1_3_2_1_41_1","volume-title":"APT: Action localization proposals from dense trajectories.. In BMVC.","author":"Van Gemert Jan C","year":"2015","unstructured":"Jan C Van Gemert , Mihir Jain , Ella Gati , and Cees G M Snoek . 2015 . APT: Action localization proposals from dense trajectories.. In BMVC. Jan C Van Gemert, Mihir Jain, Ella Gati, and Cees G M Snoek. 2015. APT: Action localization proposals from dense trajectories.. In BMVC."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"crossref","unstructured":"Jacob Walker Abhinav Gupta and Martial Hebert. 2014. Patch to the future: Unsupervised visual prediction. In CVPR. Jacob Walker Abhinav Gupta and Martial Hebert. 2014. Patch to the future: Unsupervised visual prediction. In CVPR.","DOI":"10.1109\/CVPR.2014.416"},{"volume-title":"Computer Graphics Forum","author":"Wang He","key":"e_1_3_2_1_43_1","unstructured":"He Wang , S\u00f6ren Pirk , Ersin Yumer , Vladimir G Kim , Ozan Sener , Srinath Sridhar , and Leonidas J Guibas . 2019. Learning a Generative Model for Multi-Step Human-Object Interactions from Videos . In Computer Graphics Forum , Vol. 38 . Wiley Online Library , 367--378. He Wang, S\u00f6ren Pirk, Ersin Yumer, Vladimir G Kim, Ozan Sener, Srinath Sridhar, and Leonidas J Guibas. 2019. Learning a Generative Model for Multi-Step Human-Object Interactions from Videos. In Computer Graphics Forum, Vol. 38. Wiley Online Library, 367--378."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"crossref","unstructured":"Limin Wang Yu Qiao Xiaoou Tang and Luc Van Gool. 2016. Actionness estimation using hybrid fully convolutional networks. In CVPR. Limin Wang Yu Qiao Xiaoou Tang and Luc Van Gool. 2016. Actionness estimation using hybrid fully convolutional networks. In CVPR.","DOI":"10.1109\/CVPR.2016.296"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"crossref","unstructured":"Xiaolong Wang Ross Girshick Abhinav Gupta and Kaiming He. 2018. Non-local neural networks. In CVPR. Xiaolong Wang Ross Girshick Abhinav Gupta and Kaiming He. 2018. Non-local neural networks. In CVPR.","DOI":"10.1109\/CVPR.2018.00813"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"crossref","unstructured":"Xiaoyang Wang and Qiang Ji. 2014. A hierarchical context model for event recognition in surveillance video. In CVPR. Xiaoyang Wang and Qiang Ji. 2014. A hierarchical context model for event recognition in surveillance video. In CVPR.","DOI":"10.1109\/CVPR.2014.328"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"crossref","unstructured":"Nicolai Wojke Alex Bewley and Dietrich Paulus. 2017. Simple online and realtime tracking with a deep association metric. In ICIP. Nicolai Wojke Alex Bewley and Dietrich Paulus. 2017. Simple online and realtime tracking with a deep association metric. In ICIP.","DOI":"10.1109\/ICIP.2017.8296962"},{"key":"e_1_3_2_1_48_1","unstructured":"Bingjie Xu Yongkang Wong Junnan Li Qi Zhao and Mohan S Kankanhalli. 2019. Learning to Detect Human-Object Interactions With Knowledge. In CVPR. Bingjie Xu Yongkang Wong Junnan Li Qi Zhao and Mohan S Kankanhalli. 2019. Learning to Detect Human-Object Interactions With Knowledge. In CVPR."},{"key":"e_1_3_2_1_49_1","unstructured":"Gang Yu and Junsong Yuan. 2015. Fast action proposals for human action detection and search. In CVPR. Gang Yu and Junsong Yuan. 2015. Fast action proposals for human action detection and search. In CVPR."},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"crossref","unstructured":"Yibing Zhan Jun Yu Ting Yu and Dacheng Tao. 2019. On Exploring Undetermined Relationships for Visual Relationship Detection. In CVPR. Yibing Zhan Jun Yu Ting Yu and Dacheng Tao. 2019. On Exploring Undetermined Relationships for Visual Relationship Detection. In CVPR.","DOI":"10.1109\/CVPR.2019.00527"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"crossref","unstructured":"Yue Zhao Yuanjun Xiong Limin Wang Zhirong Wu Xiaoou Tang and Dahua Lin. 2017. Temporal action detection with structured segment networks. In ICCV. Yue Zhao Yuanjun Xiong Limin Wang Zhirong Wu Xiaoou Tang and Dahua Lin. 2017. Temporal action detection with structured segment networks. In ICCV.","DOI":"10.1109\/ICCV.2017.317"},{"key":"e_1_3_2_1_52_1","volume-title":"A unified framework with a benchmark dataset for surveillance event detection. Neurocomputing","author":"Zhao Zhicheng","year":"2018","unstructured":"Zhicheng Zhao , Xuanchong Li , Xingzhong Du , Qi Chen , Yanyun Zhao , Fei Su , Xiaojun Chang , and Alexander G Hauptmann . 2018. A unified framework with a benchmark dataset for surveillance event detection. Neurocomputing ( 2018 ). Zhicheng Zhao, Xuanchong Li, Xingzhong Du, Qi Chen, Yanyun Zhao, Fei Su, Xiaojun Chang, and Alexander G Hauptmann. 2018. A unified framework with a benchmark dataset for surveillance event detection. Neurocomputing (2018)."}],"event":{"name":"ICMR '20: International Conference on Multimedia Retrieval","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Dublin Ireland","acronym":"ICMR '20"},"container-title":["Proceedings of the 2020 International Conference on Multimedia Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3372278.3390680","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,9]],"date-time":"2023-06-09T21:34:10Z","timestamp":1686346450000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3372278.3390680"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,8]]},"references-count":52,"alternative-id":["10.1145\/3372278.3390680","10.1145\/3372278"],"URL":"https:\/\/doi.org\/10.1145\/3372278.3390680","relation":{},"subject":[],"published":{"date-parts":[[2020,6,8]]},"assertion":[{"value":"2020-06-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}