24th ACM Multimedia 2016: Amsterdam, The Netherlands
- Alan Hanjalic, Cees Snoek, Marcel Worring, Dick C. A. Bulterman, Benoit Huet, Aisling Kelliher, Yiannis Kompatsiaris, Jin Li:
Proceedings of the 2016 ACM Conference on Multimedia Conference, MM 2016, Amsterdam, The Netherlands, October 15-19, 2016. ACM 2016, ISBN 978-1-4503-3603-1
Keynote Address
- Dirk Helbing:
A Digital World to Thrive In: How the Internet of Things Can Make the "Invisible Hand" Work. 1
Best Paper
- Shengsheng Qian, Tianzhu Zhang, Changsheng Xu:
Multi-modal Multi-view Topic-opinion Mining for Social Event Analysis. 2-11 - Nic Lupfer, Andruid Kerne, Andrew M. Webb, Rhema Linder:
Patterns of Free-form Curation: Visual Thinking with Web Content. 12-21 - Mengbai Xiao, Viswanathan Swaminathan, Sheng Wei, Songqing Chen:
DASH2M: Exploring HTTP/2 for Internet Streaming to Mobile Devices. 22-31 - Jingjing Chen, Chong-Wah Ngo:
Deep-based Ingredient Recognition for Cooking Recipe Retrieval. 32-41
Posters
- Chris Greenhalgh, Adrian Hazzard, Sean McGrath, Steve Benford:
GeoTracks: Adaptive Music for Everyday Journeys. 42-46 - Xiaoshan Yang, Tianzhu Zhang, Changsheng Xu:
Abnormal Event Discovery in User Generated Photos. 47-51 - Shuhui Jiang, Yue Wu, Yun Fu:
Deep Bi-directional Cross-triplet Embedding for Cross-Domain Clothing Retrieval. 52-56 - Liping Jing, Bo Liu, Jaeyoung Choi, Adam Janin, Julia Bernd, Michael W. Mahoney, Gerald Friedland:
A Discriminative and Compact Audio Representation for Event Detection. 57-61 - Kyeong-Ah Jeong, Hyeon-Jeong Suk:
Jockey Time: Making Video Playback to Enhance Emotional Effect. 62-66 - Hui-Hung Wang, Yi-Ling Chen, Chen-Kuo Chiang:
Discriminative Paired Dictionary Learning for Visual Recognition. 67-71 - Yanhao Zhang, Lei Qin, Qingming Huang, Kuiyuan Yang, Jun Zhang, Hongxun Yao:
From Seed Discovery to Deep Reconstruction: Predicting Saliency in Crowd via Deep Networks. 72-76 - Ke Chen, Joni-Kristian Kämäräinen, Zhaoxiang Zhang:
Facial Age Estimation Using Robust Label Distribution. 77-81 - Sidi Liu, Jinglei Lv, Yimin Hou, Ting Shoemaker, Qinglin Dong, Kaiming Li, Tianming Liu:
What Makes a Good Movie Trailer?: Interpretation from Simultaneous EEG and Eyetracker Recording. 82-86 - Xiaojie Guo:
LIME: A Method for Low-light IMage Enhancement. 87-91 - Rufael Mekuria, Jelte Fennema, Dirk Griffioen:
Multi-Protocol Video Delivery with Late Trans-Muxing. 92-96 - Ravi Kiran Sarvadevabhatla, R. Venkatesh Babu:
Analyzing Structural Characteristics of Object Category Representations From Their Semantic-part Distributions. 97-101 - Pichao Wang, Zhaoyang Li, Yonghong Hou, Wanqing Li:
Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks. 102-106 - Chung-Hua Chu:
Efficient Digital Holographic Image Reconstruction on Mobile Devices. 107-111 - Tetsuaki Mano, Hiroaki Yamane, Tatsuya Harada:
Scene Image Synthesis from Natural Sentences Using Hierarchical Syntactic Analysis. 112-116 - Zan Gao, Deyu Wang, Hua Zhang, Yanbing Xue, Guangping Xu:
A Fast 3D Retrieval Algorithm via Class-Statistic and Pair-Constraint Model. 117-121 - Michael Gygli, Mohammad Soleymani:
Analyzing and Predicting GIF Interestingness. 122-126 - Chen Chen, Zuxuan Wu, Yu-Gang Jiang:
Emotion in Context: Deep Semantic Feature Fusion for Video Emotion Recognition. 127-131 - Ying Li, Xiangwei Kong, Liang Zheng, Qi Tian:
Exploiting Hierarchical Activations of Neural Network for Image Retrieval. 132-136 - Lorenzo Porzi, Samuel Rota Bulò, Elisa Ricci:
A Deeply-Supervised Deconvolutional Network for Horizon Line Detection. 137-141 - Yongqing Sun, Zuxuan Wu, Xi Wang, Hiroyuki Arai, Tetsuya Kinebuchi, Yu-Gang Jiang:
Exploiting Objects with LSTMs for Video Categorization. 142-146 - Jacob Thorn, Rodrigo Pizarro, Bernhard Spanlang, Pablo Bermell-Garcia, Mar González-Franco:
Assessing 3D Scan Quality Through Paired-comparisons Psychophysics. 147-151 - Zhou Zhao, Hanqing Lu, Deng Cai, Xiaofei He, Yueting Zhuang:
Partial Multi-Modal Sparse Coding via Adaptive Similarity Structure Regularization. 152-156 - Hervé Bredin, Gregory Gelly:
Improving Speaker Diarization of TV Series using Talking-Face Detection and Clustering. 157-161 - Jen-Yin Chang, Kuan-Ying Lee, Yu-Lin Wei, Kate Ching-Ju Lin, Winston H. Hsu:
Location-Independent WiFi Action Recognition via Vision-based Methods. 162-166 - Edip Demirbilek, Jean-Charles Grégoire:
INRS Audiovisual Quality Dataset. 167-171 - Hui Wu, Michele Merler, Rosario Uceda-Sosa, John R. Smith:
Learning to Make Better Mistakes: Semantics-aware Visual Food Recognition. 172-176 - Xin-Shun Xu:
Dictionary Learning Based Hashing for Cross-Modal Retrieval. 177-181 - Taylor Zheng, Prem Seetharaman, Bryan Pardo:
SocialFX: Studying a Crowdsourced Folksonomy of Audio Effects Terms. 182-186 - Ravi Kiran Sarvadevabhatla, Shiv Surya, Srinivas S. S. Kruthiventi, R. Venkatesh Babu:
SwiDeN: Convolutional Neural Networks For Depiction Invariant Object Recognition. 187-191 - Jiawei Liu, Zheng-Jun Zha, Qi Tian, Dong Liu, Ting Yao, Qiang Ling, Tao Mei:
Multi-Scale Triplet CNN for Person Re-Identification. 192-196 - Masoud Mazloom, Robert Rietveld, Stevan Rudinac, Marcel Worring, Willemijn van Dolen:
Multimodal Popularity Prediction of Brand-related Social Media Posts. 197-201 - Nam Do-Hoang Le, Jean-Marc Odobez:
Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media. 202-206 - Zhou Ren, Hailin Jin, Zhe L. Lin, Chen Fang, Alan L. Yuille:
Joint Image-Text Representation by Gaussian Visual-Semantic Embedding. 207-211 - Yazhou Yao, Xian-Sheng Hua, Fumin Shen, Jian Zhang, Zhenmin Tang:
A Domain Robust Approach For Image Dataset Construction. 212-216 - Harsh Jhamtani, Shubham Varma, Midhun Gundapuneni, Siddhartha Kumar Dutta:
A Supervised Approach for Text Illustration. 217-221 - Yang Liu, Yan Liu, Xiang Zhang, Gong Chen, Kejun Zhang:
Learning Music Emotion Primitives via Supervised Dynamic Clustering. 222-226 - Jianfeng He, Bingpeng Ma, Shuhui Wang, Yugui Liu, Qingming Huang:
Cross-modal Retrieval by Real Label Partial Least Squares. 227-231 - Yiru Zhao, Yaoyi Li, Zhiwen Shao, Hongtao Lu:
LSOD: Local Sparse Orthogonal Descriptor for Image Matching. 232-236 - Dekui Ma, Jian Liang, Xiangwei Kong, Ran He:
Frustratingly Easy Cross-Modal Hashing. 237-241 - Joseph P. Robinson, Ming Shao, Yue Wu, Yun Fu:
Families in the Wild (FIW): Large-Scale Kinship Image Database and Benchmarks. 242-246 - Ravi Kiran Sarvadevabhatla, Jogendra Kundu, R. Venkatesh Babu:
Enabling My Robot To Play Pictionary: Recurrent Neural Networks For Sketch Recognition. 247-251 - Payal Bajaj, Sumit Shekhar:
Experience Individualization on Online TV Platforms through Persona-based Account Decomposition. 252-256 - Katsunori Ohnishi, Masatoshi Hidaka, Tatsuya Harada:
Improved Dense Trajectory with Cross Streams. 257-261 - Ye Zhou, Xin Lu, Junping Zhang, James Z. Wang:
Joint Image and Text Representation for Aesthetics Analysis. 262-266 - Laura Cabrera Quiros, Hayley Hung:
Who is where?: Matching People in Video to Wearable Acceleration During Crowded Mingling Events. 267-271 - Yun Gu, Chao Ma, Jie Yang:
Supervised Recurrent Hashing for Large Scale Video Retrieval. 272-276 - Nakamasa Inoue, Koichi Shinoda:
Adaptation of Word Vectors using Tree Structure for Visual Semantics. 277-281 - Min-Kook Choi, Hyun-Gyu Lee, Minseok Song, Sang-Chul Lee:
Adaptive Bitrate Selection for Video Encoding with Reduced Block Artifacts. 282-286 - Miriam Redi, Damon Crockett, Lev Manovich, Simon Osindero:
What Makes Photo Cultures Different? 287-291 - Michal Muszynski, Theodoros Kostoulas, Patrizia Lombardo, Thierry Pun, Guillaume Chanel:
Synchronization among Groups of Spectators for Highlight Detection in Movies. 292-296 - Chao Zhang, Junchi Yan, Changsheng Li, Xiaoguang Rui, Liang Liu, Rongfang Bie:
On Estimating Air Pollution from Photos Using Convolutional Neural Network. 297-301 - Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen, Li He, Jingkuan Song:
Cross-modal Retrieval with Label Completion. 302-306 - Yuhang Wang, Jing Liu, Yong Li, Junjie Yan, Hanqing Lu:
Objectness-aware Semantic Segmentation. 307-311 - Dimitris Chatzopoulos, Pan Hui:
ReadMe: A Real-Time Recommendation System for Mobile Augmented Reality Ecosystems. 312-316 - Yi Tian, Qiuqi Ruan, Gaoyun An, Yun Fu:
Action Recognition Using Local Consistent Group Sparse Coding with Spatio-Temporal Structure. 317-321 - Haiyi Mao, Yue Wu, Jun Li, Yun Fu:
Super Resolution of the Partial Pixelated Images With Deep Convolutional Neural Network. 322-326 - Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino:
Adaptive Visual Feedback Generation for Facial Expression Improvement with Multi-task Deep Neural Networks. 327-331 - Angelos Katharopoulos, Despoina Paschalidou, Christos Diou, Anastasios Delopoulos:
Fast Supervised LDA for Discovering Micro-Events in Large-Scale Video Datasets. 332-336 - Ryan Stables, Brecht De Man, Sean Enderby, Joshua D. Reiss, György Fazekas, Thomas Wilmering:
Semantic Description of Timbral Transformations in Music Production. 337-341 - Di Hu, Xiaoqiang Lu, Xuelong Li:
Multimodal Learning via Exploring Deep Semantic Similarity. 342-346 - Feifei Zhang, Qirong Mao, Ming Dong, Yongzhao Zhan:
Multi-pose Facial Expression Recognition Using Transformed Dirichlet Process. 347-351 - Botong Wu, Yizhou Wang:
Neighborhood-Preserving Hashing for Large-Scale Cross-Modal Search. 352-356 - Zhao Guo, Lianli Gao, Jingkuan Song, Xing Xu, Jie Shao, Heng Tao Shen:
Attention-based LSTM with Semantic Consistency for Videos Captioning. 357-361 - Keiji Yanai, Ryosuke Tanno, Koichi Okamoto:
Efficient Mobile Implementation of A CNN-based Object Recognition System. 362-366 - Jinxin Zheng, Yongtao Wang, Zhi Tang:
Context-aware Geometric Object Reconstruction for Mobile Education. 367-371 - Jen-Chun Lin, Wen-Li Wei, Hsin-Min Wang:
Automatic Music Video Generation Based on Emotion-Oriented Pseudo Song Prediction and Matching. 372-376 - Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, Hsin-Hsi Chen:
Novel Word Embedding and Translation-based Language Modeling for Extractive Speech Summarization. 377-381 - Dae Hoe Kim, Wissam J. Baddar, Yong Man Ro:
Micro-Expression Recognition with Expression-State Constrained Spatio-Temporal Feature Representations. 382-386 - Yuma Sasaka, Takahiro Ogawa, Miki Haseyama:
Multimodal Interest Level Estimation via Variational Bayesian Mixture of Robust CCA. 387-391 - Toan H. Vu, Le Dung, Jia-Ching Wang:
Transportation Mode Detection on Mobile Devices Using Recurrent Nets. 392-396 - Youbao Tang, Xiangqian Wu, Wei Bu:
Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection. 397-401 - Wei-Ta Chu, Yi-Ling Wu:
Deep Correlation Features for Image Style Classification. 402-406 - Ke Yan, Yaowei Wang, Dawei Liang, Tiejun Huang, Yonghong Tian:
CNN vs. SIFT for Image Retrieval: Alternative or Complementary? 407-411 - Xiaoyu Xiong, Maurizio Filippone, Alessandro Vinciarelli:
Looking Good With Flickr Faves: Gaussian Processes for Finding Difference Makers in Personality Impressions. 412-415 - Dejiang Kong, Fei Wu, Siliang Tang, Yueting Zhuang:
Ad Recommendation for Sponsored Search Engine via Composite Long-Short Term Memory. 416-420 - Zhao Liu, Yuwei Wu, Junsong Yuan, Yap-Peng Tan:
Learning a Multi-class Discriminative Dictionary with Nonredundancy Constraints for Visual Classification. 421-425 - Yuwei Wu, Zhe Wang, Junsong Yuan, Ling-Yu Duan:
A Compact Binary Aggregated Descriptor via Dual Selection for Visual Search. 426-430 - Mengfan Tang, Feiping Nie, Ramesh C. Jain:
Capped Lp-Norm Graph Embedding for Photo Clustering. 431-435 - Yi Bin, Yang Yang, Fumin Shen, Xing Xu, Heng Tao Shen:
Bidirectional Long-Short Term Memory for Video Description. 436-440 - Yashaswi Verma, C. V. Jawahar:
A Robust Distance with Correlated Metric Learning for Multi-Instance Multi-Label Data. 441-445 - Yawei Li, Xiaofeng Li, Zhizhong Fu, Wenli Zhong:
Multiview Video Super-Resolution via Information Extraction and Merging. 446-450 - Darshan Santani, Rui Hu, Daniel Gatica-Perez:
InnerView: Learning Place Ambiance from Social Media Images. 451-455 - Jiewei Cao, Zi Huang, Peng Wang, Chao Li, Xiaoshuai Sun, Heng Tao Shen:
Quartet-net Learning for Visual Instance Retrieval. 456-460 - Stavros Arestis-Chartampilas, Nikolaos Gkalelis, Vasileios Mezaris:
AKSDA-MSVM: A GPU-accelerated Multiclass Learning Framework for Multimedia. 461-465 - Chao Sun, Shuaicheng Liu, Taotao Yang, Bing Zeng, Zhengning Wang, Guanghui Liu:
Automatic Reflection Removal using Gradient Intensity and Motion Cues. 466-470 - Xueting Wang, Kensho Hara, Yu Enokibori, Takatsugu Hirayama, Kenji Mase:
Personal Multi-view Viewpoint Recommendation based on Trajectory Distribution of the Viewing Target. 471-475 - Stefano Alletto, Giuseppe Serra, Rita Cucchiara:
Motion Segmentation using Visual and Bio-mechanical Features. 476-480 - Yuan-Shan Lee, Chien-Yao Wang, Seksan Mathulaprangsan, Jia Hao Zhao, Jia-Ching Wang:
Locality-preserving K-SVD Based Joint Dictionary and Classifier Learning for Object Recognition. 481-485 - Huy Phan, Lars Hertel, Marco Maaß, Philipp Koch, Alfred Mertins:
Label Tree Embeddings for Acoustic Scene Classification. 486-490 - Yoann Baveye, Romain Cohendet, Matthieu Perreira Da Silva, Patrick Le Callet:
Deep Learning for Image Memorability Prediction: the Emotional Bias. 491-495 - Zhengzhong Zhou, Jingjin Zhou, Liqing Zhang:
Demand-adaptive Clothing Image Retrieval Using Hybrid Topic Model. 496-500 - Foteini Markatopoulou, Vasileios Mezaris, Ioannis Patras:
Deep Multi-task Learning with Label Correlation Constraint for Video Concept Detection. 501-505 - Raheeb Muzaffar, Evsen Yanmaz, Christian Bettstetter, Andrea Cavallaro:
Application-Layer Rate-Adaptive Multicast Video Streaming over 802.11 for Mobile Devices. 506-510 - Xing Wang, Jie Liang:
Scalable Compression of Deep Neural Networks. 511-515 - Jiahui Yu, Yuning Jiang, Zhangyang Wang, Zhimin Cao, Thomas S. Huang:
UnitBox: An Advanced Object Detection Network. 516-520 - Wenxuan Mou, Hatice Gunes, Ioannis Patras:
Alone versus In-a-group: A Comparative Analysis of Facial Affect Recognition. 521-525 - Meng Wang, Yi Fang:
Local Diffusion Map Signature for Symmetry-aware Non-rigid Shape Correspondence. 526-530 - Francesco Barbieri, Germán Kruszewski, Francesco Ronzano, Horacio Saggion:
How Cosmopolitan Are Emojis?: Exploring Emojis Usage and Meaning over Different Languages with Distributional Semantics. 531-535 - Hanhe Lin, Jeremiah D. Deng, Brendon J. Woodford, Ahmad Shahi:
Online Weighted Clustering for Real-time Abnormal Event Detection in Video Surveillance. 536-540 - Peisong Wang, Jian Cheng:
Accelerating Convolutional Neural Networks for Mobile Applications. 541-545 - Raghvendra Kannao, Durgaprasad Dandi, Swamy Yellapu, Prithwijit Guha:
News Program Detection in TV Broadcast Videos. 546-550 - Wenyi Huang, Dafang He, Xiao Yang, Zihan Zhou, Daniel Kifer, C. Lee Giles:
Detecting Arbitrary Oriented Text in the Wild with a Visual Attention Model. 551-555 - Meng Wang, Yi Fang:
Global Consistent Shape Correspondence for Efficient and Effective Active Shape Models. 556-560 - Pin-Chun Wang, Ching-Ling Fan, Chun-Ying Huang, Kuan-Ta Chen, Cheng-Hsin Hsu:
Towards Ultra-Low-Bitrate Video Conferencing Using Facial Landmarks. 561-565 - Niluthpol Chowdhury Mithun, Rameswar Panda, Amit K. Roy-Chowdhury:
Generating Diverse Image Datasets with Limited Labeling. 566-570 - Shizhe Chen, Qin Jin:
Multi-modal Conditional Attention Fusion for Dimensional Emotion Prediction. 571-575 - Shohei Yamamoto, Tatsuya Harada:
Video Generation Using 3D Convolutional Neural Network. 576-580 - Weiwei Sun, Jiantao Zhou, Ran Lyu, Shuyuan Zhu:
Processing-Aware Privacy-Preserving Photo Sharing over Online Social Networks. 581-585 - Xirong Li, Yujia Huo, Qin Jin, Jieping Xu:
Detecting Violence in Video using Subclasses. 586-590 - Yachuang Feng, Yuan Yuan, Xiaoqiang Lu:
Deep Representation for Abnormal Event Detection in Crowded Scenes. 591-595 - Sanket Khanwalkar, Shonali Balakrishna, Ramesh C. Jain:
Exploration of Large Image Corpuses in Virtual Reality. 596-600 - Alireza Zare, Alireza Aminlou, Miska M. Hannuksela, Moncef Gabbouj:
HEVC-compliant Tile-based Streaming of Panoramic Video for Virtual Reality Applications. 601-605 - Rui Wang, Dong Liang, Wei Zhang, Xiaochun Cao:
MatchDR: Image Correspondence by Leveraging Distance Ratio Constraint. 606-610 - Zhenqiang Ying, Ge Li, Xianghao Zang, Ronggang Wang, Wenmin Wang:
A Novel Shadow-Free Feature Extractor for Real-Time Road Detection. 611-615 - Chongliang Wu, Shangfei Wang, Bowen Pan, Huaping Chen:
Facial Expression Recognition with Deep two-view Support Vector Machine. 616-620 - Richang Hong, Jun He, Hanwang Zhang, Tat-Seng Chua:
Mental Visual Indexing: Towards Fast Video Browsing. 621-625 - Stefan Wilk, Manisha Luthra, Wolfgang Effelsberg:
One Sensor is not Enough: Adapting and Fusing Sensors for the Quality Assessment of User Generated Video. 626-630 - Yuan Liu, Zhongchao Shi:
Boosting Video Description Generation by Explicitly Translating from Frame-Level Captions. 631-634 - Kevin Alfianto Jangtjik, Mei-Chen Yeh, Kai-Lung Hua:
Artist-based Classification via Deep Learning with Multi-scale Weighted Pooling. 635-639 - Lokesh Boominathan, Srinivas S. S. Kruthiventi, R. Venkatesh Babu:
CrowdNet: A Deep Convolutional Network for Dense Crowd Counting. 640-644 - Matteo Bruni, Tiberio Uricchio, Lorenzo Seidenari, Alberto Del Bimbo:
Do Textual Descriptions Help Action Recognition? 645-649 - Xiao Shu, Xiaolin Wu:
Frame Untangling for Unobtrusive Display-Camera Visible Light Communication. 650-654 - Chun-Ming Chang, Cheng-Hsin Hsu, Chih-Fan Hsu, Kuan-Ta Chen:
Performance Measurements of Virtual Reality Systems: Quantifying the Timing and Positioning Accuracy. 655-659 - Cheng-Han Yang, Ying-Miao Kuo, Hung-Kuo Chu:
Synthesizing Emerging Images from Photographs. 660-664 - Oleksandr Murashko, John Thomson, Hugh Leather:
Predicting and Optimizing Image Compression. 665-669 - Jouni Pohjalainen, Fabien Ringeval, Zixing Zhang, Björn W. Schuller:
Spectral and Cepstral Audio Noise Reduction Techniques in Speech Emotion Recognition. 670-674
Video Program
- Jianquan Liu, Shoji Nishimura, Takuya Araki:
AntiLoiter: A Loitering Discovery System for Longtime Videos across Multiple Surveillance Cameras. 675-679 - Yejun Liu, Jia Jia, Jingtian Fu, Yihui Ma, Jie Huang, Zijian Tong:
Magic Mirror: A Virtual Fashion Consultant. 680-683 - Joseph G. Ellis, Svebor Karaman, Hongzhi Li, Hong Bin Shim, Shih-Fu Chang:
Placing Broadcast News Videos in their Social Media Context Using Hashtags. 684-688
Demonstrations
- Tam V. Nguyen, Dorothy Tan, Bilal Mirza, Jose Sepulveda:
MARIM: Mobile Augmented Reality for Interactive Manuals. 689-690 - Shengtao Xiao, Luoqi Liu, Xuecheng Nie, Jiashi Feng, Ashraf A. Kassim, Shuicheng Yan:
A Live Face Swapper. 691-692 - Scott A. Carter, Laurent Denoue, Matthew Cooper:
WorkCache: Salvaging siloed knowledge. 693-694 - Stefan John, Christian Handschigl, Britta Meixner, Michael Granitzer:
Hypervideo Production Using Crowdsourced Youtube Videos. 695-697 - Haojin Yang, Cheng Wang, Christian Bartz, Christoph Meinel:
SceneTextReg: A Real-Time Video OCR System. 698-700 - Xinyu Ou, Si Liu, Xiaochun Cao, Hefei Ling:
Beauty eMakeup: A Deep Makeup Transfer System. 701-702 - Giovanni Taverriti, Stefano Lombini, Lorenzo Seidenari, Marco Bertini, Alberto Del Bimbo:
Real-time Wearable Computer Vision System for Improved Museum Experience. 703-704 - Jun He, Hanwang Zhang, Ling Shen, Richang Hong, Tat-Seng Chua:
An Intention-Aware Interactive System for Mobile Video Browsing. 705-707 - David S. Monaghan, Freddie Honohan, Amin Ahmadi, Troy McDaniel, Ramin Tadayon, Ajay Karpur, Kieran Moran, Noel E. O'Connor, Sethuraman Panchanathan:
A Multimodal Gamified Platform for Real-Time User Feedback in Sports Performance. 708-710 - Ricardo Dias, Daniel Gonçalves, Manuel J. Fonseca:
PlaylistCreator: An Assisted Approach for Playlist Creation. 711-713 - Michael Dorkhom, Alan Woodley, Shlomo Geva, Richi Nayak:
WIMBY: What's in My Backyard? 714-716 - Christoph Korinke, Tim Claudius Stratmann, Tim Laue, Susanne Boll:
SuperSelect: An Interactive Superpixel-Based Segmentation Method for Touch Displays. 717-719 - Maximilien Servajean, Alexis Joly, Dennis E. Shasha, Julien Champ, Esther Pacitti:
ThePlantGame: Actively Training Human Annotators for Domain-specific Crowdsourcing. 720-721 - Marco A. Hudelist, Sabrina Kletz, Klaus Schoeffmann:
A Multi-Video Browser for Endoscopic Videos on Tablets. 722-724 - Marco A. Hudelist, Sabrina Kletz, Klaus Schoeffmann:
A Tablet Annotation Tool for Endoscopic Videos. 725-727 - Benjamin Renoust, Thanh Duc Ngo, Duy-Dinh Le, Shin'ichi Satoh:
News Archive Exploration Combining Face Detection and Tracking with Network Visual Analytics. 728-730 - Wolfgang Hürst, Algernon Ip Vai Ching, Marco A. Hudelist, Manfred Jürgen Primus, Klaus Schoeffmann, Christian Beecks:
A New Tool for Collaborative Video Search via Content-based Retrieval and Visual Inspection. 731-732 - Lorenzo Baraldi, Costantino Grana, Alberto Messina, Rita Cucchiara:
A Browsing and Retrieval System for Broadcast Videos using Scene Detection and Automatic Annotation. 733-734 - Ilya Makarov, Mikhail Tokmakov, Pavel Polyakov, Peter Zyuzin, Maxim Martynov, Oleg Konoplya, George Kuznetsov, Ivan Guschenko-Cheverda, Maxim Uriev, Ivan Mokeev, Olga Gerasimova, Lada Tokmakova, Alexey Kosmachev:
First-Person Shooter Game for Virtual Reality Headset with Advanced Multi-Agent Intelligent System. 735-736 - Yong Xue Eu, Jermyn Tanu, Justin Jieting Law, Muhammad Hanif B. Ghazali, Shuan Siang Tay, Wei Tsang Ooi, Anand Bhojan:
SuperStreamer: Enabling Progressive Content Streaming in a Game Engine. 737-738 - Omar Seddati, Stéphane Dupont, Saïd Mahmoudi:
DeepSketch2Image: Deep Convolutional Neural Networks for Partial Sketch Recognition and Image Retrieval. 739-741 - Santosh Kumar, Sanjay Kumar Singh, Tanima Dutta, Hari Prabhat Gupta:
A Fast Cattle Recognition System using Smart devices. 742-743 - Wolfgang Hürst, Nina Rosa, Jean-Paul van Bommel:
Vibrotactile Experiences for Augmented Reality. 744-745 - Chang Liu, Changhu Wang, Fuchun Sun, Yong Rui:
Image2Text: A Multimodal Image Captioner. 746-748 - Yifan Xiong, Jia Chen, Qin Jin, Chao Zhang:
History Rhyme: Searching Historic Events by Multimedia Knowledge. 749-751 - Toru Takahashi, Yuta Kudo, Rui Ishiyama:
Intelli-Wrench: Smart Navigation Tool for Mechanical Assembly and Maintenance. 752-753 - Zhengzhong Zhou, Yifei Xu, Jingjin Zhou, Liqing Zhang:
Interactive Image Search for Clothing Recommendation. 754-756 - Yehao Li, Ting Yao, Rui Hu, Tao Mei, Yong Rui:
Video ChatBot: Triggering Live Social Interactions by Automatic Video Commenting. 757-758 - Aleksandr Farseev, Ivan Samborskii, Tat-Seng Chua:
bBridge: A Big Data Platform for Social Multimedia Analytics. 759-761 - Karim Jahed, Sanaa Sharafeddine, Abdallah Moussawi, Abbas Abou Daya, Hassan Dbouk, Saadallah Kassir, Zaher Dawy, Preethi Valsalan, Wael Chérif, Fethi Filali:
Scalable Multimedia Streaming in Wireless Networks with Device-to-Device Cooperation. 762-764 - Syed Obaid Amin, Qingji Zheng, Ravishankar Ravindran, Guoqiang Wang:
Leveraging ICN for Secure Content Distribution in IP Networks. 765-767
Art Exhibition
- Lucas Evers, Frank Nack:
Data Aesthetics: The Ethics and Aesthetics of Big Data Gathering seen from the Artists Eye. 779-780
Topics in Multimedia I
- Hanwang Zhang, Meng Wang, Richang Hong, Tat-Seng Chua:
Play and Rewind: Optimizing Binary Representations of Videos by Self-Supervised Temporal Hashing. 781-790 - Zuxuan Wu, Yu-Gang Jiang, Xi Wang, Hao Ye, Xiangyang Xue:
Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification. 791-800 - Yi Zhu, Alan Hanjalic, Judith A. Redi:
QoE Prediction for Enriched Assessment of Individual Video Viewing Experience. 801-810 - Junxuan Chen, Baigui Sun, Hao Li, Hongtao Lu, Xian-Sheng Hua:
Deep CTR Prediction in Display Advertising. 811-820
Analysis & Search
- Hongzhi Li, Joseph G. Ellis, Heng Ji, Shih-Fu Chang:
Event Specific Multimodal Pattern Mining for Knowledge Base Construction. 821-830 - Jingkuan Song, Lianli Gao, Mihai Marian Puscas, Feiping Nie, Fumin Shen, Nicu Sebe:
Joint Graph Learning and Video Segmentation via Multiple Cues and Topology Calibration. 831-840 - Qianqian Xu, Jiechao Xiong, Xiaochun Cao, Yuan Yao:
Parsimonious Mixed-Effects HodgeRank for Crowdsourced Preference Aggregation. 841-850 - Ognjen Arandjelovic:
Weighted Linear Fusion of Multimodal Data: A Reasonable Baseline? 851-857
Video Analysis & Streaming
- Xi Chen, Lei Rao, Qiao Xiang, Xue Liu, Fan Bai:
DRIVING: Distributed Scheduling for Video Streaming in Vehicular Wi-Fi Systems. 858-867 - Guanyu Gao, Yonggang Wen, Cédric Westphal:
Dynamic Resource Provisioning with QoS Guarantee for Video Transcoding in Online Video Sharing Service. 868-877 - Xinxin Zuo, Sen Wang, Jiangbin Zheng, Ruigang Yang:
High-speed Depth Stream Generation from a Hybrid Camera. 878-887 - Bayan Taani, Roger Zimmermann:
Spatio-Temporal Analysis of Bandwidth Maps for Geo-Predictive Video Streaming in Mobile Environments. 888-897
Topics in Multimedia II
- Jingyuan Chen, Xuemeng Song, Liqiang Nie, Xiang Wang, Hanwang Zhang, Tat-Seng Chua:
Micro Tells Macro: Predicting the Popularity of Micro-Videos via a Transductive Model. 898-907 - Vinay Bettadapura, Caroline Pantofaru, Irfan A. Essa:
Leveraging Contextual Cues for Generating Basketball Highlights. 908-917 - Yunhua Deng, Yusen Li, Xueyan Tang, Wentong Cai:
Server Allocation for Multiplayer Cloud Gaming. 918-927 - Yehao Li, Ting Yao, Tao Mei, Hongyang Chao, Yong Rui:
Share-and-Chat: Achieving Human-Level Video Commenting by Search and Multi-View Embedding. 928-937
Brave News Topic
- Mengfan Tang, Siripen Pongpaichet, Ramesh C. Jain:
Research Challenges in Developing Multimedia Systems for Managing Emergency Situations. 938-947 - Andrea Castelletti, Roman Fedorov, Piero Fraternali, Matteo Giuliani:
Multimedia on the Mountaintop: Using Public Snow Images to Improve Water Systems Operation. 948-957 - Alexis Joly, Hervé Goëau, Julien Champ, Samuel Dufour-Kowalski, Henning Müller, Pierre Bonnet:
Crowdsourcing Biodiversity Monitoring: How Sharing your Photo Stream can Sustain our Planet. 958-967 - Michael Riegler, Mathias Lux, Carsten Griwodz, Concetto Spampinato, Thomas de Lange, Sigrun Losada Eskeland, Konstantin Pogorelov, Wallapak Tavanapong, Peter Thelin Schmidt, Cathal Gurrin, Dag Johansen, Håvard D. Johansen, Pål Halvorsen:
Multimedia and Medicine: Teammates for Better Disease Detection and Survival. 968-977
Deep Learning
- Xiaodong Yang, Pavlo Molchanov, Jan Kautz:
Multilayer and Multimodal Fusion of Deep Neural Networks for Video Classification. 978-987 - Cheng Wang, Haojin Yang, Christian Bartz, Christoph Meinel:
Image Captioning with Deep Bidirectional LSTMs. 988-997 - Brendan Jou, Shih-Fu Chang:
Deep Cross Residual Learning for Multitask Visual Recognition. 998-1007 - Quanzeng You, Liangliang Cao, Hailin Jin, Jiebo Luo:
Robust Visual-Textual Sentiment Analysis: When Attention meets Tree-structured Recursive Neural Networks. 1008-1017
Events and Context
- Tao Chen, Xiangnan He, Min-Yen Kan:
Context-aware Image Tweet Modelling and Recommendation. 1018-1027 - Jia Chen, Qin Jin, Yifan Xiong:
Semantic Image Profiling for Historic Events: Linking Images to Phrases. 1028-1037 - Anurag Kumar, Bhiksha Raj:
Audio Event Detection using Weakly Labeled Data. 1038-1047 - Jen-Yu Liu, Yi-Hsuan Yang:
Event Localization in Music Auto-tagging. 1048-1057
Multimedia Grand Challenge
- Hao Ye, Weiyuan Shao, Hong Wang, Jianqi Ma, Li Wang, Yingbin Zheng, Xiangyang Xue:
Face Recognition via Active Annotation and Learning. 1058-1062 - Yue Wu, Jun Li, Yu Kong, Yun Fu:
Deep Convolutional Neural Network with Independent Softmax for Large Scale Face Recognition. 1063-1067 - Jianshu Li, Jian Zhao, Fang Zhao, Hao Liu, Jing Li, Shengmei Shen, Jiashi Feng, Terence Sim:
Robust Face Recognition with Deep Multi-View Representation Learning. 1068-1072 - Rakshith Shetty, Jorma Laaksonen:
Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation. 1073-1076 - Benjamin Bischke, Damian Borth, Christian Schulze, Andreas Dengel:
Contextual Enrichment of Remote-Sensed Events with Social Media Streams. 1077-1081 - Jianfeng Dong, Xirong Li, Weiyu Lan, Yujia Huo, Cees G. M. Snoek:
Early Embedding and Late Reranking for Video Captioning. 1082-1086 - Qin Jin, Jia Chen, Shizhe Chen, Yifan Xiong, Alexander G. Hauptmann:
Describing Videos using Multi-modal Fusion. 1087-1091 - Vasili Ramanishka, Abir Das, Dong Huk Park, Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Kate Saenko:
Multimodal Video Description. 1092-1096 - Jingya Wang, Mohammed Korayem, Saúl A. Blanco, David J. Crandall:
Tracking Natural Events through Social Media and Computer Vision. 1097-1101 - Yogesh Singh Rawat, Mohan S. Kankanhalli:
ConTagNet: Exploiting User Context for Image Tag Recommendation. 1102-1106 - Xiangyang Li, Xinhang Song, Luis Herranz, Yaohui Zhu, Shuqiang Jiang:
Image Captioning with both Object and Scene Information. 1107-1110 - Tushar Karayil, Philipp Blandfort, Damian Borth, Andreas Dengel:
Generating Affective Captions using Concept And Syntax Transition Networks. 1111-1115
Keynote 2
- Jarke J. van Wijk:
Visual Analytics for Multimedia: Challenges and Opportunities. 1116
Topics in Multimedia III
- Xiaobai Liu:
V3I-STAL: Visual Vehicle-to-Vehicle Interaction via Simultaneous Tracking and Localization. 1117-1126 - Marco De Nadai, Radu-Laurentiu Vieriu, Gloria Zen, Stefan Dragicevic, Nikhil Naik, Michele Caraviello, César Augusto Hidalgo, Nicu Sebe, Bruno Lepri:
Are Safer Looking Neighborhoods More Lively?: A Multimodal Investigation into Urban Life. 1127-1135 - Rossano Schifanella, Paloma de Juan, Joel R. Tetreault, Liangliang Cao:
Detecting Sarcasm in Multimodal Social Platforms. 1136-1145 - Cristiano Carvalheiro, Rui Nóbrega, Hugo da Silva, Rui Rodrigues:
User Redirection and Direct Haptics in Virtual Environments. 1146-1155
Open Source Software Competition
- Chengxi Ye, Chen Zhao, Yezhou Yang, Cornelia Fermüller, Yiannis Aloimonos:
LightNet: A Versatile, Standalone Matlab-based Environment for Deep Learning. 1156-1159
- Guanyu Gao, Yonggang Wen:
Morph: A Fast and Scalable Cloud Transcoding System. 1160-1163
- Chun-Ying Huang, Ching-Ling Fan, Chih-Fan Hsu, Hsin-Yu Chang, Tsung-Han Tsai, Kuan-Ta Chen, Cheng-Hsin Hsu:
Smart Beholder: An Extensible Smart Lens Platform. 1164-1168
- Nikolaos Kardaris, Isidoros Rodomagoulakis, Vassilis Pitsikalis, Antonis Arvanitakis, Petros Maragos:
A Platform for Building New Human-Computer Interface Systems that Support Online Automatic Recognition of Audio-Gestural Commands. 1169-1173
- Sebastian Böck, Filip Korzeniowski, Jan Schlüter, Florian Krebs, Gerhard Widmer:
madmom: A New Python Audio and Music Signal Processing Library. 1174-1178
- Marko Viitanen, Ari Koivula, Ari Lemmetti, Arttu Ylä-Outinen, Jarno Vanne, Timo D. Hämäläinen:
Kvazaar: Open-Source HEVC/H.265 Encoder. 1179-1182
- Luca Rossetto, Ivan Giangreco, Claudiu Tanase, Heiko Schuldt:
vitrivr: A Flexible Retrieval Stack Supporting Multiple Query Modes for Searching in Multimedia Collections. 1183-1186
- Luis López-Fernández, Miguel Paris Diaz, Santiago Carot, Boni García, Micael Gallego, Francisco Gortázar, Raul Benitez Mejias, Jose A. Santos, David Fernández, Radu Tom Vlad, Iván Gracia, Francisco Javier Lopez:
Kurento: The WebRTC Modular Media Server. 1187-1191
- Tim Lenertz, Gauthier Lafruit:
Modular Parallelization Framework for Multi-Stream Video Processing. 1192-1196
- Kristian Skarseth, Henrik Bjørlo, Pål Halvorsen, Michael Riegler, Carsten Griwodz:
OpenVQ: A Video Quality Assessment Toolkit. 1197-1200
- Seyyed Salar Latifi Oskouei, Hossein Golestani, Matin Hashemi, Soheil Ghiasi:
CNNdroid: GPU-Accelerated Execution of Trained Deep Convolutional Neural Networks on Android. 1201-1205
- Bingchen Gong, Brendan Jou, Felix X. Yu, Shih-Fu Chang:
Tamp: A Library for Compact Deep Neural Networks with Structured Matrices. 1206-1209
- Christoph Lassner, Daniel Kappler, Martin Kiefel, Peter V. Gehler:
Barrista: Caffe Well-Served. 1210-1213
- Olivier Bélanger:
Pyo, the Python DSP toolbox. 1214-1217
- Julian F. P. Kooij:
SenseCap: Synchronized Data Collection with Microsoft Kinect2 and LeapMotion. 1218-1221
- Rufael Mekuria, Pablo César:
MP3DG-PCC, Open Source Software Framework for Implementation and Evaluation of Point Cloud Compression. 1222-1226
Learning & Hashing
- Keze Wang, Shengfu Zhai, Hui Cheng, Xiaodan Liang, Liang Lin:
Human Pose Estimation from Depth Images via Inference Embedded Multi-task Learning. 1227-1236
- Huei-Fang Yang, Kevin Lin, Chu-Song Chen:
Cross-batch Reference Learning for Deep Classification and Retrieval. 1237-1246
- Qi Dai, Jianguo Li, Jingdong Wang, Yu-Gang Jiang:
Binary Optimized Hashing. 1247-1256
- Min Wang, Wengang Zhou, Qi Tian, Zheng-Jun Zha, Houqiang Li:
Linear Distance Preserving Pseudo-Supervised and Unsupervised Hashing. 1257-1266
Transport & Experience
- Maarten Wijnants, Gustavo Rovelo, Peter Quax, Wim Lamotte:
A Pragmatically Designed Adaptive and Web-compliant Object-based Video Streaming Methodology: Implementation and Subjective Evaluation. 1267-1276
- Chao Chen, Mohammad Izadi, Anil C. Kokaram:
A Perceptual Quality Metric for Videos Distorted by Spatially Correlated Noise. 1277-1285
- Yang Yang, Yadan Luo, Weilun Chen, Fumin Shen, Jie Shao, Heng Tao Shen:
Zero-Shot Hashing via Transferring Supervised Knowledge. 1286-1295
- Abdelhak Bentaleb, Ali C. Begen, Roger Zimmermann:
SDNDASH: Improving QoE of HTTP Adaptive Streaming Using Software Defined Networking. 1296-1305
Topics in Multimedia IV
- Sreyasee Das Bhattacharjee, Junsong Yuan, Weixiang Hong, Xiang Ruan:
Query Adaptive Instance Search using Object Sketches. 1306-1315
- EunJin Kim, Hyeon-Jeong Suk:
Key Color Generation for Affective Multimedia Production: An Initial Method and Its Application. 1316-1325
- Dan Xu, Xavier Alameda-Pineda, Jingkuan Song, Elisa Ricci, Nicu Sebe:
Academic Coupled Dictionary Learning for Sketch-based Image Retrieval. 1326-1335
- Bo Wu, Wen-Huang Cheng, Yongdong Zhang, Tao Mei:
Time Matters: Multi-scale Temporalization of Social Media Popularity. 1336-1344
Analysis & Middleware
- Xu Shen, Xinmei Tian, Anfeng He, Shaoyan Sun, Dacheng Tao:
Transform-Invariant Convolutional Neural Networks for Image Classification and Search. 1345-1354
- Liang Zhang, Bingpeng Ma, Guorong Li, Qingming Huang, Qi Tian:
PL-ranking: A Novel Ranking Method for Cross-Modal Retrieval. 1355-1364
- Zhi-Qi Cheng, Yang Liu, Xiao Wu, Xian-Sheng Hua:
Video eCommerce: Towards Online Video Advertising. 1365-1374
- Chao Wu, Jia Jia, Wenwu Zhu, Xu Chen, Bowen Yang, Yaoxue Zhang:
Affective Contextual Mobile Recommender System. 1375-1384
Emotions, People and Faces
- Sicheng Zhao, Hongxun Yao, Yue Gao, Rongrong Ji, Wenlong Xie, Xiaolei Jiang, Tat-Seng Chua:
Predicting Personalized Emotion Perceptions of Social Images. 1385-1394
- Michael Xuelin Huang, Jiajia Li, Grace Ngai, Hong Va Leong:
StressClick: Sensing Stress from Gaze-Click Patterns. 1395-1404
- Jing Huo, Yang Gao, Yinghuan Shi, Wanqi Yang, Hujun Yin:
Ensemble of Sparse Cross-Modal Metrics for Heterogeneous Face Recognition. 1405-1414
- Jianglong Zhang, Liqiang Nie, Xiang Wang, Xiangnan He, Xianglin Huang, Tat-Seng Chua:
Shorter-is-Better: Venue Category Estimation from Micro-Video. 1415-1424
Doctoral Symposium
- Rajiv Ratn Shah:
Multimodal-based Multimedia Analysis, Retrieval, and Services in Support of Social Media Applications. 1425-1429
- Mengfan Tang:
Geospatial Multimedia Data for Situation Recognition. 1430-1434
- Sicheng Zhao:
Image Emotion Computing. 1435-1439
- Ana Garcia del Molino:
First Person View Video Summarization Subject to the User Needs. 1440-1444
- Quanzeng You:
Sentiment and Emotion Analysis for Social Multimedia: Methodologies and Applications. 1445-1449
- Charles D. Estes:
n-Dimensional Display Interface. 1450-1453
- Jingyuan Chen:
Multi-Modal Learning: Study on A Large-Scale Micro-Video Data Collection. 1454-1458
- Pascal Mettes:
Weakly-Supervised Recognition, Localization, and Explanation of Visual Entities. 1459-1463
- Yi-Jie Lu:
Zero-Example Multimedia Event Detection and Recounting with Unsupervised Evidence Localization. 1464-1468
Tutorials
- Xavier Alameda-Pineda, Timothy M. Hospedales, Elisa Ricci, Nicu Sebe, Xiaogang Wang:
Emerging Topics in Learning from Noisy and Missing Data. 1469-1470
- Rossano Schifanella, Bart Thomee:
The Lifecycle of Geotagged Multimedia Data. 1471-1472
- Wendy Ann Mansilla, Andrew Perkis:
Technology & Art in Stimulating Creative Placemaking in Public-Use Spaces. 1473-1474
- Vivek K. Singh, Siripen Pongpaichet, Ramesh C. Jain:
Situation Recognition from Multimodal Data. 1475-1476
- Maja Pantic, Vanessa Evers, Marc Peter Deisenroth, Luis Merino, Björn W. Schuller:
Social and Affective Robotics Tutorial. 1477-1478
- Gerald Friedland, Symeon Papadopoulos, Julia Bernd, Yiannis Kompatsiaris:
Multimedia Privacy. 1479-1480
Workshops
- Teresa Chambel, Rene Kaiser, Omar Niamut, Wei Tsang Ooi, Judith A. Redi:
AltMM 2016: 1st International Workshop on Multimedia Alternate Realities. 1481-1482
- Michel F. Valstar, Jonathan Gratch, Björn W. Schuller, Fabien Ringeval, Roddy Cowie, Maja Pantic:
Summary for AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge. 1483-1484
- Bart Thomee, Damian Borth, Julia Bernd:
Multimedia COMMONS Workshop 2016 (MMCommons 2016): Datasets, Evaluation, and Reproducibility. 1485-1486
- Cathal Gurrin, Xavier Giró-i-Nieto, Petia Radeva, Mariella Dimiccoli, Håvard D. Johansen, Hideo Joho, Vivek K. Singh:
LTA 2016: The First Workshop on Lifelogging Tools and Applications. 1487-1488
- Stavroula G. Mougiakakou, Giovanni Maria Farinella, Keiji Yanai:
Overview of the ACM MultiMedia 2016 International Workshop on Multimedia Assisted Dietary Management. 1489-1490
- Susanne Boll, Kiyoharu Aizawa, Alexia Briasouli, Cathal Gurrin, Laleh Jalali, Jochen Meyer:
Multimedia for personal health and health care. 1491-1492
- Marie-Francine Moens, Katerina Pastra, Kate Saenko, Tinne Tuytelaars:
Vision and Language Integration Meets Multimedia Fusion: Proceedings of ACM Multimedia 2016 Workshop. 1493
- Mohamed Chetouani, Jeffrey F. Cohn, Albert Ali Salah:
Seventh International Workshop on Human Behavior Understanding (HBU 2016). 1494-1495