default search action
22. MMM 2017: Reykjavik, Iceland
- Laurent Amsaleg, Gylfi Þór Guðmundsson, Cathal Gurrin, Björn Þór Jónsson, Shin'ichi Satoh:
MultiMedia Modeling - 23rd International Conference, MMM 2017, Reykjavik, Iceland, January 4-6, 2017, Proceedings, Part I. Lecture Notes in Computer Science 10132, Springer 2017, ISBN 978-3-319-51810-7
Full Papers Accepted for Oral Presentation
- Song Wang, Ruimin Hu, Shihong Chen, Xiaochen Wang, Yuhong Yang, Weiping Tu, Bo Peng:
3D Sound Field Reproduction at Non Central Point for NHK 22.2 System. 3-14 - Falk Böschen, Ansgar Scherp:
A Comparison of Approaches for Automated Text Extraction from Scholarly Figures. 15-27 - Yuanying Dai, Dong Liu, Feng Wu:
A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding. 28-39 - Kojiro Fujii, Kazuaki Nakamura, Naoko Nitta, Noboru Babaguchi:
A Framework of Privacy-Preserving Image Recognition for Image-Based Information Services. 40-52 - Jun Yu:
A Real-Time 3D Visual Singing Synthesis: From Appearance to Internal Articulators. 53-64 - Sheng Chen, Bin Liu, Chang Wen Chen:
A Structural Coupled-Layer Tracking Method Based on Correlation Filters. 65-76 - David Antón, Gregorij Kurillo, Allen Y. Yang, Ruzena Bajcsy:
Augmented Telemedicine Platform for Real-Time Remote Medical Consultation. 77-89 - Qi-Chong Tian, Laurent D. Cohen:
Color Consistency for Photo Collections Without Gamut Problems. 90-101 - Nikiforos Pittaras, Foteini Markatopoulou, Vasileios Mezaris, Ioannis Patras:
Comparison of Fine-Tuning and Extension Strategies for Deep Convolutional Neural Networks. 102-114 - Huangjie Zheng, Jiangchao Yao, Ya Zhang:
Describing Geographical Characteristics with Social Images. 115-126 - Wu Feng, Dong Liu:
Fine-Grained Image Recognition from Click-Through Logs Using Deep Siamese Network. 127-138 - Lixuan Yang, Helena Rodriguez, Michel Crucianu, Marin Ferecatu:
Fully Convolutional Network with Superpixel Parsing for Fashion Web Image Segmentation. 139-151 - Feng Su, Hao Xue:
Graph-Based Multimodal Music Mood Classification in Discriminative Latent Space. 152-163 - Zhiwei Wang, Xin Yang:
Joint Face Detection and Initialization for Face Alignment. 164-175 - Shanshan Ai, Caiyan Jia, Zhineng Chen:
Large-Scale Product Classification via Spatial Attention Based CNN Learning and Multi-class Regression. 176-188 - Wissam J. Baddar, Dae Hoe Kim, Yong Man Ro:
Learning Features Robust to Image Variations with Siamese Networks for Facial Expression Recognition. 189-200 - Guan-Qun Yang, Xin-Shun Xu, Shanqing Guo, Xiaolin Wang:
M3LH: Multi-modal Multi-label Hashing for Large Scale Data Search. 201-213 - Shyi-Chyi Cheng, Jui-Yuan Su, Jing-Min Chen, Jun-Wei Hsieh:
Model-Based 3D Scene Reconstruction Using a Moving RGB-D Camera. 214-225 - Mark Claypool, Ragnhild Eg, Kjetil Raaen:
Modeling User Performance for Moving Target Selection with a Delayed Mouse. 226-237 - Shuangqun Li, Wu Liu, Huadong Ma, Huiyuan Fu:
Multi-attribute Based Fire Detection in Diverse Surveillance Videos. 238-250 - Giorgos Kordopatis-Zilos, Symeon Papadopoulos, Ioannis Patras, Yiannis Kompatsiaris:
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers. 251-263 - Xinchun Qian, Wengang Zhou, Houqiang Li:
No-Reference Image Quality Assessment Based on Internal Generative Mechanism. 264-276 - Yu Liu, Yanming Guo, Michael S. Lew:
On the Exploration of Convolutional Fusion Networks for Visual Recognition. 277-289 - Tzu-Yi Hung, Sriram Vaikundam, Vidhya Natarajan, Liang-Tien Chia:
Phase Fourier Reconstruction for Anomaly Detection on Metal Surface Using Salient Irregularity. 290-302 - Fabian Lorenzo Dayrit, Ryosuke Kimura, Yuta Nakashima, Ambrosio Blanco, Hiroshi Kawasaki, Katsushi Ikeuchi, Tomokazu Sato, Naokazu Yokoya:
ReMagicMirror: Action Learning Using Human Reenactment with the Mirror Metaphor. 303-315 - Yi Rong, Shengwu Xiong, Yongsheng Gao:
Robust Image Classification via Low-Rank Double Dictionary Learning. 316-328 - Ruo-Ze Liu, Xin Sun, Hailiang Xu, Palaiahnakote Shivakumara, Feng Su, Tong Lu, Ruoyu Yang:
Robust Scene Text Detection for Multi-script Languages Using Deep Learning. 329-340 - Jianqiang Xu, Yao Lu:
Robust Visual Tracking Based on Multi-channel Compressive Features. 341-352 - Ze Yang, Kai Zhang, Yudong Liang, Jinjun Wang:
Single Image Super-Resolution with a Parameter Economic Residual-Like Convolutional Neural Network. 353-364 - Ionut C. Duta, Bogdan Ionescu, Kiyoharu Aizawa, Nicu Sebe:
Spatio-Temporal VLAD Encoding for Human Action Recognition in Videos. 365-378 - Chengdong Liu, Zhouhui Lian, Yingmin Tang, Jianguo Xiao:
Structure-Aware Image Resizing for Chinese Characters. 379-390 - Lu Feng, Xin-Shun Xu, Shanqing Guo, Xiaolin Wang:
Supervised Class Graph Preserving Hashing for Image Retrieval and Classification. 391-403 - Yiyang Zhou, Wenhai Wang, Wenjie Guan, Yirui Wu, Heng Lai, Tong Lu, Min Cai:
Visual Robotic Object Grasping Through Combining RGB-D Data and 3D Meshes. 404-415 - Yu Liu, Yanming Guo, Michael S. Lew:
What Convnets Make for Image Captioning? 416-428 - Ryo Kawahata, Atsushi Shimada, Rin-Ichiro Taniguchi:
What are Good Design Gestures? - -Towards User- and Machine-friendly Interface-. 429-440
SS1: Social Media Retrieval and Recommendation
- Jie Liu, Sheng Tang, Yu Li:
Collaborative Dictionary Learning and Soft Assignment for Sparse Coding of Image Features. 443-451 - Yuting Su, Huijing Wang:
LingoSent - A Platform for Linguistic Aware Sentiment Analysis for Social Media Messages. 452-464 - Liang Xie, Lei Zhu, Zhiyong Cheng:
Multi-Task Multi-modal Semantic Hashing for Web Image Retrieval with Limited Supervision. 465-477 - Yu Bao, Haojie Li:
Object-Based Aggregation of Deep Features for Image Retrieval. 478-489 - Shun Liu, Hongtao Xie, Chuan Zhou, Zhendong Mao:
Uyghur Language Text Detection in Complex Background Images Using Enhanced MSERs. 490-500
SS2: Modeling Multimedia Behaviors
- Chen Yan, Peng Wang, Haitian Pang, Lifeng Sun, Shiqiang Yang:
CELoF: WiFi Dwell Time Estimation in Free Environment. 503-514 - Liancheng Xiang, Jitao Sang, Changsheng Xu:
Demographic Attribute Inference from Social Multimedia Behaviors: A Cross-OSN Approach. 515-526 - Zhengyuan Pang, Lifeng Sun, Zhi Wang, Yuan Xie, Shiqiang Yang:
Understanding Performance of Edge Prefetching. 527-539 - Zaher Hinbarji, Rami Albatal, Cathal Gurrin:
User Identification by Observing Interactions with GUIs. 540-549 - Yuhua Jia, Liang Bai, Peng Wang, Jinlin Guo, Yuxiang Xie, Tianyuan Yu:
Utilizing Locality-Sensitive Hash Learning for Cross-Media Retrieval. 550-561
SS3: Multimedia Computing for Intelligent Life
- Chung-Wei Yeh, Tse-Yu Pan, Min-Chun Hu:
A Sensor-Based Official Basketball Referee Signals Recognition System Using Deep Belief Networks. 565-575 - Ling Wang, Yu Bao, Haojie Li, Xin Fan, Zhongxuan Luo:
Compact CNN Based Video Representation for Efficient Video Copy Detection. 576-587 - Jingjing Chen, Lei Pang, Chong-Wah Ngo:
Cross-Modal Recipe Retrieval: How to Cook this Dish? 588-600 - Wu Liu, Jiangyu Liu, Xiaoyan Gu, Kun Liu, Xiaowei Dai, Huadong Ma:
Deep Learning Based Intelligent Basketball Arena with Energy Image. 601-613 - Hong Liu, Jun Wang, Xiangdong Wang, Yueliang Qian:
Efficient Multi-scale Plane Extraction Based RGBD Video Segmentation. 614-625 - Kai-Lung Hua, Irawati Nurmala Sari, Mei-Chen Yeh:
Human Pose Tracking Using Online Latent Structured Support Vector Machine. 626-637 - Shiyu Zhang, Bailan Feng, Zhineng Chen, Xiangsheng Huang:
Micro-Expression Recognition by Aggregating Local Spatio-Temporal Patterns. 638-648 - Qing Wang, Jiansu Pu, Yuanfang Guo, Zheng Hu, Hui Tian:
egoPortray: Visual Exploration of Mobile Communication Signature from Egocentric Network Perspective. 649-661 - Jordi Sanchez-Riera, Jun-Ming Lin, Kai-Lung Hua, Wen-Huang Cheng, Arvin Wen Tsui:
i-Stylist: Finding the Right Dress Through Your Social Networks. 662-673
SS4: Multimedia and Multimodal Interaction for Health and Basic Care Applications
- Yasuhiro Shibasaki, Kotaro Funakoshi, Koichi Shinoda:
Boredom Recognition Based on Users' Spontaneous Behaviors in Multiparty Human-Robot Interactions. 677-689 - Karim Aderghal, Manuel Boissenin, Jenny Benois-Pineau, Gwénaëlle Catheline, Karim Afdel:
Classification of sMRI for AD Diagnosis with Convolutional Neuronal Networks: A Pilot 2-D+ \epsilon Study on ADNI. 690-701 - Stefan Petscharnig, Klaus Schöffmann:
Deep Learning for Shot Classification in Gynecologic Surgery Videos. 702-713 - Georgios Meditskos, Stefanos Vrochidis, Ioannis Kompatsiaris:
Description Logics and Rules for Multimodal Situational Awareness in Healthcare. 714-725 - Jun Yu:
Speech Synchronized Tongue Animation by Combining Physiology Modeling and X-ray Image Fitting. 726-737
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.