default search action
MMM 2024, Amsterdam, The Netherlands - Part II
- Stevan Rudinac, Alan Hanjalic, Cynthia C. S. Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata:
MultiMedia Modeling - 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 - February 2, 2024, Proceedings, Part II. Lecture Notes in Computer Science 14555, Springer 2024, ISBN 978-3-031-53307-5 - Yuxuan Zhang, Huibin Tan, Long Lan, Xiao Teng, Jing Ren, Yongjun Zhang:
Self-distillation Enhanced Vertical Wavelet Spatial Attention for Person Re-identification. 1-13 - Tao Zhang, Ju Zhang, Yicheng Zou, Yu Zhang:
High Capacity Reversible Data Hiding in Encrypted Images Based on Pixel Value Preprocessing and Block Classification. 14-27 - Xin Dong, Rui Wang, Sanyi Zhang, Lihua Jing:
HPattack: An Effective Adversarial Attack for Human Parsing. 28-41 - Fahong Wang, Zhao Liu, Jie Lei, Zeyu Zou, Wentao Han, Juan Xu, Xuan Li, Zunlei Feng, Ronghua Liang:
Dynamic-Static Graph Convolutional Network for Video-Based Facial Expression Recognition. 42-55 - Kezhou Chen, Shuo Wang, Yanbin Hao:
Hierarchical Supervised Contrastive Learning for Multimodal Sentiment Analysis. 56-69 - Xi Gu, Yuanyuan Xu, Kun Zhu:
Semantic Importance-Based Deep Image Compression Using a Generative Approach. 70-81 - Wenbin Gan, Minh-Son Dao, Koji Zettsu:
Drive-CLIP: Cross-Modal Contrastive Safety-Critical Driving Scenario Representation Learning and Zero-Shot Driving Risk Analysis. 82-97 - Peide Zhu, Zhen Wang, Manabu Okumura, Jie Yang:
MRHF: Multi-stage Retrieval and Hierarchical Fusion for Textbook Question Answering. 98-111 - Tongwei Ma, Lilian Zhang, Bo Sun, Chen Fan:
Multi-scale Decomposition Dehazing with Polarimetric Vision. 112-126 - Qianqian Jin, Fazhi He, Wei Tang:
CLF-Net: A Few-Shot Cross-Language Font Generation Method. 127-140 - Yixing Lu, Zhaoxin Fan, Min Xu:
Multi-dimensional Fusion and Consistency for Semi-supervised Medical Image Segmentation. 141-155 - Sze An Peter Tan, Guangyu Gao, Jia Zhao:
Audio-Visual Segmentation by Leveraging Multi-scaled Features Learning. 156-169 - Wei Liu, Jun Li, Zhijian Wu, Jianhua Xu, Bo Yang:
Multi-head Hashing with Orthogonal Decomposition for Cross-modal Retrieval. 170-183 - Guangrui Liu, Wei Wu:
Fusion Boundary and Gradient Enhancement Network for Camouflage Object Detection. 184-198 - Carlo Bretti, Pascal Mettes, Hendrik Vincent Koops, Daan Odijk, Nanne van Noord:
Find the Cliffhanger: Multi-modal Trailerness in Soap Operas. 199-212 - Ruichen Li, Lei Wu, Pei Dong, Minggang He:
SM-GAN: Single-Stage and Multi-object Text Guided Image Editing. 213-226 - Shilong Yu, Chenhui Yang:
MAVAR-SE: Multi-scale Audio-Visual Association Representation Network for End-to-End Speaker Extraction. 227-238 - Gia-Bao Le, Van-Tien Nguyen, Trung-Nghia Le, Minh-Triet Tran:
NearbyPatchCL: Leveraging Nearby Patches for Self-supervised Patch-Level Multi-class Classification in Whole-Slide Images. 239-252 - Songkang Dai, Song-Lu Chen, Qi Liu, Chao Zhu, Yan Liu, Feng Chen, Xu-Cheng Yin:
Improving Small License Plate Detection with Bidirectional Vehicle-Plate Relation. 253-266 - Linyi Qian, Qian Huang, Yulin Chen, Junzhou Chen:
A Purified Stacking Ensemble Framework for Cytology Classification. 267-280 - Jing Zhang, Wei Wu:
SEAS-Net: Segment Exchange Augmentation for Semi-supervised Brain Tumor Segmentation. 281-295 - Lihua Du, Wei Wu, Chen Li:
Super-Resolution-Assisted Feature Refined Extraction for Small Objects in Remote Sensing Images. 296-309 - Zhenlei Cui, Zhenhua Tang, Jianze Li, Kai Chen:
Lightweight Image Captioning Model Based on Knowledge Distillation. 310-324 - Yuan-Yuan Liu, Qi Liu, Song-Lu Chen, Feng Chen, Xu-Cheng Yin:
Irregular License Plate Recognition via Global Information Integration. 325-339 - Xiaohai Zhang, Jinming Zhang, Jianliang Li, Ming Chen:
TNT-Net: Point Cloud Completion by Transformer in Transformer. 340-352 - Jiacheng Chen, Fei Wu, Wanliang Wang, Haoxin Sheng:
Fourier Transformer for Joint Super-Resolution and Reconstruction of MR Image. 353-364 - Yangjie Cao, Bo Wang, Zhenqiang Li, Jie Li:
MVD-NeRF: Resolving Shape-Radiance Ambiguity via Mitigating View Dependency. 365-378 - Jingzhi Zhang, Xudong Li, Linghui Sun, Chengjie Bai:
DPM-Det: Diffusion Model Object Detection Based on DPM-Solver++ Guided Sampling. 379-393 - Sicheng Wang, Hao Jiang, Lei Xiang:
CT-MVSNet: Efficient Multi-view Stereo with Cross-Scale Transformer. 394-408 - Feifei Xu, Wang Zhou, Tao Sun, Jiahao Lu, Ziheng Yu, Guangzhen Li:
A Coarse and Fine Grained Masking Approach for Video-Grounded Dialogue. 409-422 - Xiaotong Bu, Jiwen Dong, Mengjiao Zhang, Guang Feng, Xizhan Gao, Sijie Niu:
Deep Self-supervised Subspace Clustering with Triple Loss. 423-436 - Baotong Su, Wenguang Zheng:
LigCDnet:Remote Sensing Image Cloud Detection Based on Lightweight Framework. 437-450 - Qizhen Chen, Xin Chen, Xiaoling Deng, Yubin Lan:
Gait Recognition Based on Temporal Gait Information Enhancing. 451-463 - Yanyan Jiao, Wenzhu Yang, Wenjie Xing:
Learning Complementary Instance Representation with Parallel Adaptive Graph-Based Network for Action Detection. 464-478 - Xu Chen, Zhibin Zhang:
CESegNet:Context-Enhancement Semantic Segmentation Network Based on Transformer. 479-493 - Lu Zhang, Jingliang Peng, Na Lv:
MoCap-Video Data Retrieval with Deep Cross-Modal Learning. 494-506 - Guangjie Yang, Dajian Zhong, Yu-Jie Xiong, Hongjian Zhan:
LRATNet: Local-Relationship-Aware Transformer Network for Table Structure Recognition. 507-520
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.