Enhancing Multimedia Recommendation Through Item-Item Semantic Denoising and Global Preference Awareness

Zhang, Yanlong; Zheng, Shangfei; Zhou, Qian; Chen, Wei; Zhao, Lei

doi:10.1007/978-3-031-46661-8_53

Yanlong Zhang¹⁵,
Shangfei Zheng¹⁵,
Qian Zhou¹⁵,
Wei Chen¹⁵ &
…
Lei Zhao¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14176))

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

1093 Accesses

Abstract

Multimedia recommendation aims to predict whether users will interact with multimodal items. A few recent works that explicitly learn the semantic structure between items using multimodal features manifest impressive performance gains. This is mainly attributed to the capability of graph convolutional networks (GCNs) to learn superior item representations by propagating and aggregating information from high-order neighbors on the semantic structure. However, they still suffer from two major challenges: a) the noisy relations (edges) in the item-item semantic structure disrupt information propagation and generate low-quality item representations, which impairs the effectiveness and robustness of existing methods; b) the lack of an optimization objective that exploits informative samples and global preference information leads to suboptimal training of the model, which makes users and items indistinguishable in the embedding space. To overcome these challenges, we propose Enhancing Multi media Recommendation through Item-Item Semantic Denoising and Global Preference Awareness (MMGPA). Specifically, the model contains the following two components: (1) a modal semantic representation network is carefully designed to learn the high-quality multimodal representation of items by modeling the denoised item-item semantic structure, and (2) a global preference-aware optimization objective prioritizes the most informative hard sample pairs while constraining the multiple preference distances to better separate the embedding space. Extensive experimental results demonstrate that the proposed method outperforms various state-of-the-art competitors on three public benchmark datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 11210; Price includes VAT (Japan)

Softcover Book: JPY 14013; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Exploiting heterogeneous information isolation and multi-view aggregation for multimodal recommendation

Article 19 November 2024

Feature-enhanced embedding learning for heterogeneous collaborative filtering

Article 22 June 2022

Recommendation Based Heterogeneous Information Network and Neural Network Model

References

Chen, J., Zhang, H., He, X., Nie, L., Liu, W., Chua, T.: Attentive collaborative filtering: multimedia recommendation with item- and component-level attention. In: SIGIR, pp. 335–344. ACM (2017)
Google Scholar
Chen, M., Wei, Z., Huang, Z., Ding, B., Li, Y.: Simple and deep graph convolutional networks. In: ICML. Proceedings of Machine Learning Research, vol. 119, pp. 1725–1735. PMLR (2020)
Google Scholar
Chen, T., Kornblith, S., Norouzi, M., Hinton, G.E.: A simple framework for contrastive learning of visual representations. In: ICML. Proceedings of Machine Learning Research, vol. 119, pp. 1597–1607. PMLR (2020)
Google Scholar
Chen, Y., Wu, L., Zaki, M.J.: Iterative deep graph learning for graph neural networks: better and robust node embeddings. In: NeurIPS (2020)
Google Scholar
Ding, J., Quan, Y., Yao, Q., Li, Y., Jin, D.: Simplify and robustify negative sampling for implicit collaborative filtering. In: NeurIPS (2020)
Google Scholar
Gao, Z., Cheng, Z., Pérez, F., Sun, J., Volkovs, M.: MCL: mixed-centric loss for collaborative filtering. In: WWW, pp. 2339–2347. ACM (2022)
Google Scholar
He, R., McAuley, J.J.: VBPR: visual Bayesian personalized ranking from implicit feedback. In: AAAI, pp. 144–150. AAAI Press (2016)
Google Scholar
He, X., Chen, T., Kan, M., Chen, X.: Trirank: review-aware explainable recommendation by modeling aspects. In: CIKM, pp. 1661–1670. ACM (2015)
Google Scholar
He, X., Deng, K., Wang, X., Li, Y., Zhang, Y., Wang, M.: Lightgcn: simplifying and powering graph convolution network for recommendation. In: SIGIR, pp. 639–648. ACM (2020)
Google Scholar
Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: ICLR (Poster). OpenReview.net (2017)
Google Scholar
Liu, Q., Wu, S., Wang, L.: Deepstyle: learning user preferences for visual recommendation. In: SIGIR, pp. 841–844. ACM (2017)
Google Scholar
Ma, C., Ma, L., Zhang, Y., Tang, R., Liu, X., Coates, M.: Probabilistic metric learning with adaptive margin for top-k recommendation. In: KDD, pp. 1036–1044. ACM (2020)
Google Scholar
McAuley, J.J., Targett, C., Shi, Q., van den Hengel, A.: Image-based recommendations on styles and substitutes. In: SIGIR, pp. 43–52. ACM (2015)
Google Scholar
McPherson, M., Smith-Lovin, L., Cook, J.M.: Birds of a feather: homophily in social networks. Ann. Rev. Sociol. 27(1), 415–444 (2001)
Article Google Scholar
Mu, Z., Zhuang, Y., Tan, J., Xiao, J., Tang, S.: Learning hybrid behavior patterns for multimedia recommendation. In: ACM Multimedia, pp. 376–384. ACM (2022)
Google Scholar
Reimers, N., Gurevych, I.: Sentence-bert: sentence embeddings using siamese bert-networks. In: EMNLP/IJCNLP (1), pp. 3980–3990. Association for Computational Linguistics (2019)
Google Scholar
Rendle, S., Freudenthaler, C., Gantner, Z., Schmidt-Thieme, L.: BPR: bayesian personalized ranking from implicit feedback. In: UAI, pp. 452–461. AUAI Press (2009)
Google Scholar
Rong, Y., Huang, W., Xu, T., Huang, J.: Dropedge: towards deep graph convolutional networks on node classification. In: ICLR. OpenReview.net (2020)
Google Scholar
Sohn, K.: Improved deep metric learning with multi-class n-pair loss objective. In: NIPS, pp. 1849–1857 (2016)
Google Scholar
Song, H.O., Xiang, Y., Jegelka, S., Savarese, S.: Deep metric learning via lifted structured feature embedding. In: CVPR, pp. 4004–4012. IEEE Computer Society (2016)
Google Scholar
Wang, X., He, X., Wang, M., Feng, F., Chua, T.: Neural graph collaborative filtering. In: SIGIR, pp. 165–174. ACM (2019)
Google Scholar
Wang, X., Han, X., Huang, W., Dong, D., Scott, M.R.: Multi-similarity loss with general pair weighting for deep metric learning. In: CVPR, pp. 5022–5030. Computer Vision Foundation/IEEE (2019)
Google Scholar
Wei, Y., Wang, X., Nie, L., He, X., Chua, T.: Graph-refined convolutional network for multimedia recommendation with implicit feedback. In: ACM Multimedia, pp. 3541–3549. ACM (2020)
Google Scholar
Wei, Y., Wang, X., Nie, L., He, X., Hong, R., Chua, T.: MMGCN: multi-modal graph convolution network for personalized recommendation of micro-video. In: ACM Multimedia, pp. 1437–1445. ACM (2019)
Google Scholar
Wu, F., Jr., A.H.S., Zhang, T., Fifty, C., Yu, T., Weinberger, K.Q.: Simplifying graph convolutional networks. In: ICML. Proceedings of Machine Learning Research, vol. 97, pp. 6861–6871. PMLR (2019)
Google Scholar
Wu, J., Wang, X., Feng, F., He, X., Chen, L., Lian, J., Xie, X.: Self-supervised graph learning for recommendation. In: SIGIR, pp. 726–735. ACM (2021)
Google Scholar
Zhang, J., Zhu, Y., Liu, Q., Wu, S., Wang, S., Wang, L.: Mining latent structures for multimedia recommendation. In: ACM Multimedia, pp. 3872–3880. ACM (2021)
Google Scholar
Zhang, J., Zhu, Y., Liu, Q., Zhang, M., Wu, S., Wang, L.: Latent structure mining with contrastive modality fusion for multimedia recommendation. IEEE Trans. Knowl. Data Eng. (2022)
Google Scholar

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China No. 62272332, the Major Program of the Natural Science Foundation of Jiangsu Higher Education Institutions of China No. 22KJA520006.

Author information

Authors and Affiliations

School of Computer Science and Technology, Soochow University, Suzhou, China
Yanlong Zhang, Shangfei Zheng, Qian Zhou, Wei Chen & Lei Zhao

Authors

Yanlong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Shangfei Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Qian Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Wei Chen
View author publications
You can also search for this author in PubMed Google Scholar
Lei Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wei Chen .

Editor information

Editors and Affiliations

Northeastern University, Shenyang, China
Xiaochun Yang
The University of Indonesia, Depok, Indonesia
Heru Suhartanto
Beijing Institute of Technology, Beijing, China
Guoren Wang
Northeastern University, Shenyang, China
Bin Wang
University of Technology Sydney, Sydney, NSW, Australia
Jing Jiang
Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
Bing Li
Sun Yat-sen University, Guangzhou, China
Huaijie Zhu
Anhui University, Hefei, China
Ningning Cui

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, Y., Zheng, S., Zhou, Q., Chen, W., Zhao, L. (2023). Enhancing Multimedia Recommendation Through Item-Item Semantic Denoising and Global Preference Awareness. In: Yang, X., et al. Advanced Data Mining and Applications. ADMA 2023. Lecture Notes in Computer Science(), vol 14176. Springer, Cham. https://doi.org/10.1007/978-3-031-46661-8_53

Download citation

DOI: https://doi.org/10.1007/978-3-031-46661-8_53
Published: 05 November 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46660-1
Online ISBN: 978-3-031-46661-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Enhancing Multimedia Recommendation Through Item-Item Semantic Denoising and Global Preference Awareness

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Exploiting heterogeneous information isolation and multi-view aggregation for multimodal recommendation

Feature-enhanced embedding learning for heterogeneous collaborative filtering

Recommendation Based Heterogeneous Information Network and Neural Network Model

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Enhancing Multimedia Recommendation Through Item-Item Semantic Denoising and Global Preference Awareness

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Exploiting heterogeneous information isolation and multi-view aggregation for multimodal recommendation

Feature-enhanced embedding learning for heterogeneous collaborative filtering

Recommendation Based Heterogeneous Information Network and Neural Network Model

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation