{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,12,21]],"date-time":"2023-12-21T00:27:54Z","timestamp":1703118474906},"reference-count":44,"publisher":"Association for Computing Machinery (ACM)","issue":"12","funder":[{"name":"National Key Research Development Program of China","award":["2022YFC3803202"]},{"name":"Major Project of Anhui Province","award":["202203a05020011"]},{"name":"Anhui Province Key Research and Development Program","award":["202304a05020068"]},{"name":"General Programmer of the National Natural Science Foundation of China","award":["62376084"]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["71971002"],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Major Projects of Science and Technology in Anhui Province","award":["202003a060-20016"]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2023,12,31]]},"abstract":"\n Recently,\n E<\/jats:bold>\n motion\n R<\/jats:bold>\n ecognition in\n C<\/jats:bold>\n onversation (ERC) has attracted much attention and has become a hot topic in the field of natural language processing. Conversation is conducted in chronological order; current utterance is more likely influenced by nearby utterances. At the same time, speaker dependency also plays a core role in the conversation dynamic. The combined effect of the sequence-aware information and the speaker-aware information makes the emotion\u2019s dynamic change. However, past works used simple information fusion methods to model the two kinds of information but ignored their interactive influence. Thus, we propose a novel method entitled SIGAT (\n S<\/jats:bold>\n peaker-aware\n I<\/jats:bold>\n nteractive\n G<\/jats:bold>\n raph\n A<\/jats:bold>\n ttention Ne\n t<\/jats:bold>\n work) to solve the problem. The core module is a mutual interactive module in which a dual-connection (self-connection and interact-connection) graph attention network is constructed. The advantage of SIGAT is modeling the speaker-aware and sequence-aware information in a unified graph and updating them simultaneously. In this way, we model the interactive influence of them and obtain the final representations, which have richer contextual clues. Experimental results on the four public datasets demonstrate that SIGAT outperforms the state-of-the-art models.\n <\/jats:p>","DOI":"10.1145\/3627806","type":"journal-article","created":{"date-parts":[[2023,11,7]],"date-time":"2023-11-07T12:17:57Z","timestamp":1699359477000},"page":"1-18","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Speaker-Aware Interactive Graph Attention Network for Emotion Recognition in Conversation"],"prefix":"10.1145","volume":"22","author":[{"ORCID":"http:\/\/orcid.org\/0000-0001-6607-7025","authenticated-orcid":false,"given":"Zhaohong","family":"Jia","sequence":"first","affiliation":[{"name":"Anhui University, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0002-1421-3918","authenticated-orcid":false,"given":"Yunwei","family":"Shi","sequence":"additional","affiliation":[{"name":"Anhui University, Hefei Comprehensive National Science Center, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0001-9385-263X","authenticated-orcid":false,"given":"Weifeng","family":"Liu","sequence":"additional","affiliation":[{"name":"Anhui University, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0003-3178-9721","authenticated-orcid":false,"given":"Zhenhua","family":"Huang","sequence":"additional","affiliation":[{"name":"Anhui University, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0001-9750-7032","authenticated-orcid":false,"given":"Xiao","family":"Sun","sequence":"additional","affiliation":[{"name":"Hefei University of Technology, Hefei Comprehensive National Science Center, China"}]}],"member":"320","published-online":{"date-parts":[[2023,12,19]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-008-9076-6"},{"key":"e_1_3_1_3_2","doi-asserted-by":"crossref","first-page":"225","DOI":"10.18653\/v1\/2020.acl-main.21","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"Chai Zi","year":"2020","unstructured":"Zi Chai and Xiaojun Wan. 2020. Learning to ask more: Semi-autoregressive sequential question generation under dual-graph interaction. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 225\u2013237."},{"key":"e_1_3_1_4_2","first-page":"1725","volume-title":"Proceedings of the 37th International Conference on Machine Learning, ICML","volume":"119","author":"Chen Ming","year":"2020","unstructured":"Ming Chen, Zhewei Wei, Zengfeng Huang, Bolin Ding, and Yaliang Li. 2020. Simple and deep graph convolutional networks. In Proceedings of the 37th International Conference on Machine Learning, ICML, Vol. 119. 1725\u20131735."},{"key":"e_1_3_1_5_2","first-page":"2470","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings","author":"Ghosal Deepanway","year":"2020","unstructured":"Deepanway Ghosal, Navonil Majumder, Alexander Gelbukh, Rada Mihalcea, and Soujanya Poria. 2020. COSMIC: COmmonSense knowledge for eMotion Identification in Conversations. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings. 2470\u20132481."},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1015"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1280"},{"key":"e_1_3_1_8_2","first-page":"2122","volume-title":"Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)","volume":"1","author":"Hazarika Devamanyu","year":"2018","unstructured":"Devamanyu Hazarika, Soujanya Poria, Amir Zadeh, Erik Cambria, Louis-Philippe Morency, and Roger Zimmermann. 2018b. Conversational memory network for emotion recognition in dyadic dialogue videos. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), Vol. 1. 2122\u20132132."},{"key":"e_1_3_1_9_2","first-page":"5666","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL\/IJCNLP (Volume 1: Long Papers)","author":"Hu Jingwen","year":"2021","unstructured":"Jingwen Hu, Yuchen Liu, Jinming Zhao, and Qin Jin. 2021. MMGCN: Multimodal fusion via deep graph convolution network for emotion recognition in conversation. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL\/IJCNLP (Volume 1: Long Papers). 5666\u20135675."},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.597"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2021.03.076"},{"key":"e_1_3_1_12_2","first-page":"8002","volume-title":"The 34th AAAI Conference on Artificial Intelligence, AAAI 2020","author":"Jiao Wenxiang","year":"2020","unstructured":"Wenxiang Jiao, Michael R. Lyu, and Irwin King. 2020. Real-time emotion recognition via attention gated hierarchical memory network. In The 34th AAAI Conference on Artificial Intelligence, AAAI 2020. 8002\u20138009."},{"key":"e_1_3_1_13_2","first-page":"397","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Jiao Wenxiang","year":"2019","unstructured":"Wenxiang Jiao, Haiqin Yang, Irwin King, and Michael R. Lyu. 2019. HiGRU: Hierarchical gated recurrent units for utterance-level emotion recognition. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 397\u2013406."},{"key":"e_1_3_1_14_2","doi-asserted-by":"crossref","unstructured":"Y. Kim. 2014. Convolutional neural networks for sentence classification. arXiv:1408.5882.","DOI":"10.3115\/v1\/D14-1181"},{"key":"e_1_3_1_15_2","article-title":"Semi-supervised classification with graph convolutional networks","author":"Kipf Thomas N.","year":"2016","unstructured":"Thomas N. Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).","journal-title":"arXiv preprint arXiv:1609.02907"},{"key":"e_1_3_1_16_2","first-page":"5669","volume-title":"Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022","author":"Lee Joosung","year":"2022","unstructured":"Joosung Lee and Wooin Lee. 2022. CoMPM: Context modeling with speaker\u2019s pre-trained memory tracking for emotion recognition in conversation. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022. Association for Computational Linguistics, 5669\u20135679."},{"key":"e_1_3_1_17_2","first-page":"4190","volume-title":"Proceedings of the 28th International Conference on Computational Linguistics","author":"Li Jingye","year":"2020","unstructured":"Jingye Li, Donghong Ji, Fei Li, Meishan Zhang, and Yijiang Liu. 2020. HiTrans: A transformer-based context- and speaker-sensitive model for emotion detection in conversations. In Proceedings of the 28th International Conference on Computational Linguistics. 4190\u20134200."},{"key":"e_1_3_1_18_2","article-title":"A hierarchical transformer with speaker modeling for emotion recognition in conversation","author":"Li Jiangnan","year":"2020","unstructured":"Jiangnan Li, Zheng Lin, Peng Fu, Qingyi Si, and Weiping Wang. 2020. A hierarchical transformer with speaker modeling for emotion recognition in conversation. arXiv preprint arXiv:2012.14781 (2020).","journal-title":"arXiv preprint arXiv:2012.14781"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2021.09.057"},{"key":"e_1_3_1_20_2","first-page":"986","volume-title":"Proceedings of the 8th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)","volume":"1","author":"Li Yanran","year":"2017","unstructured":"Yanran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, and Shuzi Niu. 2017. DailyDialog: A manually labelled multi-turn dialogue dataset. In Proceedings of the 8th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Vol. 1. 986\u2013995."},{"key":"e_1_3_1_21_2","first-page":"1610","volume-title":"Findings of the Association for Computational Linguistics: ACL 2022, Dublin, Ireland, May 22-27, 2022","author":"Li Zaijing","year":"2022","unstructured":"Zaijing Li, Fengxiao Tang, Ming Zhao, and Yusen Zhu. 2022. EmoCaps: Emotion capsule based model for conversational emotion recognition. In Findings of the Association for Computational Linguistics: ACL 2022, Dublin, Ireland, May 22-27, 2022, Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (Eds.). 1610\u20131618."},{"key":"e_1_3_1_22_2","doi-asserted-by":"crossref","first-page":"985","DOI":"10.1109\/TASLP.2021.3049898","article-title":"CTNet: Conversational transformer network for emotion recognition","author":"Lian Zheng","year":"2021","unstructured":"Zheng Lian, Bin Liu, and Jianhua Tao. 2021. CTNet: Conversational transformer network for emotion recognition. IEEE\/ACM Trans. Audio, Speech and Lang. Proc. (2021), 985\u20131000.","journal-title":"IEEE\/ACM Trans. Audio, Speech and Lang. Proc."},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2020.10.019"},{"key":"e_1_3_1_24_2","article-title":"RoBERTa: A robustly optimized BERT pretraining approach","author":"Liu Yinhan","year":"2019","unstructured":"Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. RoBERTa: A robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692 (2019).","journal-title":"arXiv preprint arXiv:1907.11692"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.48"},{"issue":"3","key":"e_1_3_1_26_2","article-title":"HAN-ReGRU: Hierarchical attention network with residual gated recurrent unit for emotion recognition in conversation","volume":"33","author":"Ma H.","year":"2021","unstructured":"H. Ma, J. Wang, L. Qian, and H. Lin. 2021. HAN-ReGRU: Hierarchical attention network with residual gated recurrent unit for emotion recognition in conversation. Neural Computing and Applications 33, 3 (2021).","journal-title":"Neural Computing and Applications"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2020.06.011"},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33016818"},{"key":"e_1_3_1_29_2","article-title":"I-GCN: Incremental graph convolution network for conversation emotion detection","author":"Nie Weizhi","year":"2021","unstructured":"Weizhi Nie, Rihao Chang, Minjie Ren, Yuting Su, and Anan Liu. 2021. I-GCN: Incremental graph convolution network for conversation emotion detection. IEEE Transactions on Multimedia (2021).","journal-title":"IEEE Transactions on Multimedia"},{"key":"e_1_3_1_30_2","doi-asserted-by":"crossref","first-page":"2539","DOI":"10.18653\/v1\/D15-1303","volume-title":"Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP","author":"Poria Soujanya","year":"2015","unstructured":"Soujanya Poria, Erik Cambria, and Alexander F. Gelbukh. 2015. Deep convolutional neural network textual features and multiple kernel learning for utterance-level multimodal sentiment analysis. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP, Llu\u00eds M\u00e0rquez, Chris Callison-Burch, Jian Su, Daniele Pighin, and Yuval Marton (Eds.). 2539\u20132544."},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1081"},{"key":"e_1_3_1_32_2","doi-asserted-by":"crossref","first-page":"527","DOI":"10.18653\/v1\/P19-1050","volume-title":"ACL 2019: The 57th Annual Meeting of the Association for Computational Linguistics","author":"Poria Soujanya","year":"2019","unstructured":"Soujanya Poria, Devamanyu Hazarika, Navonil Majumder, Rada Mihalcea, Gautam Naik, and Erik Cambria. 2019. MELD: A multimodal multi-party dataset for emotion recognition in conversations. In ACL 2019: The 57th Annual Meeting of the Association for Computational Linguistics. 527\u2013536."},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2929050"},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i15.17616"},{"key":"e_1_3_1_35_2","article-title":"LR-GCN: Latent relation-aware graph convolutional network for conversational emotion recognition","author":"Ren Minjie","year":"2021","unstructured":"Minjie Ren, Xiangdong Huang, Wenhui Li, Dan Song, and Weizhi Nie. 2021. LR-GCN: Latent relation-aware graph convolutional network for conversational emotion recognition. IEEE Transactions on Multimedia (2021).","journal-title":"IEEE Transactions on Multimedia"},{"key":"e_1_3_1_36_2","first-page":"593","volume-title":"The Semantic Web \u2014 15th International Conference, ESWC 2018","volume":"10843","author":"Schlichtkrull Michael Sejr","year":"2018","unstructured":"Michael Sejr Schlichtkrull, Thomas N. Kipf, Peter Bloem, Rianne van den Berg, Ivan Titov, and Max Welling. 2018. Modeling relational data with graph convolutional networks. In The Semantic Web \u2014 15th International Conference, ESWC 2018, Vol. 10843. 593\u2013607."},{"key":"e_1_3_1_37_2","first-page":"13789","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"Shen Weizhou","year":"2021","unstructured":"Weizhou Shen, Junqing Chen, Xiaojun Quan, and Zhixian Xie. 2021. DialogXL: All-in-one XLNet for multi-party conversation emotion recognition. In Proceedings of the AAAI Conference on Artificial Intelligence. 13789\u201313797."},{"key":"e_1_3_1_38_2","first-page":"1551","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)","author":"Shen Weizhou","year":"2021","unstructured":"Weizhou Shen, Siyue Wu, Yunyi Yang, and Xiaojun Quan. 2021. Directed acyclic graph network for conversational emotion recognition. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 1551\u20131560."},{"key":"e_1_3_1_39_2","first-page":"8542","volume-title":"IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022","author":"Song Xiaohui","year":"2022","unstructured":"Xiaohui Song, Liangjun Zang, Rong Zhang, Songlin Hu, and Longtao Huang. 2022. EmotionFlow: Capture the dialogue level emotion transitions. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022. IEEE, 8542\u20138546."},{"key":"e_1_3_1_40_2","volume-title":"Proceedings of the 6th International Conference on Learning Representations, ICLR","author":"Velickovic Petar","year":"2018","unstructured":"Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Li\u00f2, and Yoshua Bengio. 2018. Graph attention networks. In Proceedings of the 6th International Conference on Learning Representations, ICLR."},{"key":"e_1_3_1_41_2","article-title":"Adapted dynamic memory network for emotion recognition in conversation","author":"Xing Songlong","year":"2020","unstructured":"Songlong Xing, Sijie Mai, and Haifeng Hu. 2020. Adapted dynamic memory network for emotion recognition in conversation. IEEE Transactions on Affective Computing (2020).","journal-title":"IEEE Transactions on Affective Computing"},{"key":"e_1_3_1_42_2","first-page":"5753","volume-title":"Advances in Neural Information Processing Systems","author":"Yang Zhilin","year":"2019","unstructured":"Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Russ R. Salakhutdinov, and Quoc V. Le. 2019. XLNet: Generalized autoregressive pretraining for language understanding. In Advances in Neural Information Processing Systems. 5753\u20135763."},{"key":"e_1_3_1_43_2","first-page":"44","volume-title":"AAAI Workshops","author":"Zahiri Sayyed M.","year":"2017","unstructured":"Sayyed M. Zahiri and Jinho D. Choi. 2017. Emotion detection on TV show transcripts with sequence-based convolutional neural networks. In AAAI Workshops. 44\u201352."},{"key":"e_1_3_1_44_2","doi-asserted-by":"crossref","first-page":"165","DOI":"10.18653\/v1\/D19-1016","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)","author":"Zhong Peixiang","year":"2019","unstructured":"Peixiang Zhong, Di Wang, and Chunyan Miao. 2019. Knowledge-enriched transformer for emotion detection in textual conversations. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 165\u2013176."},{"key":"e_1_3_1_45_2","first-page":"1571","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL\/IJCNLP 2021 (Volume 1: Long Papers), 2021","author":"Zhu Lixing","year":"2021","unstructured":"Lixing Zhu, Gabriele Pergola, Lin Gui, Deyu Zhou, and Yulan He. 2021. Topic-driven and knowledge-aware transformer for dialogue emotion detection. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL\/IJCNLP 2021 (Volume 1: Long Papers), 2021, Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (Eds.). 1571\u20131582."}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3627806","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,20]],"date-time":"2023-12-20T06:19:09Z","timestamp":1703053149000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3627806"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,19]]},"references-count":44,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2023,12,31]]}},"alternative-id":["10.1145\/3627806"],"URL":"https:\/\/doi.org\/10.1145\/3627806","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,12,19]]},"assertion":[{"value":"2022-03-29","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-09-28","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-12-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}