{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,7,21]],"date-time":"2024-07-21T11:40:21Z","timestamp":1721562021147},"reference-count":32,"publisher":"MDPI AG","issue":"22","license":[{"start":{"date-parts":[[2021,11,11]],"date-time":"2021-11-11T00:00:00Z","timestamp":1636588800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100007301","name":"Kunming University of Science and Technology","doi-asserted-by":"publisher","award":["Grant No.61732005;Grant No.61866019,61972186,61761026;Grant No. 202103AA080015,202002AD080001;Grant No. 2019FA023;Grant No. 2019HB006"],"id":[{"id":"10.13039\/501100007301","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"Finding the news of same case from the large numbers of case-involved news is an important basis for public opinion analysis. Existing text clustering methods usually based on topic models which only use topic and case infomation as the global features of documents, so distinguishing between different cases with similar types remains a challenge. The contents of documents contain rich local features. Taking into account the internal features of news, the information of cases and the contributions provided by different topics, we propose a clustering method of case-involved news, which combines topic network and multi-head attention mechanism. Using case information and topic information to construct a topic network, then extracting the global features by graph convolution network, thus realizing the combination of case information and topic information. At the same time, the local features are extracted by multi-head attention mechanism. Finally, the fusion of global features and local features is realized by variational auto-encoder, and the learned latent representations are used for clustering. The experiments show that the proposed method significantly outperforms the state-of-the-art unsupervised clustering methods.<\/jats:p>","DOI":"10.3390\/s21227501","type":"journal-article","created":{"date-parts":[[2021,11,12]],"date-time":"2021-11-12T04:04:46Z","timestamp":1636689886000},"page":"7501","source":"Crossref","is-referenced-by-count":2,"title":["A Clustering Method of Case-Involved News by Combining Topic Network and Multi-Head Attention Mechanism"],"prefix":"10.3390","volume":"21","author":[{"ORCID":"http:\/\/orcid.org\/0000-0002-8289-6036","authenticated-orcid":false,"given":"Cunli","family":"Mao","sequence":"first","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China"},{"name":"Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650500, China"}]},{"given":"Haoyuan","family":"Liang","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China"},{"name":"Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650500, China"}]},{"given":"Zhengtao","family":"Yu","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China"},{"name":"Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650500, China"}]},{"given":"Yuxin","family":"Huang","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China"},{"name":"Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650500, China"}]},{"given":"Junjun","family":"Guo","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China"},{"name":"Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650500, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,11,11]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Rusu, D., Hodson, J., and Kimball, A. (2014, January 22\u201327). Unsupervised techniques for extracting and clustering complex events in news. Proceedings of the Second Workshop on EVENTS: Definition, Detection, Coreference, and Representation, Baltimore, MD, USA.","DOI":"10.3115\/v1\/W14-2905"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"39501","DOI":"10.1109\/ACCESS.2018.2855437","article-title":"A survey of clustering with deep learning: From the perspective of network architecture","volume":"6","author":"Min","year":"2018","journal-title":"IEEE Access"},{"key":"ref_3","first-page":"100","article-title":"Algorithm AS 136: A K-means clustering algorithm","volume":"28","author":"Hartigan","year":"1979","journal-title":"J. R. Stat. Soc. Ser. C (Appl. Stat.)"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1016\/j.knosys.2016.06.031","article-title":"An efficient approximation to the K-means clustering for massive data","volume":"117","author":"Lozano","year":"2017","journal-title":"Knowl.-Based Syst."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Kumar, J., Shao, J., Uddin, S., and Ali, W. (2020, January 5\u201310). An online semantic-enhanced Dirichlet model for short text stream clustering. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online.","DOI":"10.18653\/v1\/2020.acl-main.70"},{"key":"ref_6","unstructured":"Kingma, D.P., and Welling, M. (2013). Auto-encoding variational bayes. arXiv."},{"key":"ref_7","first-page":"3371","article-title":"Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion","volume":"11","author":"Vincent","year":"2010","journal-title":"J. Mach. Learn. Res."},{"key":"ref_8","unstructured":"Kipf, T.N., and Welling, M. (2016). Semi-supervised classification with graph convolutional networks. arXiv."},{"key":"ref_9","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017, January 4\u20139). Attention is all you need, In Proceedings of the 31st Conference on Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_10","unstructured":"Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019, January 2\u20137). Bert: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, MN, USA."},{"key":"ref_11","first-page":"993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"ref_12","first-page":"6505","article-title":"Document informed neural autoregressive topic models with distributional prior","volume":"33","author":"Gupta","year":"2019","journal-title":"Proc. AAAI Conf. Artif. Intell."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Mousa, A., and Schuller, B. (2017, January 3\u20137). Contextual bidirectional long short-term memory recurrent neural network language models: A generative approach to sentiment analysis. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, Valencia, Spain.","DOI":"10.18653\/v1\/E17-1096"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Vu, N.T., Gupta, P., Adel, H., and Sch\u00fctze, H. (2017, January 20\u201325). Bi-directional recurrent neural network with ranking loss for spoken language understanding. Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China.","DOI":"10.1109\/ICASSP.2016.7472841"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1162\/tacl_a_00325","article-title":"Topic modeling in embedding spaces","volume":"8","author":"Dieng","year":"2020","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Bianchi, F., Terragni, S., and Hovy, D. (2020). Pre-training is a hot topic: Contextualized document embeddings improve topic coherence. arXiv.","DOI":"10.18653\/v1\/2021.acl-short.96"},{"key":"ref_17","unstructured":"Srivastava, A., and Sutton, C. (2017). Autoencoding variational inference for topic models. arXiv."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Song, C., Liu, F., Huang, Y., Wang, L., and Tan, T. (2013, January 20\u201323). Auto-encoder based data clustering. Proceedings of the Iberoamerican Congress on Pattern Recognition, Havana, Cuba.","DOI":"10.1007\/978-3-642-41822-8_15"},{"key":"ref_19","unstructured":"Xie, J., Girshick, R., and Farhadi, A. (2016, January 19\u201324). Unsupervised deep embedding for clustering analysis. Proceedings of the International Conference on Machine Learning, New York, NY, USA."},{"key":"ref_20","unstructured":"Yang, B., Fu, X., Sidiropoulos, N.D., and Hong, M. (2017, January 6\u201311). Towards k-means-friendly spaces: Simultaneous deep learning and clustering. Proceedings of the International Conference on Machine Learning, Sydney, Australia."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Wang, C., Pan, S., Hu, R., Long, G., Jiang, J., and Zhang, C. (2019, January 10\u201316). Attributed graph clustering: A deep attentional embedding approach. Proceedings of the International Joint Conference on Artificial Intelligence, Macao, China.","DOI":"10.24963\/ijcai.2019\/509"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Bo, D., Wang, X., Shi, C., Zhu, M., Lu, E., and Cui, P. (2020, January 20\u201324). Structural deep clustering network. Proceedings of the Web Conference, Taipei, Taiwan.","DOI":"10.1145\/3366423.3380214"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Chiu, B., Sahu, S.K., Thomas, D., Sengupta, N., and Mahdy, M. (2020, January 5\u201310). Autoencoding Keyword Correlation Graph for Document Clustering. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online.","DOI":"10.18653\/v1\/2020.acl-main.366"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Bahdanau, D., Chorowski, J., Serdyuk, D., Brakel, P., and Bengio, Y. (2016, January 20\u201325). End-to-end attention-based large vocabulary speech recognition. Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China.","DOI":"10.1109\/ICASSP.2016.7472618"},{"key":"ref_25","first-page":"56","article-title":"Case-involved Public Opinion News Summarization with Case Elements Guidance","volume":"34","author":"Han","year":"2020","journal-title":"J. Chin. Inf. Process."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Guo, X., Gao, L., Liu, X., and Yin, J. (2017, January 19\u201325). Improved Deep Embedded Clustering with Local Structure Preservation. Proceedings of the IJCAI\u201917, Melbourne, Australia.","DOI":"10.24963\/ijcai.2017\/243"},{"key":"ref_27","unstructured":"McConville, R., Santos-Rodriguez, R., Piechocki, R.J., and Craddock, I. (2019). N2d:(not too) deep clustering via clustering the local manifold of an autoencoded embedding. arXiv."},{"key":"ref_28","unstructured":"Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., and Lerer, A. Automatic differentiation in PyTorch. Proceedings of the NIPS Autodiff Workshop."},{"key":"ref_29","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1002\/nav.3800020109","article-title":"The Hungarian method for the assignment problem","volume":"2","author":"Kuhn","year":"1955","journal-title":"Nav. Res. Logist. Q."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"568","DOI":"10.1109\/TPAMI.2010.88","article-title":"Parallel spectral clustering in distributed systems","volume":"33","author":"Chen","year":"2010","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_32","first-page":"2837","article-title":"Information theoretic measures for clusterings comparison: Variants, properties, normalization and correction for chance","volume":"11","author":"Vinh","year":"2010","journal-title":"J. Mach. Learn. Res."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/22\/7501\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,21]],"date-time":"2024-07-21T11:13:43Z","timestamp":1721560423000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/22\/7501"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,11]]},"references-count":32,"journal-issue":{"issue":"22","published-online":{"date-parts":[[2021,11]]}},"alternative-id":["s21227501"],"URL":"https:\/\/doi.org\/10.3390\/s21227501","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,11,11]]}}}