{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,5]],"date-time":"2024-08-05T20:43:48Z","timestamp":1722890628120},"reference-count":71,"publisher":"Association for Computing Machinery (ACM)","issue":"4","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Inf. Syst."],"published-print":{"date-parts":[[2023,10,31]]},"abstract":"The development of deep neural networks and the emergence of pre-trained language models such as BERT allow to increase performance on many NLP tasks. However, these models do not meet the same popularity for tweet stream summarization, which is probably because their computation limitation requires to drastically truncate the textual input.<\/jats:p>\n Our contribution in this article is threefold. First, we propose a neural model to automatically and incrementally summarize huge tweet streams. This extractive model combines in an original way pre-trained language models and vocabulary frequency based representations to predict tweet salience. An additional advantage of the model is that it automatically adapts the size of the output summary according to the input tweet stream. Second, we detail an original methodology to construct tweet stream summarization datasets requiring little human effort. Third, we release the TES 2012-2016 dataset constructed using the aforementioned methodology. Baselines, oracle summaries, gold standard, and qualitative assessments are made publicly available.<\/jats:p>\n To evaluate our approach, we conducted extensive quantitative experiments using three different tweet collections as well as an additional qualitative evaluation. Results show that our method outperforms state-of-the-art ones. We believe that this work opens avenues of research for incremental summarization, which has not received much attention yet.<\/jats:p>","DOI":"10.1145\/3581786","type":"journal-article","created":{"date-parts":[[2023,1,21]],"date-time":"2023-01-21T11:44:18Z","timestamp":1674301458000},"page":"1-33","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["TSSuBERT: How to Sum Up Multiple Years of Reading in a Few Tweets"],"prefix":"10.1145","volume":"41","author":[{"ORCID":"http:\/\/orcid.org\/0000-0001-8859-0313","authenticated-orcid":false,"given":"Alexis","family":"Dusart","sequence":"first","affiliation":[{"name":"IRIT, Universit\u00e9 de Toulouse, CNRS, Toulouse INP, UT3, France"}]},{"ORCID":"http:\/\/orcid.org\/0000-0003-3414-3803","authenticated-orcid":false,"given":"Karen","family":"Pinel-Sauvagnat","sequence":"additional","affiliation":[{"name":"IRIT, Universit\u00e9 de Toulouse, CNRS, Toulouse INP, UT3, France"}]},{"ORCID":"http:\/\/orcid.org\/0000-0003-3494-7561","authenticated-orcid":false,"given":"Gilles","family":"Hubert","sequence":"additional","affiliation":[{"name":"IRIT, Universit\u00e9 de Toulouse, CNRS, Toulouse INP, UT3, France"}]}],"member":"320","published-online":{"date-parts":[[2023,4,10]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/3308558.3313396"},{"key":"e_1_3_2_3_2","volume-title":"Proceedings of the 5th International Conference on Learning Representations: Conference Track Proceedings (ICLR\u201917).","author":"Arora Sanjeev","year":"2017","unstructured":"Sanjeev Arora, Yingyu Liang, and Tengyu Ma. 2017. A simple but tough-to-beat baseline for sentence embeddings. In Proceedings of the 5th International Conference on Learning Representations: Conference Track Proceedings (ICLR\u201917).https:\/\/openreview.net\/forum?id=SyK00v5xx."},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/3341981.3344241"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/290941.291025"},{"key":"e_1_3_2_6_2","first-page":"3698","volume-title":"Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI\u201916)","author":"Chang Yi","year":"2016","unstructured":"Yi Chang, Jiliang Tang, Dawei Yin, Makoto Yamada, and Yan Liu. 2016. Timeline summarization from social media with life cycle models. In Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI\u201916). 3698\u20133704. http:\/\/www.ijcai.org\/Abstract\/16\/520."},{"key":"e_1_3_2_7_2","volume-title":"Proceedings of the 27th Text Retrieval Conference (TREC\u201918)","author":"Chellal Abdelhamid","year":"2018","unstructured":"Abdelhamid Chellal and Mohand Boughanem. 2018. IRIT at TREC Real-Time Summarization 2018. In Proceedings of the 27th Text Retrieval Conference (TREC\u201918). https:\/\/trec.nist.gov\/pubs\/trec27\/papers\/IRIT-RT.pdf."},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1162\/coli_a_00004"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/3274784.3274788"},{"key":"e_1_3_2_10_2","volume-title":"Proceedings of the Document Understanding Workshop (DUC\u201905)","author":"Dang Hoa Trang","year":"2005","unstructured":"Hoa Trang Dang. 2005. Overview of DUC 2005. In Proceedings of the Document Understanding Workshop (DUC\u201905). https:\/\/duc.nist.gov\/pubs\/2005papers\/OVERVIEW05.pdf."},{"key":"e_1_3_2_11_2","volume-title":"Proceedings of the 1st Text Analysis Conference (TAC\u201908)","author":"Dang Hoa Trang","year":"2008","unstructured":"Hoa Trang Dang and Karolina Owczarzak. 2008. Overview of the TAC 2008 update summarization task. In Proceedings of the 1st Text Analysis Conference (TAC\u201908). https:\/\/tac.nist.gov\/publications\/2008\/additional.papers\/update_summ_overview08.proceedings.pdf."},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1423"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.384"},{"key":"e_1_3_2_14_2","first-page":"763","volume-title":"Proceedings of COLING 2012: Technical Papers","author":"Duan Yajuan","year":"2012","unstructured":"Yajuan Duan, Zhimin Chen, Furu Wei, Ming Zhou, and Heung-Yeung Shum. 2012. Twitter topic summarization by ranking tweets using social influence and content quality. In Proceedings of COLING 2012: Technical Papers. 763\u2013780. https:\/\/www.aclweb.org\/anthology\/C12-1047\/."},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1145\/3412841.3441946"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2018.033001411"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2020.113679"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1102"},{"key":"e_1_3_2_19_2","volume-title":"Proceedings of the 27th Text Retrieval Conference (TREC\u201918)","author":"Fern\u00e1ndez Javi","year":"2018","unstructured":"Javi Fern\u00e1ndez, Fernando Llopis, Yoan Guti\u00e9rrez, Patricio Mart\u00ednez-Barco, Jos\u00e9 M. G\u00f3mez, and Rafael Mu\u00f1oz. 2018. GPLSI at TREC 2018 RTS track. In Proceedings of the 27th Text Retrieval Conference (TREC\u201918). https:\/\/trec.nist.gov\/pubs\/trec27\/papers\/UA-GPLSI-RT.pdf."},{"key":"e_1_3_2_20_2","volume-title":"Proceedings of the 23rd Text Retrieval Conference (TREC\u201914)","volume":"500","author":"Frank John R.","year":"2014","unstructured":"John R. Frank, Max Kleiman-Weiner, Daniel A. Roberts, Ellen M. Voorhees, and Ian Soboroff. 2014. Evaluating stream filtering for entity profile updates in TREC 2012, 2013, and 2014. In Proceedings of the 23rd Text Retrieval Conference (TREC\u201914), Vol. 500-308. http:\/\/trec.nist.gov\/pubs\/trec23\/papers\/overview-kba.pdf."},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-016-9475-9"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.120"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.35111\/0z6y-q265"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1609\/icwsm.v5i1.14190"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.5555\/2969239.2969428"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.703"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1145\/3366424.3382678"},{"key":"e_1_3_2_28_2","first-page":"15347","volume-title":"Proceedings of the 23rd Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-\u201921)","author":"Li Quanzhi","year":"2021","unstructured":"Quanzhi Li and Qiong Zhang. 2021. Twitter event summarization by exploiting semantic terms and graph network. In Proceedings of the 23rd Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-\u201921). 15347\u201315354. https:\/\/ojs.aaai.org\/index.php\/AAAI\/article\/view\/17802."},{"key":"e_1_3_2_29_2","volume-title":"Proceedings of the 26th Text Retrieval Conference (TREC\u201917)","author":"Lin Jimmy","year":"2017","unstructured":"Jimmy Lin, Salman Mohammed, Royal Sequiera, Luchen Tan, Nimesh Ghelani, Mustafa Abualsaud, Richard McCreadie, Dmitrijs Milajevs, and Ellen M. Voorhees. 2017. Overview of the TREC 2017 Real-Time Summarization Track. In Proceedings of the 26th Text Retrieval Conference (TREC\u201917). https:\/\/trec.nist.gov\/pubs\/trec26\/papers\/Overview-RT.pdf."},{"key":"e_1_3_2_30_2","volume-title":"Proceedings of the 25th Text Retrieval Conference (TREC\u201916)","author":"Lin Jimmy","year":"2016","unstructured":"Jimmy Lin, Adam Roegiest, Luchen Tan, Richard McCreadie, Ellen M. Voorhees, and Fernando Diaz. 2016. Overview of the TREC 2016 Real-Time Summarization Track. In Proceedings of the 25th Text Retrieval Conference (TREC\u201916). https:\/\/trec.nist.gov\/pubs\/trec25\/papers\/Overview-RT.pdf."},{"key":"e_1_3_2_31_2","first-page":"1699","volume-title":"Proceedings of COLING 2012: Technical Papers","author":"Liu Xiaohua","year":"2012","unstructured":"Xiaohua Liu, Yitong Li, Furu Wei, and Ming Zhou. 2012. Graph-based multi-tweet summarization using social signals. In Proceedings of COLING 2012: Technical Papers. 1699\u20131714. https:\/\/www.aclweb.org\/anthology\/C12-1104\/."},{"key":"e_1_3_2_32_2","first-page":"3728","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP\u201919)","author":"Liu Yang","year":"2019","unstructured":"Yang Liu and Mirella Lapata. 2019. Text summarization with pretrained encoders. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP\u201919). 3728\u20133738. https:\/\/aclanthology.org\/D19-1387.pdf."},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/3529754"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324901002741"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/k18-1023"},{"key":"e_1_3_2_36_2","first-page":"691","volume-title":"Proceedings of the 16th International Conference on Information Systems for Crisis Response and Management (ISCRAM\u201919)","author":"McCreadie Richard","year":"2019","unstructured":"Richard McCreadie, Cody Buntain, and Ian Soboroff. 2019. TREC incident streams: Finding actionable information on social media. In Proceedings of the 16th International Conference on Information Systems for Crisis Response and Management (ISCRAM\u201919). 691\u2013705. http:\/\/eprints.gla.ac.uk\/183409\/."},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1145\/3158671"},{"key":"e_1_3_2_38_2","first-page":"3075","volume-title":"Proceedings of the 31st AAAI Conference on Artificial Intelligence","author":"Nallapati Ramesh","year":"2017","unstructured":"Ramesh Nallapati, Feifei Zhai, and Bowen Zhou. 2017. SummaRuNNer: A recurrent neural network based sequence model for extractive summarization of documents. In Proceedings of the 31st AAAI Conference on Artificial Intelligence. 3075\u20133081. http:\/\/aaai.org\/ocs\/index.php\/AAAI\/AAAI17\/paper\/view\/14636."},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1206"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-1158"},{"key":"e_1_3_2_41_2","first-page":"145","volume-title":"Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL\u201904","author":"Nenkova Ani","year":"2004","unstructured":"Ani Nenkova and Rebecca Passonneau. 2004. Evaluating content selection in summarization: The Pyramid method. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL\u201904). 145\u2013152. https:\/\/aclanthology.org\/N04-1019."},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-demos.2"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/E14-4046"},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.217"},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.3301232"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1145\/2484028.2484052"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1145\/3178541"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1145\/2806416.2806485"},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSS.2019.2937899"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1145\/3209978.3210030"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1044"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSS.2019.2945172"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.48550\/ARXIV.1910.01108"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1099"},{"key":"e_1_3_2_55_2","volume-title":"Proceedings of the 27th Text Retrieval Conference (TREC\u201918)","author":"Sequiera Royal","year":"2018","unstructured":"Royal Sequiera, Luchen Tan, and Jimmy Lin. 2018. Overview of the TREC 2018 Real-Time Summarization Track. In Proceedings of the 27th Text Retrieval Conference (TREC\u201918). https:\/\/trec.nist.gov\/pubs\/trec27\/papers\/Overview-RTS.pdf."},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1093\/comjnl\/bxt109"},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1145\/2484028.2484045"},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6420"},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.5555\/2627435.2670313"},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.1145\/3077136.3080677"},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-16354-3_26"},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1145\/2484028.2484163"},{"key":"e_1_3_2_63_2","first-page":"5998","volume-title":"Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017. 5998\u20136008. https:\/\/proceedings.neurips.cc\/paper\/2017\/file\/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf."},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871476"},{"key":"e_1_3_2_65_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2017\/581"},{"key":"e_1_3_2_66_2","first-page":"2204","volume-title":"Proceedings of the 24th European Conference on Artificial Intelligence (ECAI\u201920)","author":"Werner Matheus","year":"2020","unstructured":"Matheus Werner and Eduardo Laber. 2020. Speeding up word mover\u2019s distance and its variants via properties of distances between embeddings. In Proceedings of the 24th European Conference on Artificial Intelligence (ECAI\u201920). 2204\u20132211. https:\/\/ecai2020.eu\/papers\/888_paper.pdf."},{"key":"e_1_3_2_67_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.360"},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.1145\/2009916.2010016"},{"key":"e_1_3_2_69_2","first-page":"9410","volume-title":"Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAI\u201920), the 32nd Innovative Applications of Artificial Intelligence Conference (IAAI\u201920), and the 10th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201920)","author":"Yang Min","year":"2020","unstructured":"Min Yang, Chengming Li, Fei Sun, Zhou Zhao, Ying Shen, and Chenglin Wu. 2020. Be relevant, non-redundant, and timely: Deep reinforcement learning for real-time event summarization. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAI\u201920), the 32nd Innovative Applications of Artificial Intelligence Conference (IAAI\u201920), and the 10th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201920). 9410\u20139417. https:\/\/ojs.aaai.org\/index.php\/AAAI\/article\/view\/6483."},{"key":"e_1_3_2_70_2","first-page":"11328","volume-title":"Proceedings of the 37th International Conference on Machine Learning (ICML\u201920)","author":"Zhang Jingqing","year":"2020","unstructured":"Jingqing Zhang, Yao Zhao, Mohammad Saleh, and Peter J. Liu. 2020. PEGASUS: Pre-training with extracted gap-sentences for abstractive summarization. In Proceedings of the 37th International Conference on Machine Learning (ICML\u201920). 11328\u201311339. http:\/\/proceedings.mlr.press\/v119\/zhang20ae\/zhang20ae.pdf."},{"key":"e_1_3_2_71_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.552"},{"key":"e_1_3_2_72_2","doi-asserted-by":"publisher","DOI":"10.1002\/asi.24026"}],"container-title":["ACM Transactions on Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3581786","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,4,10]],"date-time":"2023-04-10T10:34:13Z","timestamp":1681122853000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3581786"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,10]]},"references-count":71,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2023,10,31]]}},"alternative-id":["10.1145\/3581786"],"URL":"https:\/\/doi.org\/10.1145\/3581786","relation":{},"ISSN":["1046-8188","1558-2868"],"issn-type":[{"value":"1046-8188","type":"print"},{"value":"1558-2868","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,4,10]]},"assertion":[{"value":"2022-02-08","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-01-16","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-04-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}