{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,3,26]],"date-time":"2024-03-26T23:19:21Z","timestamp":1711495161035},"reference-count":45,"publisher":"Emerald","issue":"5","license":[{"start":{"date-parts":[[2020,11,10]],"date-time":"2020-11-10T00:00:00Z","timestamp":1604966400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["LHT"],"published-print":{"date-parts":[[2022,11,22]]},"abstract":"Purpose: The World Wide Web has become an essential platform for a news publication, and it has become one of the primary sources of information dissemination in the past few years. Electronic media, i.e., television channels, magazines and newspapers, have started publishing news online. This online information is prompt to be disappeared because of short life-span and imperative to be archived for the long-term and future generations. This paper presents a content-based similarity measure based on the headings of the news articles for linking digital news stories published in various newspapers during the preservation process that helps to ensure future accessibility. Design\/methodology\/approach: To evaluate the accuracy and assess the effectiveness and worth of the proposed measure for linking news articles in Digital News Story Archive (DNSA), we adopted both, system-centric and user-centric (human judgment) evaluation over different datasets of news articles. Findings: The proposed similarity measure is evaluated using different sizes of datasets, and the results are compared by both user-centric technique, i.e., expert judgment and system-centric techniques, i.e., cosine similarity measure, extended Jaccard measure and common ratio measure for stories (CRMS). 
The comparison helps to get a broader impact and can be helpful for generalization of the measure for different categories of news articles. Multiple experiments have conducted the findings of which showed that the measure presented viable results for national and international news, while best results for linking sports news articles during preservation based on headings. Originality\/value: The DNSA preserves a huge number of news articles from multiple news sources and to link with a vast collection, which encourages to introduce an efficient linking mechanism with few terms to manipulate. The CRMS is modified to deal with the headings of news articles as a part of the digital news stories preservation framework and comprehensively analysed.","DOI":"10.1108\/lht-07-2020-0157","type":"journal-article","created":{"date-parts":[[2020,11,10]],"date-time":"2020-11-10T06:32:26Z","timestamp":1604989946000},"page":"1359-1383","source":"Crossref","is-referenced-by-count":5,"title":["The role of news title for linking during preservation process in digital archives"],"prefix":"10.1108","volume":"40","author":[{"ORCID":"http:\/\/orcid.org\/0000-0003-4656-1041","authenticated-orcid":false,"given":"Muzammil","family":"Khan","sequence":"first","affiliation":[]},{"given":"Sarwar Shah","family":"Khan","sequence":"additional","affiliation":[]},{"ORCID":"http:\/\/orcid.org\/0000-0002-3576-8365","authenticated-orcid":false,"given":"Arshad","family":"Ahmad","sequence":"additional","affiliation":[]},{"given":"Arif Ur","family":"Rahman","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2020,11,10]]},"reference":[{"key":"key2022112909401451800_ref001","first-page":"485","article-title":"Personalised click shaping through Lagrangian duality for online 
recommendation","year":"2012"},{"issue":"4","key":"key2022112909401451800_ref002","doi-asserted-by":"crossref","first-page":"442","DOI":"10.1177\/0165551513480107","article-title":"Sustaining accessibility of information through digital preservation: a literature review","volume":"39","year":"2013","journal-title":"Journal of Information Science"},{"key":"key2022112909401451800_ref003","article-title":"A signal based approach to news recommendation","year":"2016"},{"key":"key2022112909401451800_ref004","first-page":"1","article-title":"Semantics based news recommendation","year":"2012"},{"key":"key2022112909401451800_ref005","first-page":"1836","article-title":"Information-theoretic and set-theoretic similarity","year":"2016"},{"key":"key2022112909401451800_ref006","volume-title":"The Associated Press Stylebook and Briefing on Media Law","year":"2014"},{"issue":"2","key":"key2022112909401451800_ref007","first-page":"105","article-title":"On indexing of key words","volume":"16","year":"2004","journal-title":"Acta Editologica"},{"key":"key2022112909401451800_ref008","article-title":"A data curation experiment at u. 
porto using dspace","year":"2011"},{"key":"key2022112909401451800_ref009","volume-title":"Basic Public Affair Specialist Course, Newswriting","year":"2012"},{"key":"key2022112909401451800_ref010","first-page":"825","article-title":"An analysis of recommender algorithms for online news","volume-title":"CLEF (Working Notes)","year":"2014"},{"key":"key2022112909401451800_ref011","article-title":"Why headlines are important","year":"2013","journal-title":"Sports Lens Newspaper"},{"key":"key2022112909401451800_ref012","doi-asserted-by":"crossref","first-page":"16702","DOI":"10.1109\/ACCESS.2020.2967792","article-title":"News recommendation systems-accomplishments, challenges and future directions","volume":"8","year":"2020","journal-title":"IEEE Access"},{"key":"key2022112909401451800_ref013","first-page":"583","article-title":"Real-time news recommender system","year":"2010"},{"key":"key2022112909401451800_ref014","first-page":"482","article-title":"News junkie: providing personalised newsfeeds via analysis of information novelty","year":"2004"},{"key":"key2022112909401451800_ref015","first-page":"437","article-title":"Personalised news recommendation based on collaborative filtering","year":"2012"},{"issue":"3-4","key":"key2022112909401451800_ref016","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1300\/J104v40n03_02","article-title":"Understanding metadata and metadata schemes","volume":"40","year":"2005","journal-title":"Cataloging and Classification Quarterly"},{"key":"key2022112909401451800_ref017","first-page":"350","article-title":"Digital news story preservation framework","year":"2015"},{"issue":"1","key":"key2022112909401451800_ref018","doi-asserted-by":"crossref","first-page":"71","DOI":"10.6017\/ital.v38i1.10181","article-title":"A systematic approach towards web preservation","volume":"38","year":"2019","journal-title":"Information Technology and Libraries"},{"key":"key2022112909401451800_ref019","first-page":"85","article-title":"Normalising 
digital news-stories for preservation","year":"2016"},{"issue":"6","key":"key2022112909401451800_ref050","first-page":"430","article-title":"Exploring the digital world of newspaper archives","volume":"32","year":"2017","journal-title":"Science Technology Journal (Ci\u00eancia e T\u00e9cnica Vitivin\u00edcola)"},{"key":"key2022112909401451800_ref020","first-page":"127","article-title":"Term-based approach for linking digital news stories","year":"2018"},{"key":"key2022112909401451800_ref021","first-page":"50","article-title":"The role of named entities in linking news articles during preservation","year":"2018"},{"key":"key2022112909401451800_ref022","doi-asserted-by":"publisher","DOI":"10.1177\/0165551520937614","article-title":"A content-based technique for linking dual language news articles in an archive","year":"2020","journal-title":"Journal of Information Science"},{"key":"key2022112909401451800_ref023","volume-title":"Using Text Processing Techniques for Linking News Stories for Digital Preservation","year":"2018"},{"issue":"2","key":"key2022112909401451800_ref024","doi-asserted-by":"crossref","first-page":"282","DOI":"10.1108\/02635571111115182","article-title":"Text recommender system using user's usage patterns","volume":"111","year":"2011","journal-title":"Industrial Management and Data Systems"},{"issue":"1","key":"key2022112909401451800_ref025","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1016\/j.eswa.2005.11.010","article-title":"Moners: a news recommender for the mobile web","volume":"32","year":"2007","journal-title":"Expert Systems with Applications"},{"key":"key2022112909401451800_ref026","first-page":"261","article-title":"Complementarity, F-score, and NLP evaluation","volume":"16","year":"2016","journal-title":"Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC)"},{"key":"key2022112909401451800_ref027","first-page":"305","article-title":"News recommendation via hypergraph learning: 
encapsulation of user behavior and news content","year":"2013"},{"issue":"5","key":"key2022112909401451800_ref028","doi-asserted-by":"crossref","first-page":"754","DOI":"10.1007\/s11390-011-0175-2","article-title":"Personalised news recommendation: a review and an experimental investigation","volume":"26","year":"2011","journal-title":"Journal of Computer Science and Technology"},{"issue":"7","key":"key2022112909401451800_ref029","doi-asserted-by":"crossref","first-page":"3168","DOI":"10.1016\/j.eswa.2013.11.020","article-title":"Modeling and broadening temporal user interest in personalised news recommendation","volume":"41","year":"2014","journal-title":"Expert Systems with Applications"},{"key":"key2022112909401451800_ref030","first-page":"158","article-title":"Enhancing subject metadata with automated weighting in the medical domain: a comparison of different measures","year":"2015"},{"key":"key2022112909401451800_ref031","first-page":"829","article-title":"Recommender systems","volume":"1","year":"2010","journal-title":"Encyclopedia of Machine Learning"},{"key":"key2022112909401451800_ref032","doi-asserted-by":"crossref","first-page":"829","DOI":"10.1007\/978-0-387-30164-8_705","article-title":"Recommender systems","volume-title":"Encyclopedia of Machine Learning","year":"2011"},{"key":"key2022112909401451800_ref033","volume-title":"The Importance of a Good Headline","year":"2012"},{"key":"key2022112909401451800_ref034","first-page":"37","article-title":"Emotional news recommender system","year":"2015"},{"key":"key2022112909401451800_ref035","first-page":"157","article-title":"A user-centric evaluation framework for recommender systems","year":"2011"},{"key":"key2022112909401451800_ref036","first-page":"81","article-title":"Model migration approach for database preservation","year":"2010"},{"issue":"4","key":"key2022112909401451800_ref037","first-page":"1093","article-title":"Term based similarity measure for text classification and clustering using fuzzy 
c-means algorithm","volume":"3","year":"2014","journal-title":"International Journal of Engineering Sciences and Research Technology (IJSETR)"},{"key":"key2022112909401451800_ref038","first-page":"237","article-title":"Do recommendations matter? : news recommendation in real life","year":"2014"},{"key":"key2022112909401451800_ref039","article-title":"Recommendation system for news reader","year":"2013"},{"issue":"4","key":"key2022112909401451800_ref040","first-page":"35","article-title":"Modern information retrieval: a brief overview","volume":"24","year":"2001","journal-title":"IEEE Data Engineering Bulletin"},{"key":"key2022112909401451800_ref041","first-page":"487","article-title":"Data mining cluster analysis: basic concepts and algorithms","year":"2013","journal-title":"Introduction to Data Mining"},{"key":"key2022112909401451800_ref042","article-title":"Similarity for news recommender systems","year":"2006"},{"key":"key2022112909401451800_ref043","volume-title":"Purpose of Headlines Is to Draw Readers into a Story","year":"2012"},{"issue":"1","key":"key2022112909401451800_ref044","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1108\/AAOUJ-03-2019-0015","article-title":"The research trends in recommender systems for e-learning","volume":"14","year":"2019","journal-title":"Asian Association of Open Universities Journal"}],"container-title":["Library Hi 
Tech"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/LHT-07-2020-0157\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/LHT-07-2020-0157\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,11,29]],"date-time":"2022-11-29T09:40:57Z","timestamp":1669714857000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/LHT-07-2020-0157\/full\/html"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,10]]},"references-count":45,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2020,11,10]]},"published-print":{"date-parts":[[2022,11,22]]}},"alternative-id":["10.1108\/LHT-07-2020-0157"],"URL":"https:\/\/doi.org\/10.1108\/lht-07-2020-0157","relation":{},"ISSN":["0737-8831"],"issn-type":[{"value":"0737-8831","type":"print"}],"subject":[],"published":{"date-parts":[[2020,11,10]]}}}