{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,23]],"date-time":"2024-08-23T19:09:46Z","timestamp":1724440186645},"reference-count":26,"publisher":"World Scientific Pub Co Pte Ltd","issue":"02","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Artif. Intell. Tools"],"published-print":{"date-parts":[[2024,3]]},"abstract":" Text classification involves organizing textual information into predefined classes, a task which is particularly useful in domains like sentiment analysis, spam detection, and content labeling. In India, where a massive amount of information is generated daily through newspapers and social media, Hindi is one of the most widely used and spoken languages. However, there is limited research on Hindi text classification and, particularly, regarding Hindi news classification. This paper presents a research study to classify Hindi news articles published in Hindi-language newspapers in India by using and comparing various Machine Learning (ML) and Deep Learning (DL) algorithms. To prepare the textual news data for classification, pre-processing and feature engineering techniques, such as count vectorizer, Tf-Idf vectorizer and Doc2Vec, were used and applied to convert texts into vectors. This pre-processing step on the textual data was very challenging due to the presence of multimodal words, conjunctions, punctuation, and special characters in Hindi texts. The study considered Hindi news headlines from predetermined categories (Science, Sports, Entertainment and Business) and, among the different ML and DL models tested and evaluated, Linear Regression with Doc2Vec vectorizer and SGD classifier with Tf-Idf vectorizer produced best accuracies of 97.04% and 96.59%, respectively. The best performing DL model was found to be the Bi-LSTM with an accuracy of approximately 97% on the testing data. <\/jats:p>","DOI":"10.1142\/s0218213023500641","type":"journal-article","created":{"date-parts":[[2023,10,11]],"date-time":"2023-10-11T15:13:52Z","timestamp":1697037232000},"source":"Crossref","is-referenced-by-count":1,"title":["Classifying Hindi News Using Various Machine Learning and Deep Learning Techniques"],"prefix":"10.1142","volume":"33","author":[{"given":"Anusha","family":"Chhabra","sequence":"first","affiliation":[{"name":"Biometric Research Laboratory, IT Department, DTU, Delhi, India"}]},{"given":"Monika","family":"Arora","sequence":"additional","affiliation":[{"name":"IT Department, BPIT, Delhi, India"}]},{"given":"Arpit","family":"Sharma","sequence":"additional","affiliation":[{"name":"Computer Science and Technology, Delhi Technological University, New Delhi, India"}]},{"given":"Harsh","family":"Singh","sequence":"additional","affiliation":[{"name":"Computer Science and Technology, Delhi Technological University, New Delhi, India"}]},{"given":"Saurabh","family":"Verma","sequence":"additional","affiliation":[{"name":"Tiger Analytics, New Delhi, India"}]},{"given":"Rachna","family":"Jain","sequence":"additional","affiliation":[{"name":"IT Department, BPIT, Delhi, India"}]},{"given":"Biswaranjan","family":"Acharya","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering-AI, Marwadi University, Rajkot, Gujarat, India"}]},{"given":"Vassilis C.","family":"Gerogiannis","sequence":"additional","affiliation":[{"name":"Department of Digital Systems, University of Thessaly, Larissa, Greece"}]},{"given":"Dimitrios","family":"Tzimos","sequence":"additional","affiliation":[{"name":"Department of Digital Systems, University of Thessaly, Larissa, Greece"}]},{"given":"Andreas","family":"Kanavos","sequence":"additional","affiliation":[{"name":"Department of Informatics, Ionian University, Corfu, Greece"}]}],"member":"219","published-online":{"date-parts":[[2024,3,30]]},"reference":[{"key":"S0218213023500641BIB002","first-page":"48","volume":"8","author":"Arif H.","year":"2016","journal-title":"ICICC"},{"key":"S0218213023500641BIB003","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-16-2597-8_27"},{"key":"S0218213023500641BIB004","doi-asserted-by":"publisher","DOI":"10.7763\/LNSE.2014.V2.134"},{"key":"S0218213023500641BIB005","doi-asserted-by":"publisher","DOI":"10.1016\/j.jksuci.2015.11.003"},{"key":"S0218213023500641BIB006","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"S0218213023500641BIB008","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1007\/978-3-030-44689-5_9","volume-title":"11th Int. Conf. on Intelligent Human Computer Interaction (IHCI)","volume":"11886","author":"Joshi R.","year":"2019"},{"key":"S0218213023500641BIB009","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-018-09677-1"},{"key":"S0218213023500641BIB010","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2019.2906197"},{"key":"S0218213023500641BIB011","doi-asserted-by":"publisher","DOI":"10.1142\/S0218213019600108"},{"key":"S0218213023500641BIB012","doi-asserted-by":"publisher","DOI":"10.1142\/S0218213016500184"},{"key":"S0218213023500641BIB013","doi-asserted-by":"publisher","DOI":"10.1016\/j.compeleceng.2017.09.011"},{"issue":"1","key":"S0218213023500641BIB014","first-page":"4","volume":"1","author":"Khan A.","year":"2010","journal-title":"Journal of Advances in Information Technology"},{"key":"S0218213023500641BIB015","doi-asserted-by":"publisher","DOI":"10.3390\/info10040150"},{"key":"S0218213023500641BIB017","first-page":"1188","volume-title":"31st Int. Conf. on Machine Learning (ICML)","volume":"32","author":"Le Q. V.","year":"2014"},{"key":"S0218213023500641BIB018","doi-asserted-by":"publisher","DOI":"10.5220\/0010022404170424"},{"key":"S0218213023500641BIB019","doi-asserted-by":"publisher","DOI":"10.3390\/a16030167"},{"issue":"3","key":"S0218213023500641BIB020","first-page":"1578","volume":"7","author":"Rani K.","year":"2016","journal-title":"International Journal of Computer Science and Information Technologies (IJCSIT)"},{"key":"S0218213023500641BIB021","doi-asserted-by":"publisher","DOI":"10.1109\/78.650093"},{"key":"S0218213023500641BIB022","first-page":"313","volume":"7","author":"Seshadri S.","year":"2016","journal-title":"IIOAB"},{"key":"S0218213023500641BIB023","doi-asserted-by":"publisher","DOI":"10.1109\/ICDSE47409.2019.8971796"},{"key":"S0218213023500641BIB024","doi-asserted-by":"publisher","DOI":"10.1109\/CIFEr.2014.6924063"},{"issue":"8","key":"S0218213023500641BIB025","volume":"7","author":"Usman M.","year":"2016","journal-title":"International Journal of Advanced Computer Science and Applications (IJACSA)"},{"key":"S0218213023500641BIB026","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-022-07650-2"},{"key":"S0218213023500641BIB027","doi-asserted-by":"publisher","DOI":"10.1016\/S0893-6080(05)80023-1"},{"key":"S0218213023500641BIB028","doi-asserted-by":"publisher","DOI":"10.4304\/jcp.7.12.2913-2920"},{"key":"S0218213023500641BIB029","doi-asserted-by":"publisher","DOI":"10.1109\/ICSESS.2018.8663882"}],"container-title":["International Journal on Artificial Intelligence Tools"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218213023500641","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,29]],"date-time":"2024-03-29T04:56:17Z","timestamp":1711688177000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218213023500641"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3]]},"references-count":26,"journal-issue":{"issue":"02","published-print":{"date-parts":[[2024,3]]}},"alternative-id":["10.1142\/S0218213023500641"],"URL":"https:\/\/doi.org\/10.1142\/s0218213023500641","relation":{},"ISSN":["0218-2130","1793-6349"],"issn-type":[{"value":"0218-2130","type":"print"},{"value":"1793-6349","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,3]]}}}