Abstract
Accurate prediction of stock trends is of great crucial to guiding investment management and financial policymaking. In recent years, text content generated by investors on various media platforms has had a big impact on stock movements. However, most of the existing studies apply equal weight to the text content to construct the sentiment index and ignore the fact that the influence of the sentiment index on the stock market will decrease with the increase of time interval. In this paper, we propose a model based on dual classifier coupling and sentiment analysis to study the trend of stock index. (1) We come up with a sentiment index weighted based on the reading volume to predict the trend of the stock index. (2) The exponentially weighted moving average (EWMA) model is used to modify the weighted sentiment index to obtain the modified sentiment index. (3) The maximum information coefficient (MIC) is applied to calculate the correlation between the closing price of the stock index and the modified sentiment index to obtain the best-modified sentiment index. (4) Two frequently used classifiers, convolutional neural network (CNN) and support vector machine (SVM), are used to build a dual-classifier coupling prediction model and adopt it for the final classification prediction. Two Chinese stock indexes (SSE 50 and CSI 300) are used to evaluate the predictive effect of the proposed model. In the practice case of the SSE 50 Index, the precision, accuracy, recall rate, F1-score, and AUC reached 81.38%, 80.99%, 80.99%, 81.04%, and 81.22%, respectively, after adding the modified sentiment index. In the practical case of the CSI 300 Index, the precision, accuracy, recall rate, F1-score, and AUC reached 80.07%, 80.07%, 80.07%, 80.06%, and 80.03%, respectively, after adding the modified sentiment index. It can be found through the experimental results that adding the investor sentiment index can advance the trend prediction of the stock index, and adding the modified sentiment index has a more obvious improvement effect. After adding the modified investor sentiment index, the prediction results of our proposed two-classifier coupled CNN-SVM model are much better than other benchmark models.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data Availability
The data that support the findings of this study are sourced from Eastmoney.com, a leading Internet financial service platform in China.
References
Guegan D. Chaos in economics and finance. Annu Rev Control. 2009;33(1):89–93.
Feuerriegel S, Gordon J. Long-term stock index forecasting based on text mining of regulatory disclosures. Decis Support Syst. 2018;112:88–97.
Xing FZ, Cambria E, Welsch RE. Natural language based financial forecasting: a survey. Artif Intell Rev. 2018;50:49–73.
Thakkar A, Chaudhari K. A comprehensive survey on deep neural networks for stock market: the need, challenges, and future directions. Expert Syst Appl. 2021;177:114800.
Zhang W, Wang M, Zhu Y-C. Does government information release really matter in regulating contagion-evolution of negative emotion during public emergencies? From the perspective of cognitive big data analytics, Int J Inf Manage. 2020;50:498–514.
Christie W, Huang R. Following the pied piper: do individual returns herd around the market? Financial Analysts Journal - Financ Anal J. 1995;51:31–7.
Gillam RA, Guerard J, John B, Cahan R. News volume information: beyond earnings forecasting in a global stock selection model. Int J Forecast. 2015;31:575–81.
Li Q, Wang TieJun, Li P, Liu L, Gong Q, Chen Y. The effect of news and public mood on stock movements. Inf Sci. 2014;278:826–40.
RenR WuDD, Liu T. Forecasting stock market movement direction using sentiment analysis and support vector machine. IEEE Syst J. 2019;13:760–70.
Nassirtoussi AK, Aghabozorgi S, Wah TY, Ngo DCL. Text mining for market prediction: a systematic review. Expert Syst Appl. 2014;41(16):7653–70.
Ma Y, Mao R, Lin Q, Wu P, Cambria E. Multi-source aggregated classification for stock price movement prediction. Information Fusion. 2023;91:515–28.
Li B, Chan KCC, Ou C, Ruifeng S. Discovering public sentiment in social media for predicting stock movement of publicly listed companies. Inf Syst. 2017;69:81–92.
Wang Q, Xu W, Zheng H. Combining the wisdom of crowds and technical analysis for financial market prediction using deep random subspace ensembles. Neurocomputing. 2018;299:51–61.
Mao R, Liu Q, He K, Li W, Cambria E. The biases of pre-trained language models: an empirical study on prompt-based sentiment analysis and emotion detection. IEEE Trans Affect Comput. 2020.
Kim SH, Kim D. Investor sentiment from internet message postings and the predictability of stock returns. J Econ Behav Organ. 2014;107:708–29.
Mao R, Li X, Ge M, Cambria E. MetaPro: a computational metaphor processing model for text pre-processing. Inf Fusion. 2022;86:30–43.
Yu Y, Duan W, Cao Q. The impact of social and conventional media on firm equity value: a sentiment analysis approach. Decis Support Syst. 2013;55(4):919–26.
Han X, Li Y. Can investor sentiment be a momentum time-series predictor? Evidence from China, J Empir Finance. 2017;42:212–39.
Gong X, Zhang W, Wang J, Wang C. Investor sentiment and stock volatility: new evidence. Int Rev Financ Anal. 2022;73:101612.
Long W, Zhao M, Tang Y. Can the Chinese volatility index reflect investor sentiment? Int Rev Financ Anal. 2021;73:101612.
Fama EF. The behavior of stock-market prices. J Bus. 1965;38(1):34–105.
Robert P. Schumaker, Hsinchun Chen, A quantitative stock prediction system based on financial news. Inf Process Manage. 2009;45(5):571–83.
Chen X, Ma X, Wang H, Li X, Zhang C. A hierarchical attention network for stock prediction based on attentive multi-view news learning. Neurocomputing. 2022;504:1–15.
Zhan XK, Li YH, Li RX, Gu XW, Habimana O, Wang HZ. Stock price prediction using time convolution long short-term memory network, in: KSEM 2018: Int J Manag Sci Eng Manag. 2018;11061:461–468.
Yamaguchi K. Reexamination of stock price reaction to environmental performance: a GARCH application. Ecol Econ. 2008;68(1):345–52.
Zhang GP. Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing. 2003;50(1):159–75.
Rojas I, Valenzuela O, Rojas F, Guillen A, Herrera LJ, Pomares H, Marquez L, Pasadas M. Soft-computing techniques and ARMA model for time series prediction. Neurocomputing. 2008;71(4–6):519–37.
Das M, Ghosh SK. Data-driven approaches for meteorological time series prediction: a comparative study of the state-of-the-art computational intelligence techniques. Pattern Recogn Lett. 2018;105:155–64.
Thakur M, Kumar D. A hybrid financial trading support system using multi-category classifiers and random forest. Appl Soft Comput. 2018;67:337–49. https://doi.org/10.1016/j.asoc.2018.03.006.
Chen YJ, Hao YJ. Integrating principle component analysis and weighted support vector machine for stock trading signals prediction. Neurocomputing. 2018;321:381–402.
Ning KF, Liu M, Dong MY. A new robust ELM method based on a Bayesian framework with heavy-tailed distribution and weighted likelihood function. Neurocomputing. 2015;149(B):891–903.
Nikou M, Mansourfar G, Bagherzadeh J. Stock price prediction using DEEP learning algorithm and its comparison with machine learning algorithms. Intell Sys Acc Fin Mgmt. 2019;26(4):164–74.
Knowledge extraction and retention based continual learning by using convolutional autoencoder-based learning classifier system, Inf Sci. 2022;591:287–305.
Huang W, Nakamori Y, Wang SY. Forecasting stock market movement direction with support vector machine. Comput Oper Res. 2005;32(10):2513–22.
Illa PK, Parvathala B, Sharma AK. Stock price prediction methodology using random forest algorithm and support vector machine, Materials Today: Proceedings, 2022;56(4):1776–82.
Fischer T, Krauss C. Deep learning with long short-term memory networks for financial market predictions. Eur J Oper Res. 2018;270(2):654–69.
Ma C, Yan S. Deep learning in the Chinese stock market: the role of technical indicators, Finance Res Lett, 2022;49:103025.
Kim HY, Won CH. Forecasting the volatility of stock price index: a hybrid model integrating LSTM with multiple GARCH-type models, Expert Syst Appl. 2018.
Jing N, Wu Z, Wang H. A hybrid model integrating deep learning with investor sentiment analysis for stock price prediction, Expert Syst Appl. 2021;178:115019.
Wang J, Wang X, Li J, Wang H. A prediction model of CNN-TLSTM for USD/CNY exchange rate prediction. IEEE Access. 2021;9:73346–54.
Li W, Zhu L, Shi Y. User reviews: sentiment analysis using lexicon integrated two-channel CNN-LSTM family models. Appl Soft Comput J. 2020.
Shah J, Vaidya D, Shah M. A comprehensive review on multiple hybrid deep learning approaches for stock prediction. Int Syst Appl. 2022;16:200111.
Tian H, Gao C, Xiao X, Liu H, He B, Wu H, Wang H, Wu F. Skep: sentiment knowledge enhanced pre-training for sentiment analysis, arXiv preprint arXiv:2005.05635, 2020:4067–4076.
Jiang F, Lee J, Martin X, Zhou G. Manager sentiment and stock returns. J Financ Econ. 2019;132(1):126–49.
Roberts SW. Control chart tests based on geometric moving averages. Technometrics. 1959;1(3):239–50.
Reshef DN, Reshef YA, Finucane HK, Grossman SR, McVean G, Turnbaugh PJ, Lander ES, Mitzenmacher M, Sabeti PC. Detecting novel associations in large data sets. Science. 2011;80(334):1518–24.
Sun G, Li J, Dai J, Song Z, Lang F. Feature selection for IoT based on maximal information coefficient, Future Gener. Comput Syst. 2018;89:606–16.
LeCun Y, et al. Backpropagation applied to handwritten zip code recognition. Neural Comput. 1989;1(4):541–51.
Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273–97.
Funding
This research was supported by the National Natural Science Foundation of China (Grant No. 71971122 and 71501101).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethical Approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed Consent
Informed consent was obtained from all individual participants included in the study.
Conflict of Interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, J., Zhu, S. A Novel Stock Index Direction Prediction Based on Dual Classifier Coupling and Investor Sentiment Analysis. Cogn Comput 15, 1023–1041 (2023). https://doi.org/10.1007/s12559-023-10137-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12559-023-10137-4