Abstract
With the trend toward open science, efforts are being made to make research data openly usable. Because shared research data may be used in interdisciplinary research, a method for writing abstracts that are easy for researchers in other fields to understand is needed. In this study, we focus on abstracts from Scientific Data, a journal specializing in research data. We examine the influence of each part of speech on the utilization of research data through multiple regression analysis of the number of occurrences of each part of speech, the number of words and index keywords in the abstract, and the number of citations of the research data article. Based on these results, we set the explanatory variables as the numbers of nouns, verbs, other parts of speech, words, and index keywords in the abstract. We then developed a classifier that estimates the number of citations using machine learning. We also analyzed the relationship between the number of citations and index keywords.
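The regression step described above can be illustrated with a minimal sketch. It assumes one row per abstract with counts of nouns, verbs, other parts of speech, and index keywords, and fits citations against them by ordinary least squares. The function names, the feature list, and the synthetic rows are all hypothetical placeholders, not the authors' code or data. The total word count is left out here only because, in this toy setup, it would equal the sum of the three part-of-speech counts.

```python
import numpy as np

# Hypothetical feature names for this sketch (one count per abstract).
FEATURES = ["nouns", "verbs", "other_pos", "index_keywords"]


def fit_ols(X, y):
    """Fit y ~ intercept + X by ordinary least squares.

    Returns an array [intercept, coefficient per feature].
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    # Prepend a column of ones so the first coefficient is the intercept.
    design = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef


def predict(coef, X):
    """Predict citation counts for rows of X from fitted coefficients."""
    X = np.asarray(X, dtype=float)
    return coef[0] + X @ coef[1:]


# Synthetic placeholder data, generated only to exercise the functions.
# These numbers are NOT taken from the paper.
rng = np.random.default_rng(0)
X_demo = rng.integers(1, 60, size=(30, len(FEATURES)))
true_coef = np.array([2.0, 0.5, -0.3, 0.1, 1.2])  # arbitrary, for the demo
y_demo = true_coef[0] + X_demo @ true_coef[1:]     # noise-free by design

coef_demo = fit_ols(X_demo, y_demo)
for name, value in zip(["intercept"] + FEATURES, coef_demo):
    print(f"{name}: {value:.3f}")
```

With real data, the counts would come from a part-of-speech tagger applied to each abstract, and the size and sign of each fitted coefficient would indicate how that count relates to citations.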
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Kai, N., Yoshihisa, T., Shimbaru, T., Yano, H., Tanushi, H. (2024). Citation Estimation Method Using Abstracts of Research Data Articles: A Focus on Scientific Data. In: Barolli, L. (eds) Advances on P2P, Parallel, Grid, Cloud and Internet Computing. 3PGCIC 2023. Lecture Notes on Data Engineering and Communications Technologies, vol 189. Springer, Cham. https://doi.org/10.1007/978-3-031-46970-1_1
DOI: https://doi.org/10.1007/978-3-031-46970-1_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46969-5
Online ISBN: 978-3-031-46970-1
eBook Packages: Engineering, Engineering (R0)