Citation Estimation Method Using Abstracts of Research Data Articles: A Focus on Scientific Data | SpringerLink
Skip to main content

Citation Estimation Method Using Abstracts of Research Data Articles: A Focus on Scientific Data

  • Conference paper
  • First Online:
Advances on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC 2023)

Abstract

With the trend of open science, efforts have been made to openly utilize research data. Considering the use of shared research data for interdisciplinary research, developing a researcher-friendly abstract writing method in different research fields is pertinent. In this study, we focus on abstracts from Scientific Data, a journal specializing in research data. We examine the influence of each part of speech on the utilization of research data through multiple regression analysis of the number of occurrences of the part of speech, the number of words and index-keywords in the abstract, and the number of citations research data article. Based on these results, we set the explanatory variables as the number of nouns, verbs, the other parts of speech, words, and index-keywords in the abstract. Thereafter, we developed a classifier to estimate the number of citations using machine learning. An analysis of the relationship between the number of citations and index keywords was also conducted.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 28599
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 35749
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Data Share: https://datashare.ed.ac.uk/. Last accessed 1 Dec 2022

  2. Hrynaszkiewicz, I., Yoko, S.: Open access and open data journals to facilitate scientific data reuse. Inf. Manage. 57–9, 629–640 (2014)

    Google Scholar 

  3. Scientific Data: https://www.nature.com/sdata/. Last accessed 1 Dec 2022

  4. Data in Brief: https://journals.elsevier.com/data-in-brief. Last accessed 1 Dec 2022

  5. Earth System Science Data: https://www.earth-system-science-data.net. Last accessed 1 Dec 2022

  6. Chemical Data Collections: https://www.sciencedirect.com/journal/chemical-data-collections. Last accessed 1 Dec 2022

  7. Naoto, K.: A Study of Point of View on Inheritance of Tacit Knowledge to Promote Utilization of Research Data, Research Report of Information Processing Society of Japan, pp. 1–6. IOT (2022)

    Google Scholar 

  8. Kai, N., Shimbaru, T.: Characteristic analysis of data description in highly cited research data. IIAI Lett. Institut. Res. 1, 10–18 (2022)

    Article  Google Scholar 

  9. Kai, N., Yoshihisa, T., Shimbaru, T., Yano, H., Tanushi, H.: Citation Estimation Method Using Abstracts of Research Data Articles: Using Abstracts of Scientific Data Articles as An Example, IPSJ SIG Technical Report IOT-60, pp. 1–5 (2023)

    Google Scholar 

  10. Jinfang, N.: Overcoming inadequate documentation. Proc. Am. Soc. Inform. Sci. Technol. 46, 1–14 (2010)

    Google Scholar 

  11. Wong, K.F., Wu, M., Li, W.: Extractive summarization using supervised and semi-supervised learning. In: International Conference on Computational Linguistics, pp. 985–992 (2008)

    Google Scholar 

  12. Matsumoto, K., Sakurai, D., Inoue, R.: Trends in nursing research on palliative care: content analysis of abstracts using text mining. Bull. Tokai Univ. Health Sci. 23, 107–112 (2018)

    Google Scholar 

  13. Koyama, T., Michimen, K.: Rehabilitation of central nervous system disorders: characteristics of medical care. Rehabil. Med. 58, 317–325 (2021)

    Google Scholar 

  14. Higuchi, K.: Quantitative Text Analysis for Social Research: Toward Inheritance and Development of Content Analysis, 2nd edn. Nakanishiya Shuppan, Japan (2020)

    Google Scholar 

  15. Stanford Log-linear Part-Of-Speech Tagger. https://nlp.stanford.edu/software/tagger.shtml. Last accessed 1 Dec 2022

  16. R Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/ (2022). Last accessed on 1 Dec 2022

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tomoki Yoshihisa .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kai, N., Yoshihisa, T., Shimbaru, T., Yano, H., Tanushi, H. (2024). Citation Estimation Method Using Abstracts of Research Data Articles: A Focus on Scientific Data. In: Barolli, L. (eds) Advances on P2P, Parallel, Grid, Cloud and Internet Computing . 3PGCIC 2023. Lecture Notes on Data Engineering and Communications Technologies, vol 189. Springer, Cham. https://doi.org/10.1007/978-3-031-46970-1_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-46970-1_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-46969-5

  • Online ISBN: 978-3-031-46970-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics