A Systematic Review of Wikidata in GLAM Institutions: a Labs Approach | SpringerLink
Skip to main content

A Systematic Review of Wikidata in GLAM Institutions: a Labs Approach

  • Conference paper
  • First Online:
Linking Theory and Practice of Digital Libraries (TPDL 2024)

Abstract

GLAM (Galleries, Libraries, Archives and Museums) organisations host extensive digital collections that have been made available to the public for over three decades. Advances in technology have facilitated the reuse of digital collections as rich data sources. In this context, Wikidata has emerged as a leading, innovative, and collaborative platform that enriches the digital collections provided by GLAM institutions. However, a comprehensive analysis of the current status, potential and challenges of its use in the GLAM sector is still lacking. This paper presents a systematic review of Wikidata use in GLAM institutions within the context of the work of the International GLAM Labs Community (glamlabs.io). The results summarise academic literature on Wikidata projects. The paper argues that Wikidata’s potential in the domain of reuse, an important component of the FAIR principles, can be better integrated into the practices of GLAM institutions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 13727
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 17159
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://data.rijksmuseum.nl/.

  2. 2.

    https://datos.bne.es/.

  3. 3.

    https://dblp.uni-trier.de/.

  4. 4.

    https://github.com/hibernator11/wikidata-review.

  5. 5.

    https://journal.code4lib.org/.

  6. 6.

    https://openhumanitiesdata.metajnl.com/.

  7. 7.

    https://www.wikidata.org/wiki/Wikidata:WikidataCon_2023.

References

  1. Alam, M., de Boer, V., Daga, E., van Erp, M., Hyvönen, E., Meroño-Peñuela, A.: Editorial of the special issue on cultural heritage and semantic web. Semantic Web 14(2), 155–158 (2023). https://doi.org/10.3233/SW-223187

  2. Ames, S., Lewis, S.: Disrupting the library: digital scholarship and Big Data at the National Library of Scotland. Big Data Soc. 7(2), 2053951720970576 (2020). https://doi.org/10.1177/2053951720970576

  3. Bartalesi, V., Pratelli, N., Lenzi, E.: Linking different scientific digital libraries in digital humanities: the IMAGO case study. Int. J. Digit. Libr. 23(4), 303–317 (2022). https://doi.org/10.1007/s00799-022-00331-4

  4. Beghaeiraveri, S.A.H., et al.: Wikidata subsetting: approaches, tools, and evaluation. Semantic Web (2023). https://www.semantic-web-journal.net/system/files/swj3491.pdf

  5. Bianchini, C.: Wikidata for JLIS. it. a new step forward mapping Italian library and information science journals. JLIS. it 12(1), 29–38 (2021). https://doi.org/10.4403/jlis.it-12680, https://jlis.fupress.net/index.php/jlis/article/view/13

  6. Bianchini, C., Bargioni, S., Pellizzari di San Girolamo, C.C.: Beyond VIAF: Wikidata as a complementary tool for authority control in libraries. Inform. Technol. Libr. 40(2) (2021). https://doi.org/10.6017/ital.v40i2.12959, https://ital.corejournals.org/index.php/ital/article/view/12959

  7. Bianchini, C., Sardo, L.: Wikidata: a new perspective towards universal bibliographic control. JLIS. it 13(1), 291–311 (2022). https://doi.org/10.4403/jlis.it-12725, https://jlis.fupress.net/index.php/jlis/article/view/439

  8. Bianchini, C., Spinelli, P.: Wikidata at fondazione levi (Venice, Italy). a case study for the publication of data about Fondo Gambara, a collection of 202 musicians’ portraits. JLIS. it 11(3), 16–38 (2020). https://doi.org/10.4403/jlis.it-12648, https://jlis.fupress.net/index.php/jlis/article/view/33

  9. Blankemeyer, B.: Opening our deep backfiles: identifying open and public domain serial content in library collections. Ser. Rev. 47(3–4), 145–146 (2021). https://doi.org/10.1080/00987913.2021.1939922

  10. Boccone, A.: The role of the Wikidata librarian in a renewed Bibliographical Universe: “next generation metadata”, next generation librarians. JLIS. it 13(2), 45–57 (2022). https://doi.org/10.36253/jlis.it-460, https://jlis.it/index.php/jlis/article/view/460

  11. Boccone, A., Maio, T.: Libraries and librarians in the Covid-19 Wikiproject: authority control, quality content and linked open data. AIB Stud. 60(2) (2020). https://doi.org/10.2426/aibstudi-12189, https://aibstudi.aib.it/article/view/12189

  12. Budgen, D., Brereton, P.: Performing systematic literature reviews in software engineering. In: Osterweil, L.J., Rombach, H.D., Soffa, M.L. (eds.) 28th International Conference on Software Engineering (ICSE 2006), Shanghai, China, 20-28 May 2006, pp. 1051–1052. ACM (2006). https://doi.org/10.1145/1134285.1134500

  13. Cabrerizo, F.J., Morente-Molinera, J.A., Pérez, I.J., Gijón, J.L., Herrera-Viedma, E.: A decision support system to develop a quality management in academic digital libraries. Inf. Sci. 323, 48–58 (2015). https://doi.org/10.1016/j.ins.2015.06.022

  14. Canal, F.Z., et al.: A survey on facial emotion recognition techniques: a state-of-the-art literature review. Inf. Sci. 582, 593–617 (2022). https://doi.org/10.1016/j.ins.2021.10.005

  15. Candela, G.: An automatic data quality approach to assess semantic data from cultural heritage institutions. J. Am. Soc. Inf. Sci. 74(7), 866–878 (2023). https://doi.org/10.1002/asi.24761, https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/asi.24761

  16. Candela, G.: Towards a semantic approach in GLAM labs: the case of the data foundry at the national library of Scotland. J. Inf. Sci., 01655515231174386 (2023). https://doi.org/10.1177/01655515231174386

  17. Candela, G., Chambers, S., Sherratt, T.: An approach to assess the quality of Jupyter projects published by GLAM institutions. J. Assoc. Inform. Sci. Technol. n/a(n/a) (2023). https://doi.org/10.1002/asi.24835, https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/asi.24835

  18. Candela, G., Escobar, P., Carrasco, R.C., Marco-Such, M.: A linked open data framework to enhance the discoverability and impact of culture heritage. J. Inf. Sci. 45(6) (2019). https://doi.org/10.1177/0165551518812658

  19. Candela, G., Escobar, P., Carrasco, R.C., Marco-Such, M.: Evaluating the quality of linked open data in digital libraries. J. Inf. Sci. 48(1), 21–43 (2022). https://doi.org/10.1177/0165551520930951

  20. Candela, G., Escobar, P., Sáez, D., Marco-Such, M.: A Shape Expression approach for assessing the quality of linked open data in libraries. Semantic Web 14(2), 159–179 (2023). https://doi.org/10.3233/SW-210441

  21. Candela, G., et al.: A checklist to publish collections as data in GLAM institutions. CoRR abs/2304.02603 (2023). https://doi.org/10.48550/arXiv.2304.02603

  22. Candela, G., et al.: An ontological approach for unlocking the colonial archive. J. Comput. Cult. Herit. (2023). https://doi.org/10.1145/3594727, just Accepted

  23. Cantallops, M.M., Sánchez-Alonso, S., García-Barriocanal, E.: A systematic literature review on Wikidata. Data Technol. Appl. 53(3), 250–268 (2019). https://doi.org/10.1108/DTA-12-2018-0110

  24. Chambers, S., et al.: Position statements - collections as data: state of the field and future directions (2023)

    Google Scholar 

  25. Chen, Y.: An investigation of linked data catalogue features in libraries, archives, and museums: a checklist approach. Electron. Libr. 41(5), 700–721 (2023). https://doi.org/10.1108/EL-03-2023-0070

  26. Clark, J.A., Williams, H.K.R., Rossmann, D.: Wikidata and knowledge graphs in practice: using semantic SEO to create discoverable, accessible, machine-readable definitions of the people, places, and services in libraries and archives. Inf. Serv. Use 42(3–4), 377–390 (2022). https://doi.org/10.3233/ISU-220171

  27. Colla, D., Goy, A., Leontino, M., Magro, D.: Wikidata support in the creation of rich semantic metadata for historical archives. Appl. Sci. 11(10) (2021). https://doi.org/10.3390/app11104378, https://www.mdpi.com/2076-3417/11/10/4378

  28. Cornolti, M., Ferragina, P., Ciaramita, M., Rüd, S., Schütze, H.: SMAPH: A piggyback approach for entity-linking in web queries. ACM Trans. Inf. Syst. 37(1) (2018). https://doi.org/10.1145/3284102

  29. Cramer, T., German, C., Jefferies, N., Wise, A.: A perpetual motion machine: The preserved digital scholarly record. Learn. Publ. 36(2), 312–318 (2023). https://doi.org/10.1002/leap.1494

  30. Dijkshoorn, C., et al.: The Rijksmuseum collection as linked data. Semantic Web 9(2), 221–230 (2018). https://doi.org/10.3233/SW-170257

  31. Elizarov, A., Gafurova, P., Lipachev, E.: Wikidata in metadata formation methods for documents of digital mathematical library. In: Scientific service & Internet: proceedings of the 23rd All-Russian Scientific Conference (September 20-23, 2021, online), CEUR, vol. 230, pp. 23–33. Keldysh Institute of Applied Mathematics (2021). https://doi.org/10.20948/abrau-2021-3-ceur

  32. Färber, M., Braun, C., Popovic, N., Saier, T., Noullet, K.: Which publications’ metadata are in which bibliographic databases? A system for exploration. In: Frommholz, I., Mayr, P., Cabanac, G., Verberne, S. (eds.) Proceedings of the 12th International Workshop on Bibliometric-enhanced Information Retrieval co-located with 44th European Conference on Information Retrieval (ECIR 2022), Stavanger, Norway (hybrid), April 10th, 2022. CEUR Workshop Proceedings, vol. 3230, pp. 39–44. CEUR-WS.org (2022). https://ceur-ws.org/Vol-3230/paper-06.pdf

  33. Feliciati, P.: Call me by your name: towards an authority data control shared between archives and libraries. JLIS. it 13(1), 203–214 (2022). https://doi.org/10.4403/jlis.it-12733, https://jlis.fupress.net/index.php/jlis/article/view/432

  34. Fischer, B.: Towards an open and collaborative authority control. JLIS. it 13(1), 283–290 (2022). https://doi.org/10.4403/jlis.it-12767, https://jlis.fupress.net/index.php/jlis/article/view/438

  35. Freire, N., Manguinhas, H., Isaac, A.: An observational study of equivalence links in cultural heritage linked data for agents. In: Hall, M., Merčun, T., Risse, T., Duchateau, F. (eds.) TPDL 2020. LNCS, vol. 12246, pp. 62–70. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-54956-5_5

    Chapter  Google Scholar 

  36. Freire, N., Proença, D.: RDF reasoning on large ontologies: a study on cultural heritage and Wikidata. In: Maglogiannis, I., Iliadis, L., Pimenidis, E. (eds.) AIAI 2020. IAICT, vol. 583, pp. 381–393. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-49161-1_32

    Chapter  Google Scholar 

  37. Iorio, A.D., Rossi, D.: Capturing and managing knowledge using social software and semantic web technologies. Inf. Sci. 432, 1–21 (2018). https://doi.org/10.1016/j.ins.2017.12.009

  38. Jain, N., Múnera, A.S., Ehmueller, J., Krestel, R.: Generation of training data for named entity recognition of artworks. Semantic Web 14(2), 239–260 (2023). https://doi.org/10.3233/SW-223177

  39. Jeremy Myntti, Nicole Lewis, A.M.M., Rockwell, K.: Regional connections to national authority files. Cataloging Classif. Q. 58(1), 76–89 (2020). https://doi.org/10.1080/01639374.2019.1690087

  40. Kitchenham, B.: Procedures for performing systematic reviews (2004). https://www.inf.ufsc.br/~aldo.vw/kitchenham.pdf

  41. Larsson, A., Ånäs, S., Zeinstra, M., Marynowski, P.: Wikimedia commons data roundtripping. https://meta.wikimedia.org/wiki/Wikimedia_Commons_Data_Roundtripping

  42. Ma, L., Li, M., Zhang, W., Li, J., Liu, T.: Unstructured text enhanced open-domain dialogue system: a systematic survey. ACM Trans. Inf. Syst. 40(1), 9:1–9:44 (2022). https://doi.org/10.1145/3464377

  43. Mahey, M., et al.: Open a GLAM lab. International GLAM Labs Community, Book Sprint, Doha, Qatar (2019). https://doi.org/10.21428/16ac48ec.f54af6ae

  44. Marcondes, C.H.: Integrated classification schemas to interlink cultural heritage collections over the web using LOD technologies. Int. J. Metadata Semant. Ontol. 15(3), 170–177 (2021). https://doi.org/10.1504/IJMSO.2021.123040, https://www.inderscienceonline.com/doi/abs/10.1504/IJMSO.2021.123040

  45. Navarrete, T., Villaespesa, E.: Image-based information: paintings in Wikipedia. J. Documentation 77(2), 359–380 (2021). https://doi.org/10.1108/JD-03-2020-0044, https://doi.org/10.1108/JD-03-2020-0044

  46. Nesterov, A., Hollink, L., van Erp, M., van Ossenbruggen, J.: A knowledge graph of contentious terminology for inclusive representation of cultural heritage. In: Pesquita, C., et al. (eds.) The Semantic Web - 20th International Conference, ESWC 2023, Hersonissos, Crete, Greece, May 28 - June 1, 2023, Proceedings. LNCS, vol. 13870, pp. 502–519. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-33455-9_30

  47. Nguyen, B.X., Dinneen, J.D., Luczak-Roesch, M.: A novel method for resolving and completing authors’ country affiliation data in bibliographic records. J. Data Inform. Sci. 5(3), 97–115 (2020). https://doi.org/10.2478/jdis-2020-0020

  48. Nielsen, F.Å., Mietchen, D., Willighagen, E.: Scholia and scientometrics with Wikidata. In: Scientometrics 2017, pp. 237–259 (2017). https://doi.org/10.1007/978-3-319-70407-4_36, https://arxiv.org/pdf/1703.04222

  49. Obregón Sierra, A.: Insertion of metadata from Spanish libraries in Wikidata: a linked open data model. Revista Española de Documentación Científica 45(3), a330 (2022). https://doi.org/10.3989/redc.2022.3.1870, https://redc.revistas.csic.es/index.php/redc/article/view/1363

  50. Padilla, T.: Responsible operations: data science, machine learning, and AI in libraries (2019). https://doi.org/10.25333/xk7z-9g97

  51. Padilla, T., Allen, L., Frost, H., Potvin, S., Russey Roke, E., Varner, S.: Final report — always already computational: collections as data (2019). https://doi.org/10.5281/zenodo.3152935

  52. Page, M.J., et al.: The Prisma 2020 statement: an updated guideline for reporting systematic reviews. BMJ 372 (2021). https://doi.org/10.1136/bmj.n71, https://www.bmj.com/content/372/bmj.n71

  53. Polley, K.L., Tompkins, V.T., Honick, B.J., Qin, J.: Named entity disambiguation for archival collections: Metadata, Wikidata, and linked data. Proc. Assoc. Inform. Sci. Technol. 58(1), 520–524 (2021). https://doi.org/10.1002/pra2.490, https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/pra2.490

  54. Poulter, M., Sheppard, N.: Wikimedia and universities: contributing to the global commons in the age of disinformation. Insights: the UKSG journal (2020). https://doi.org/10.1629/uksg.509

  55. Rivera, M.J., Teruel, M.A., Maté, A., Trujillo, J.: Diagnosis and prognosis of mental disorders by means of EEG and deep learning: a systematic mapping study. Artif. Intell. Rev., 1–43 (2021). https://doi.org/10.1007/s10462-021-09986-y

  56. Rossenova, L., Duchesne, P., Blümel, I.: Wikidata and Wikibase as complementary research data management services for cultural heritage data. In: Kaffee, L., Razniewski, S., Amaral, G., Alghamdi, K.S. (eds.) Proceedings of the 3rd Wikidata Workshop 2022 co-located with the 21st International Semantic Web Conference (ISWC2022), Virtual Event, Hanghzou, China, October 2022. CEUR Workshop Proceedings, vol. 3262. CEUR-WS.org (2022). https://ceur-ws.org/Vol-3262/paper15.pdf

  57. Shafee, T., Mietchen, D., Lubiana, T., Jemielniak, D., Waagmeester, A.: Ten quick tips for editing Wikidata. PLoS Comput. Biol. 19(7) (2023). https://doi.org/10.1371/journal.pcbi.1011235

  58. Sonzini, V.: Gender equality in library science and book history Italian journals: a focus on boards, authors and peer-reviewers. JLIS. it 14(1), 81–98 (Dec 2022). https://doi.org/10.36253/jlis.it-509, https://jlis.it/index.php/jlis/article/view/509

  59. Taniguchi, S.: Data provenance and administrative information in library linked data: reviewing RDA in RDF, BIBFRAME, and Wikidata. Cataloging Classif. Q. 61(1), 67–90 (2023). https://doi.org/10.1080/01639374.2023.2178048

  60. Tharani, K.: Much more than a mere technology: a systematic review of Wikidata in libraries. J. Acad. Librariansh. 47(2), 102326 (2021). https://doi.org/10.1016/j.acalib.2021.102326, https://www.sciencedirect.com/science/article/pii/S0099133321000173

  61. Thornton, K., Seals-Nutt, K., Remoortel, M.V., Birkholz, J.M., Potter, P.D.: Linking women editors of periodicals to the Wikidata knowledge graph. Semantic Web 14(2), 443–455 (2023). https://doi.org/10.3233/SW-222845

  62. Ukwoma, S.C., Osadebe, N.E., Okafor, V.N., Ezeani, C.N.: Unveiling the veiled: Wikipedia collaborating with academic libraries in Africa in creating visibility for African women through Art+Feminism Wikipedia edit-a-thon. Digit. Libr. Perspect. 37(4), 449–462 (2021). https://doi.org/10.1108/DLP-08-2020-0079

  63. Wilkinson, et al.: The fair guiding principles for scientific data management and stewardship. Sci. Data 3 (2016). https://doi.org/10.1038/sdata.2016.18

  64. Yang, M.Y.R., Yang, S., Lin, J.: Integration of text and geospatial search for hydrographic datasets using the lucene search library. In: Aizawa, A., Mandl, T., Carevic, Z., Hinze, A., Mayr, P., Schaer, P. (eds.) JCDL 2022: The ACM/IEEE Joint Conference on Digital Libraries in 2022, Cologne, Germany, June 20-24, 2022, p. 36. ACM (2022). https://doi.org/10.1145/3529372.3533280

  65. Zhao, F.: A systematic review of Wikidata in digital humanities projects. Digit. Scholarsh. Humanit. 38(2), 852–874 (2023). https://doi.org/10.1093/llc/fqac083

  66. Zhitomirsky-Geffet, M., Minster, S.: Cultural information bubbles: a new approach for automatic ethical evaluation of digital artwork collections based on Wikidata. Digit. Scholarsh. Humanit. 38(2), 891–911 (2023). https://doi.org/10.1093/llc/fqac076

  67. Zhu, L., Xu, A., Deng, S., Heng, G., Li, X.: Entity management using Wikidata for cultural heritage information. Cataloging Classif. Q. 61(1), 20–46 (2023). https://doi.org/10.1080/01639374.2023.2188338

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gustavo Candela .

Editor information

Editors and Affiliations

Appendix A. Summary of the projects used for the systematic review

Appendix A. Summary of the projects used for the systematic review

Appendix A. Summary of the projects used for the systematic review 

This section presents the summary of the projects used for the systematic review as is shown in Table 4.

Table 4. Summary of the projects used for the systematic review. Additional information is provided regarding the application according to the TaDiRAH vocabulary and the identifier in Wikidata.

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Candela, G., Cuper, M., Holownia, O., Gabriëls, N., Dobreva, M., Mahey, M. (2024). A Systematic Review of Wikidata in GLAM Institutions: a Labs Approach. In: Antonacopoulos, A., et al. Linking Theory and Practice of Digital Libraries. TPDL 2024. Lecture Notes in Computer Science, vol 15178. Springer, Cham. https://doi.org/10.1007/978-3-031-72440-4_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-72440-4_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-72439-8

  • Online ISBN: 978-3-031-72440-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics