Abstract
GLAM (Galleries, Libraries, Archives and Museums) organisations host extensive digital collections that have been made available to the public for over three decades. Advances in technology have facilitated the reuse of digital collections as rich data sources. In this context, Wikidata has emerged as a leading, innovative, and collaborative platform that enriches the digital collections provided by GLAM institutions. However, a comprehensive analysis of the current status, potential and challenges of its use in the GLAM sector is still lacking. This paper presents a systematic review of Wikidata use in GLAM institutions within the context of the work of the International GLAM Labs Community (glamlabs.io). The results summarise academic literature on Wikidata projects. The paper argues that Wikidata’s potential in the domain of reuse, an important component of the FAIR principles, can be better integrated into the practices of GLAM institutions.
Appendix A. Summary of the projects used for the systematic review
This section summarises the projects used for the systematic review, as shown in Table 4.
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Candela, G., Cuper, M., Holownia, O., Gabriëls, N., Dobreva, M., Mahey, M. (2024). A Systematic Review of Wikidata in GLAM Institutions: a Labs Approach. In: Antonacopoulos, A., et al. Linking Theory and Practice of Digital Libraries. TPDL 2024. Lecture Notes in Computer Science, vol 15178. Springer, Cham. https://doi.org/10.1007/978-3-031-72440-4_4
DOI: https://doi.org/10.1007/978-3-031-72440-4_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-72439-8
Online ISBN: 978-3-031-72440-4
eBook Packages: Computer Science, Computer Science (R0)