Exploring the Reconciliation of Cultural Data on Wikidata

An experiment with the museological collection of the National Historical Museum





Semantic web, Linked open data, Cultural collections, Semantic enrichment


This study is situated in the context of the semantic web and linked open data, with a focus on the technique of data reconciliation. It seeks to understand how cultural data can be reconciled with Wikidata through scripts written in Python, with the aim of contributing to the understanding of how a semantic enrichment technique can be applied to cultural databases. As methodology, the stages of developing the data reconciliation scripts are described. As results, the outputs of applying the scripts are presented, covering the reconciliation of part of the data from the museological collection of the National Historical Museum with the digital objects of Wikidata. It is concluded that describing the development of the scripts led to a better understanding of how data reconciliation occurs in cultural collections, that more attention should be paid to the normalization of collection data, and that this type of application expands the potential for the networked socialization of knowledge.
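
The article's own scripts are not reproduced on this page. As a minimal sketch of what label-based reconciliation can look like, assuming a simple string-similarity approach: a collection term is normalized and compared against candidate labels, and a match is accepted only above a similarity threshold. The function names, the threshold, and the candidate identifiers are hypothetical; in practice the candidates would come from a Wikidata search rather than a hard-coded dictionary.

```python
import unicodedata
from difflib import SequenceMatcher


def normalize(label: str) -> str:
    """Lowercase, strip accents, and collapse runs of whitespace."""
    decomposed = unicodedata.normalize("NFKD", label)
    without_accents = "".join(
        ch for ch in decomposed if not unicodedata.combining(ch)
    )
    return " ".join(without_accents.lower().split())


def reconcile(label, candidates, threshold=0.85):
    """Return (candidate_id, score) for the best match, or None.

    `candidates` maps an identifier to a label. The threshold of 0.85
    is an illustrative assumption, not a value taken from the article.
    """
    target = normalize(label)
    best_id, best_score = None, 0.0
    for candidate_id, candidate_label in candidates.items():
        score = SequenceMatcher(
            None, target, normalize(candidate_label)
        ).ratio()
        if score > best_score:
            best_id, best_score = candidate_id, score
    if best_score >= threshold:
        return best_id, best_score
    return None


# Hypothetical candidates: the identifiers are placeholders,
# not real Wikidata QIDs.
candidates = {
    "Q_PLACEHOLDER_1": "Moeda",
    "Q_PLACEHOLDER_2": "Medalha",
    "Q_PLACEHOLDER_3": "Espada",
}

print(reconcile("  MEDALHA ", candidates))  # inconsistent case and spacing
print(reconcile("Cadeira", candidates))     # no sufficiently close label
```

Normalizing before comparison illustrates the point made in the conclusions: inconsistent casing, accents, or spacing in the collection data lowers similarity scores and causes missed matches.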



Author Biographies

  • Luis Felipe Rosa Oliveira, Universidade de Brasília

    PhD candidate in Information Science at the Universidade de Brasília (UnB), Brasília, DF, Brazil. Master's degree in Communication from the Universidade Federal de Goiás (UFG), GO, Brazil.

  • Dalton Lopes Martins, Universidade de Brasília

    Postdoctoral research at the Universidade de São Paulo (USP), SP, Brazil. PhD in Information Science from the Universidade de São Paulo (USP), São Paulo, SP, Brazil. Professor at the Universidade de Brasília (UnB), Brasília, DF, Brazil.



