Approaches for data reuse and the issue of scientific data reusability
DOI:
https://doi.org/10.18617/liinc.v15i2.4777Keywords:
Data Reuse, Scientific Reproducibility, Reusability, Open Science, Research DataAbstract
The availability of scientific assets through data repositories has been greatly increased as a result of government and institutional data sharing policies and mandates for publicly funded research, allowing data to be reused for purposes not always anticipated by primary researchers. Despite the fact that the argument favoring data sharing is strongly grounded in the possibilities of data reuse and its contributions to scientific advancement, this subject remains unobserved in discussions about data science and open science. This paper follows a narrative review method to take a closer look at data reuse in order to better conceptualize this term, while proposing an early classification of five distinct data reuse approaches (repurposing, aggregation, integration, meta-analysis and reanalysis) based on hypothetical cases and literature examples. It also explores the determinants of what constitutes reusable data, and the relationship between data reusability and documentation quality. It presents some challenges associated with data documentation and points out some initiatives and recommendations to overcome such problems. It expects to contribute not only for the conceptual advancement around the reusability and effective reuse of the data, but also to result in initiatives related to data documentation in order to increase the reuse potential of these scientific assets.
References
BORSBOOM, D. et al. False alarm? A comprehensive reanalysis of" Evidence that psychopathology symptom networks have limited replicability" by Forbes, Wright, Markon, and Krueger (2017). Journal of Abnormal Psychology, v. 126, n. 7, p. 989-999, 2017.
BRUNTON, G.; KNEALE, D.; SOWDEN, A. Caffeinated energy drinks and effects in UK young people: a secondary analysis of population-level datasets. Department of Health & Social Care Reviews Facility: London, 2019.
CAMILO, B. de F. et al. Sedentary behavior and nutritional status among older adults: a meta-analysis. Revista Brasileira de Medicina do Esporte, São Paulo, v. 24, n. 4, p. 310-315, 2018.
CHANDRA, A.; COPEN, C. E.; STEPHEN, E. H. Infertility service use in the United States: data from the National Survey of Family Growth, (1982-2010). National Health Statistics Reports, Hyattsville, n. 73, p.1-21, 2014.
CORTI, L. et al. Managing and sharing research data: a guide to good practice. SAGE: London, 2014.
CURTY, R. G. et al. Attitudes and norms affecting scientists’ data reuse. PLoS One, v. 12, n. 12, p. e0189288, 2017.
DANIELS, M. G. Data Reuse in Museum Contexts: experiences of Archaeologists and Botanists. 2014. Dissertation (Doctor of Philosophy in Information) University of Michigan, Ann Arbor, 2014.
DAVID, M. The science of data sharing: documentation. In: SIEBE, J. E. (Ed.). Sharing social science data: advantages and challenges: Thousand Oaks: Sage Publications, 1991. p. 91-115.
EVOIO Working Group. Reuse Cases. 2011. Disponível em: http://www.evoio.org/wiki/Reuse_Cases Acesso em: 9 jun. 2019.
FANIEL, I. M.; JACOBSEN, T. E. Reusing scientific data: How earthquake engineering researchers assess the reusability of colleagues’ data. Computer Supported Cooperative Work, v. 19, n. 3, 2010, p. 355-375, 2010.
FANIEL, I. M.; ZIMMERMAN, A. Beyond the data deluge: a research agenda for large-scale data sharing and reuse. International Journal of Digital Curation, v.6, n. 1, p. 58-69, 2011.
FEAR, K. M. Measuring and anticipating the impact of data reuse Dissertation (Doctor of Philosophy in Information) University of Michigan, Ann Arbor, 2013.
FORBES, M. K. et al. Evidence that psychopathology symptom networks have limited replicability. Journal of Abnormal Psychology, v. 126, n. 7, p. 969-988, 2017.
HEATON, J. Reworking Qualitative Data. Thousand Oaks: Sage Publications, 2004.
KIM, Y.; YOON, A. Scientists’ data reuse behaviors: A multilevel analysis. Journal of the Association for Information Science and Technology, v. 68, n.12, p.2709-2719, 2017
MARKUS, L. M. Toward a theory of knowledge reuse: Types of knowledge reuse situations and factors in reuse success. Journal of Management Information Systems, v. 18, n. 1, p. 57-93, 2001.
NATIONAL NETWORK OF LIBRARIES OF MEDICINE. Data reuse. Disponível em: https://nnlm.gov/data/thesaurus/data-reuse Acesso em: 9 jun. 2019.
NIU, J. Overcoming inadequate documentation. Proceedings of the American Society for Information Science and Technology, v. 46, n. 1, p. 1-14, 2009a.
NIU, J. Perceived Documentation Quality of Social Science Data. 2009. Dissertation (Doctor of Philosophy in Information) University of Michigan, Ann Arbor, 2009b. Disponível em: http://deepblue.lib.umich.edu/bitstream/handle/2027.42/63871/niujf_1.pdf?sequence=1 Acesso em: 10 jun. 2019.
PASQUETTO, I. V.; RANDLES, B. M.; BORGMAN, C. L. On the Reuse of Scientific Data. Data Science Journal, v. 16, n. 8, 2017.
PIGOTT, D. et al. An approach to managing repurposing of digitized knowledge assets. Australasian Journal of Information Systems, Malden, v. 9, n. 1, p. 92-103, 2001.
TENOPIR, C. et al. Changes in data sharing and data reuse practices and perceptions among scientists worldwide. PLoS One, v. 10, n. 8, p. e0134826, 2015.
THESSEN, A. E. et al. Data issues in the life sciences. ZooKeys, Sofia, v. 15, n.150, p. 15-51, 2011.
VAN DE SANDT et al. The Definition of Reuse. Data Science Journal, v. 18, n. 1, 2019.
VISSOCI, J. R. N. et al. Zika virus infection and microcephaly: Evidence regarding geospatial associations. PLoS neglected tropical diseases, v. 12, n. 4, p. e0006392, 2018.
WILKINSON, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, v. 3, 2016.
YEAGER, A. JAMA Journals Retract Six Papers by Cornell Researcher. The Scientist, 19 set. 2018. Disponível em: https://www.the-scientist.com/news-opinion/jama-journals-retract-six-papers-by-cornell-food-scientist--64828 Acesso em: 9 jun. 2019.
ZIMMERMAN, A. S. Data sharing and secondary use of scientific data: experiences of ecologists. 2003. Thesis (Doctor of Philosophy in Information and Library Studies Dissertation) - University of Michigan, Michigan, 2003.
ZIMMERMAN, A. S. New Knowledge from Old Data: The Role of Standards in the Sharing and Re- use of Ecological Data. Science, Technology & Human Values, Ann Arbor, v. 33, n. 5, p. 631-652, 2008.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Renata Curty

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
Authors retain copyright and grant Liinc em Revista the right of first publication with the work simultaneously licensed under a Creative Commons Attribution 4.0 International License.
The authors have permission and are encouraged to deposit their manuscripts and versios of record (VoR) in their personal web pages or institutional repositories, generic repositories etc., before (pre-print) or after (post-print) the publication in Liinc em Revista, according to its open access depositing policy registered in the Directory of Editorial Policies of Brazilian Journals (DIADORIM), kindly providing a link to the article published on Liinc's website.
Liinc em Revista, published by Instituto Brasileiro de Informação em Ciência e Tecnologia, is licensed under a Creative Commons Attribution 4.0 International License – CC BY 4.0