77 Best practices for data management and metadata creation for collaborative biostatistics teams

Kelsey Karnik; Maggie Lang; Emily Slade

doi:10.1017/cts.2024.755

77 Best practices for data management and metadata creation for collaborative biostatistics teams

Published online by Cambridge University Press: 11 April 2025

Kelsey Karnik ,

Maggie Lang and

Emily Slade

Show author details

Kelsey Karnik: Affiliation:
University of Kentucky
Maggie Lang: Affiliation:
University of Kentucky
Emily Slade: Affiliation:
University of Kentucky

Article contents

Abstract

Rights & Permissions

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

Objectives/Goals: Our goal is to enhance communication and documentation in collaborative biostatistics by refining data management and metadata processes. We aim to capture critical data collection and generation information, improve transparency and reproducibility, and foster stronger researcher partnerships for more effective collaborations. Methods/Study Population: Traditional statistical analysis plans (SAP) often miss essential contextual knowledge from collaborators, leading to gaps that hinder reproducibility and limit future data use. Biostatistics teams at the University of Kentucky have updated their strategies to better capture important details about data origins and collection processes. By focusing on clear, comprehensive documentation early in the research process, we aim to preserve foundational data insights and improve collaboration efficiency. Our Biostatistics, Epidemiology, and Research Design (BERD) team has established best practices for addressing data management structures with collaborators across medical and healthcare fields – covering all project stages, from initial data collection to metadata creation and dataset finalization. Results/Anticipated Results: We will detail the processes used to improve data management structures and the observed results of these processes. For example, initiating deeper discussions about data origins and collection processes as early as possible in the collaboration has resulted in a more comprehensive project narrative that lays the foundation for effective collaboration. By engaging with project leaders early in the process, we can confirm that critical details about how data were collected and processed are documented, improving both the transparency and reproducibility of research findings. Streamlining the processes of capturing this information makes it more accessible and useful for those with limited statistical backgrounds, which is particularly relevant for faculty and staff in BERD communities and Clinical and Translational Science Awards Programs. Discussion/Significance of Impact: Nuanced data documentation structures are crucial for transforming raw data into meaningful, reusable datasets. Our initiatives promote clear communication, enhanced efficiency, and streamlined workflows. Translational science researchers can benefit from improving data management and metadata to boost long-term collaborative success.

Type: Biostatistics, Epidemiology, and Research Design
Information: Journal of Clinical and Translational Science , Volume 9 , Issue s1 , April 2025 , pp. 24

DOI: https://doi.org/10.1017/cts.2024.755 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.

Article contents

77 Best practices for data management and metadata creation for collaborative biostatistics teams

Abstract

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests