Supporting_Scientific_Data

Good Data Management Practice

Good Data Management Practice extends the notion of supporting research findings with good data to supporting research data with practical approaches and activities.

The following ten guiding principles are not a set of requirements or expectations, but rather ways of thinking about supporting research data in the context of twenty-first century science.

Principle Description
High quality scientific data requires effective data management Though some of the related practices and strategies may seem relatively simple, even the most advanced science requires data to be properly stored, organized, and documented.
Effective data management requires scientific data to be defined broadly. Scientific data does not exist in a vacuum. Proper management of data requires taking a workflow-based view of data. All materials needed to trace the provenance of a research finding “count” as data.
Effective data management requires research data to be understood thoroughly. Related practices and strategies are, partially, determined by the characteristics of the data being managed. For example, sensitive data necessitates limiting access.
Data management is a set of practices and strategies, not a set of tools. Though there are a myriad of tools and resources to make the process more efficient, the core of effective data management are practices and strategies implemented by people.
Data management practices and strategies should be routine and standardized. Data management practices and strategies should be a part of the day-to-day conduct of research. Realizing the benefits of data management requires agreement about how data should be organized, stored, described, and representation.
It takes planning and communication to manage data well. Like any workflow, the data management-related activities implemented at an early stage of a project substantially affect what can be done later. Though plans may change, strategizing prospectively helps things proceed efficiently.
Managing data requires managing documentation and metadata. Effective data management often necessitates documenting more than seems necessary in greater detail than may seem necessary. Information about data “counts” as data.
All data must be managed, but not all data necessarily needs to be shared (openly). Releasing scientific data for use by others has numerous benefits, but the protection of human subjects must be centered. Typically, only materials that underlie research findings need to be shared.
Data management requires commitment and a range of skills. Supporting data management is a community effort, involving both researchers and other data stakeholders (e.g. IT professionals, librarians, leadership). To be effective, related practices and strategies need to be verified, audited, and revisited over time.
Sometimes “good enough” has to be good enough Best practices in data management is a moving target. In settings where formal data management standards and expectations are still under development, it may be necessary to consider and adopt a set of minimum guidelines.