Skip to Main Content

Research Data Management

What is Research Data?

Data can be "any information that can be stored in digital form, including text, numbers, images, video or movies, audio, software, algorithms, equations, animations, models, simulations, etc." (National Science Board, 2005).

The definition of research data varies depending discipline and research funder.

  • Research data is the output from any systematic investigation that involves a process of observation, experiment or the testing of a hypothesis (Pryor, 2012).
  • Research data is " the recorded factual material commonly accepted in the scientific community as necessary to validate research findings." (OMB Circular 110)
  • Scientific data is "recorded factual material commonly accepted in the scientific community as of sufficient quality to validate and replicate research findings, regardless of whether the data are used to support scholarly publications." (Final NIH Policy for Data Management and Sharing)

Research data include, but are not limited to:

  • Laboratory notebooks, field notebooks, diaries
  • survey responses, transcripts, codebooks
  • software and code
  • Models, algorithms, scripts
  • measurements from laboratory or field equipment (such as IR spectra or hygrothermograph charts)
  • Images (photographs, films, scans, or autoradiograms)
  • audio/video recordings
  • Slides, artifacts, specimens, samples
  • Contents of an application (input, output, logfiles for analysis software, simulation software, schemas)

Research Data Mangement and Research Data Lifecycle

  • Research Data management: the practices of organizing, storing, and accessing research data throughout its lifecycle.
  • Research data life cycle: distinct stages a research dataset goes through, from initial planning and collection to analysis, preservation, and potential sharing or reuse.

 

Aspects of data management through a research data life cycle