Skip to ContentSkip to Navigation
Research GELIFES Data Management

What are primary data?

Question

What are primary data and when am I required to archive them?

Answer

Primary data include all sources of raw data:

  1. scanned field logs, lab journals, score forms,
  2. pictures of gels, microscopic observations,
  3. output from data loggers,
  4. video and audio recordings,
  5. webcam/photo identification files: only when the resulting IDs are NOT included in other primary data sets, e.g. in field journals or data files,
  6. sequencing and genotyping data,
  7. micro array and hi throughput data: only if NOT stored in public database on publication.

The primary data listed above must always be deposited in the repository when collected in a GELIFES research project, unless explicitly stated otherwise. In case of primary data that are collected and/or stored externally the following exceptions may apply:

  1. Data that is collected and stored at an external institute, falls under the responsibility of the external institute and need not be deposited in the repository; however, this is ONLY the case for raw, primary data! The database and institute should be referred to in the corresponding metadata file (read_me_first.txt) of the archive. All processed, secondary data such as spreadsheets, databases, scripts, code etc. that is used for the thesis/publication must be included in the archive file.
  2. Large primary data sets such as sequencing data that are stored elsewhere in a public database need not be deposited in the repository; again, this is ONLY the case for primary data. The link to the storage location should be included in the metadata file. All secondary data must be included in the archive file.
  3. Large long-term databases that are used for many projects need not be stored with each project. The raw/primary data, i.e., scans/photos of lab and field journals, should be stored once, and can be updated periodically with new contributions if applicable. The relevant version of this database archive should be referred to in subsequent project/publication archives. Secondary data must always be included in each thesis/publication archive.
Last modified:01 February 2017 12.43 a.m.