Skip to main content Link Menu Expand (external link) Document Search Copy Copied

Play Three: Preserving data

When to use this Play?

  • Before finishing the project
  • When the project requires for open publishing and long term preservation

Key steps:

  1. Have a list of data types and resources that need to be published and preserved (it might be the case that not everything needs to be published publicly).
  2. Use an external trusted repository for your community/domain specific data such as GBIF, Genbank and BOLD. Or Zenodo, Figshare for generic data. These resources provide DOIs and structured metadata that are both human and machine readable.
  3. Check if aggregators like GBIF are useful for your datasets.
  4. Check long term preservation policies. Do you have requirements and guidelines for long term data preservation and archiving?
  5. Besides data, what else needs to be preserved (software, lab protocols, algorithms, notebooks?).
  6. Think about data compression challenges. Do you need to compress data before sharing and preserving?
  1. https://www.surf.nl/en/services/data-archive – long term data storage service.
  2. [Historical Scientific Analog Data: Life Sciences Faculty’s Perspectives on Management, Reuse and Preservation(https://doi.org/10.5334/dsj-2020-051).

Common pitfalls to avoid for Play 03

  1. Backup is not the same as archiving.
  2. Not all repositories are created and operated equally. Check policies and services.

Return to Home