Retaining data

At the end of a project, you will need to decide which of your research data to retain and which to dispose of.

Where possible, data selected for long-term retention should be made as discoverable and accessible as permitted. This is normally done by submitting the data to a discipline-specific or institutional repository, or a funder-established data centre.

Some data cannot be made publicly available, due to personal or commercial sensitivity for example, and may be retained but not shared. In these cases, you should contact  (login required) for advice on data storage options.


Data selection and disposal

Decisions about retaining data after the end of your project should begin during the data management planning stage, and should take into account institutional, funder and repository requirements.

Most funders expect data underlying published papers, and other data of long-term value, to be retained for 10 to 30 years and made available if possible.

For projects with extremely large datasets, retaining the full datasets may not be possible. In these instances, it may be appropriate to dispose of datasets when they are no longer required, but to retain data samples along with the detailed methodologies or code needed to recreate the data.
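
As an illustration of retaining a data sample together with a record of how it was produced, the following is a minimal sketch in Python. It assumes the dataset is a UTF-8 CSV file with a header row; the function names `sha256_of` and `retain_sample` are hypothetical and not part of any University tool. It draws a reproducible random sample and records a checksum of the full file, so that a recreated dataset can later be compared with the original.

```python
import csv
import hashlib
import random
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def retain_sample(source: Path, target: Path, k: int, seed: int = 0) -> dict:
    """Write a reproducible random sample of up to k data rows to target.

    Assumes source is a UTF-8 CSV file whose first row is a header.
    Reservoir sampling is used so the full file never has to fit in memory.
    Returns a small record describing how the sample was produced.
    """
    rng = random.Random(seed)  # fixed seed makes the sample repeatable
    reservoir = []
    total = 0
    with source.open(newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader)
        for total, row in enumerate(reader, start=1):
            if len(reservoir) < k:
                reservoir.append(row)
            else:
                # Keep the new row with probability k / total.
                j = rng.randrange(total)
                if j < k:
                    reservoir[j] = row
    with target.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(header)
        writer.writerows(reservoir)
    return {
        "source": source.name,
        "source_sha256": sha256_of(source),
        "rows_in_source": total,
        "rows_in_sample": len(reservoir),
        "seed": seed,
    }
```

The returned record could be saved alongside the sample and the methodology or code, so that anyone recreating the full dataset can check the row count and checksum against the original.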

The Digital Curation Centre provides advice on .

Files that are not required for long-term retention should be deleted when they have fulfilled their purpose. Researchers have a legal responsibility for collected data, and sensitive data should be disposed of in an appropriate manner.

Further guidance on the retention and disposal of data can be found in the University’s and at the .

Postgraduate research students should follow University guidance on transferring and disposing of their research data at the end of their degree.


Long-term data retention

Data selected for retention must be kept securely after the end of a research project.

For projects involving personal data, ensure that the terms of consent agreements are observed. These may include anonymising the data by de-identification and securely destroying any personal data associated with the dataset. Where personal data need to be retained for planned follow-up research, ensure that they are stored securely.

Many research funders specify which data need to be retained, for how long, and where they should be deposited. You can find more information in research funder policies.

This includes , which recommends that data underpinning publications should be accessible for at least 10 years after publication.
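
To make the 10-year minimum concrete, the following is a small sketch in Python that works out the earliest date on which data underpinning a publication could be reviewed for disposal. The function name `earliest_review_date` is hypothetical, and the default of 10 years is only the minimum mentioned above; individual funder policies may require longer.

```python
from datetime import date


def earliest_review_date(published: date, years: int = 10) -> date:
    """Return the date `years` after publication.

    A 29 February publication date is rolled forward to 1 March when the
    target year is not a leap year.
    """
    try:
        return published.replace(year=published.year + years)
    except ValueError:
        # Only reached for 29 February in a non-leap target year.
        return date(published.year + years, 3, 1)


print(earliest_review_date(date(2015, 6, 1)))    # 2025-06-01
print(earliest_review_date(date(2020, 2, 29)))   # 2030-03-01
```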

Since hardware and software may become obsolete over time, data should be converted into standard or open formats for long-term accessibility. Find out more about from the UK Data Service.
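
As one hedged example of converting data into an open format, the sketch below rewrites a delimited text file as UTF-8 comma-separated CSV using only the Python standard library. It assumes, purely for illustration, that the source file is semicolon-delimited and encoded as Windows-1252; the function name `to_utf8_csv` and its defaults are hypothetical, and real datasets may need different settings or a dedicated conversion tool.

```python
import csv
from pathlib import Path


def to_utf8_csv(
    source: Path,
    target: Path,
    source_encoding: str = "cp1252",  # assumed legacy encoding
    source_delimiter: str = ";",      # assumed legacy delimiter
) -> int:
    """Copy a delimited text file to UTF-8, comma-separated CSV.

    Returns the number of rows written, including any header row.
    """
    rows = 0
    with source.open(newline="", encoding=source_encoding) as src, \
            target.open("w", newline="", encoding="utf-8") as dst:
        reader = csv.reader(src, delimiter=source_delimiter)
        writer = csv.writer(dst)
        for row in reader:
            writer.writerow(row)
            rows += 1
    return rows
```

Comparing the returned row count with the row count of the original file is a simple check that nothing was lost in the conversion.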

It may be possible to store physical research data, in the form of paper records, for a fixed period of time in the University Records Centre. 

The primary storage location for research data is now the X: drive. If you were planning to retain data in Google Drive, you may wish to move them to the X: drive and refer to the .

For further information, contact rdm@sheffield.ac.uk.