Data management plans

We have pulled together guidance on the best practices to creating a data management plan.

What is a data management plan?

A data management plan (DMP) is a written document outlining how you are planning to manage your research data both during and after your research project ends. The plan should address what types of data will be collected and how the data will be documented, processed, stored, shared and preserved. It is a living document which should be periodically reviewed and updated throughout the lifetime of your project.

Why create a data management plan?

Data management plans ensure a project’s research data is created, managed, processed, documented, shared, and preserved in a way that enables easy verification and reuse. They set out a roadmap for your data from planning to preservation, providing the backbone of good research data practices.

Most funders and universities, including the University of Surrey, refer to Point 4.2 of the Research Data Management Procedure (PDF), require you to create a plan at the start of your research. Your funder may have specific guidance or templates. You can include data management plans in your ethics applications and PhD confirmation documents, too.

How to write a data management plan

Depending on the project, plans can be very simple (a page or less) or highly detailed (multiple pages).

If you are writing a plan for a funding bid, you can budget costs to help improve the management, sharing, and preservation of research data. This could include staff time, software, technology, and resources to make your data more open. Check out this data management costing tool and checklist (PDF).

Below we provide some guidance on the main topics your plan should address, a helpful tool, and outline how we can help.

Describe where you will store your data during collection, measurement and analysis
Be specific about the journey your data will take and how it will be transferred from one device, platform or physical location to another
How you will keep your data safe from accidental loss, corruption, being overwritten and unauthorised access
Decide if you will transfer data from a collection tool or specific equipment to another, maybe in a different format, to do your analysis, e.g. voice recorder or video, field measurements or log book, or online survey or participant questionnaire
How and when will you do this? Every day/week? After data collection ends?

In almost all cases, research data should be kept on University storage. Avoid using local hard drives, portable storage devices, laptops, and tablets for storage to reduce the risk of accidental loss. Do not use third party storage like Dropbox, Google drive, etc. They offer less protection and are less secure than University storage. If your project has special requirements like high performance computing, highly sensitive data, or commercially owned data then consult IT Services, Ethics, or your sponsor/funding agency for an appropriate set-up.

Outline where you will store the data at every stage of collection, measurement and analysis
Describe how and when you will transfer data if necessary, including deleting data off collection tools/measurement equipment and their associated storage
Identify any ethical, legal or commercial issues with your data, e.g. identifiable data, copyrighted materials, patents, etc. How will you protect the data? (This could include transforming, de-identifying, or anonymising the data, or using collaborative agreements)
Identify who will have access to the data. How will collaborators have access to the data?
Identify any special storage or computing requirements you or your funder may have
Describe how you will securely store and maintain any non-digital data.

For more on storage, see managing your data.

Data sharing for verification, reproduction and reuse is an increasingly important marker of academic integrity. Researchers are encouraged to make their data as open as possible. Your plan should identify what data will or will not be shared from the project. For data that can’t be shared, you should include a justification for why not.

For shareable data, you should outline where, when, and how others can access it. Often data is released following publication or at the close of a project. Be aware: some funders and publishers require data to be shared within specific timelines. You can describe any restrictions or terms of use for the data, including any licenses you might want to apply:

Make sure your consent forms don’t prohibit sharing/retention, and even better, request permission for de-identified data to be shared/archived in an open repository
Outline what parts of your data can and cannot be shared or published
Describe and justify any restrictions, secure permissions needed or terms of access (restricted, registration form, Non Disclosure Agreement, etc.)
When will the data be released?
Is there non-digital data that needs to be made available? How will people request access (e.g. by using a publicly discoverable metadata record)?
If the data cannot be shared, explain why (e.g. don’t own, national security, copyrighted)
Will you transform the data? (e.g. de-identify or convert to an open format)
Identify how you will share your data, such as depositing in a repository
Consider applying a Creative Commons license to your shared data or code
Check out the FAIR principles of data sharing.

Best practice is to deposit the data into a data repository. Repositories provide the best visibility, tracking, and safe keeping for your data:

Identify a suitable repository. Consider a discipline specific repository that is most appropriate for your data. Check out PLOS’ list of recommended repositories or Scientific Data’s recommended data repositories
You can also use the University’s Open Research repository.

For more information on sharing your data, please see open data.

The aim is to be "As open as possible, as closed as necessary"

How we can help

Consultation

If you have any questions about data management plans, your funder’s requirements, etc. please get in touch with us at openresearch@surrey.ac.uk or come along to our Research Data Management drop-in sessions which run on Tuesday mornings (8:30am-1pm) on Level 1 of the Library next to the Entrance gates.

Review

We are always happy to help you create and/or review your Data Management Plan but please allow a week for our review if sending your plan to openresearch@surrey.ac.uk. Otherwise just bring it along to a Research Data Management drop-in session on a Tuesday morning (8:30am-1pm) for instant in-person feedback or use the Surrey Data Management Plan Review Guide to review the plan yourself.

Training

We offer regular training sessions on data management plans and research data management practices involving both qualitative and quantitative data through the Doctoral College or via the Library Research Hub Workshop Descriptions. Want a bespoke session? Let us know what you have in mind at openresearch@surrey.ac.uk.

Resources

PGR supervisors’ guide to reviewing DMPs
Overview of funder’s data policies
Examples of DMPs
DMP Checklist
DMP Checklist for qualitative data
UK Data Service’s Plan to Share guidance
UK Data Service’s Data Protection guidance
UKRI’s GDPR and Research – An Overview for Researchers.

Data management plans

What is a data management plan?

Why create a data management plan?

How to write a data management plan

How we can help

Consultation

Review

Training

Resources

Research Data Management Procedure

Research Data Management Procedure Companion Guide

Research Data Management Deposit Guide

Data management plans

What is a data management plan?

Why create a data management plan?

How to write a data management plan

File types and formats

Documentation and metadata

Storage, security, and IP

Data sharing

Archiving/Preservation

Postgraduate data management plans

Online planning tool

How we can help

Consultation

Review

Training

Resources

Research Data Management Procedure

Research Data Management Procedure Companion Guide

Research Data Management Deposit Guide