LibGuides: Centre for Future Transport and Cities: Research Data Management

Research Data Management

All research projects create or utilise data, which are necessary to support or validate findings, observations and outputs. Naturally, data will vary in types across disciplines, and techniques will differ in how they are collected. The management of research data (regardless of the form or the media in which they exist in), is essential for good research practice, ensuring that data are organised, preserved and in the long-term, reusable.

Coventry University recognises that good research data management is fundamental to high quality research and academic integrity. These LibGuides pages provide guidance and tools to support the management, re-use and preservation of research data.

Research Data Lifecycle
	Research Data Lifecycle. Highlights the different stages that research data may encounter in a project, and indicates systems and services that are required to support data management. ©Jisc and Bonner McHardy (page last updated 2/10/17)

Data Management for Funding Bids

Many funding bodies require that their funding recipients create and follow plans for managing data, storing or preserving it in the long term, and sharing some, or all data products with the public. The Digital Curation Centre (DCC) has provided a convenient Overview of Individual Funders' Data Policies.

Individual councils and other funding agencies vary in the degree of planning and explanation they expect at the bidding stage, but some, for example the Wellcome Trust, have very strong expectations:

"All those seeking Wellcome Trust funding should consider their approach for managing and sharing data at the research proposal stage. In cases where the proposed research is likely to generate data outputs that will hold significant value as a resource for the wider research community, applicants will be required to submit a data management and sharing plan to the Wellcome Trust prior to an award bring made". (Policy on data management and sharing)

Even where such demands are not made, it is likely that funders will respond more positively to applications which have clear plans for managing, preserving and sharing their data.

The steps you apply to manage your data are also likely to affect the costing of your project. As well as the cost of staff and other resources, it is necessary to think about what your short and long-term storage and back-up requirements will be, for the safekeeping of the data you create during your research work. There are likely to be both funder, and institutional requirements related to this.

Creating your data

There are many decisions to make about managing your data before you even start creating/collecting it, including choosing hardware and software, and addressing issues with intellectual property rights and ethics. Decisions made at the beginning will affect how you can access, use, or preserve your data in the future.

Research data can exist in many forms, dependent on research area / discipline, including:

Documents (text, Word), spreadsheets
Laboratory notebooks, field notebooks, diaries
Questionnaires, transcripts, codebooks
Audiotapes, videotapes
Photographs, films
Test responses
Results from experiments
Slides, artifacts, specimens, samples
Artistic works, including dance, music performances, recordings,
Sculpture, painting, design
Collection of digital objects acquired and generated during the process of research
Database contents (video, audio, text, images)
Models, algorithms, scripts
Contents of an application (input, output, log files for analysis software, simulation software, schemas)
Methodologies, workflows
Standard operating procedures and protocols

(list adapted from Leeds University)

Choosing Formats

In planning a research project, it’s important that you consider which file formats you will use to store your data. In some cases, this will be dictated by the software you’re using or the conventions of your discipline, but in other cases you may have to make a choice between several options:

what software and formats you or colleagues have used in past projects.
any discipline-specific norms (and any peer support that comes with them).
what formats will be easiest to share with colleagues for future projects.
what formats are at risk of obsolescence, because of new versions or their dependence on particular software.
what formats it will be possible to open and read in the future.
what formats will be easiest to annotate with metadata so that you and others can interpret them days, months, or years in the future.

What formats are best for preserving files in the long term?

Popular formats such as those produced by Microsoft Office products (e.g. Word documents or Excel spreadsheets) are likely to have reasonable longevity, but be aware that they are proprietary (owned by someone) and so will not necessarily exist forever or remain easily readable. You may be better off storing important information in open, non-proprietary formats – for example, PDF/A rather than Microsoft Word, CSV rather than Excel, TIFF rather than Photoshop files, or as XML rather than a database.

What image format should I use?

Some images formats are better for particular purposes than others. For example, TIFFs preserve digital image information well, but users cannot view them with internet browsers and they take up a lot of computer storage space. Taking this into consideration, TIFF image files would make suitable master copies for archival purposes, particularly if the image content is important. For smaller images which are to be used for web delivery and for embedding in documents, JPEG format is suitable. JPEGs are compressed using 'lossy', which keeps the files from being too large. Each time a particular JPEG image is compressed, it loses some of its information, so over time, the image becomes blurry. This process means that JPEGs are not considered for archival processes.

The link below directs to the Digital Preservation Coalition's Handbook, which provides useful information on all aspects of digital preservation.

Digital Preservation Handbook - File formats and standards
A chapter on file formats and standards, from the Digital Preservation Handbook, by the Digital Preservation Coalition.
Citation: Digital Preservation Handbook, 2nd Edition, http://handbook.dpconline.org/ Digital Preservation Coalition © 2015 licensed under the Open Government Licence v3.0.

Organising your data

Once you create, gather, or start manipulating data and files, they can quickly become disorganised. To save time and prevent errors later on, you and your colleagues should decide how you will name and structure files and folder. Including documentation (or 'metadata') will allow you to add context to your data so that you and others can understand it in the short, medium, and long-term. Good metadata should be both computer and human-readable.

Naming and Organising Files

Agreeing on a logical and consistent naming convention at the beginning of your project will make it easier to find and correctly identify your files, prevent version control problems when working on files collaboratively, and generally prevent errors in research. Organising your files carefully will save you time and frustration and prevent duplication or errors by helping you and your colleagues find what you need when you need it.

Use folders ‐ group files within folders so information on a particular topic is located in one place.
Adhere to existing procedures ‐ check for established approaches in your team or department which you can adopt.
Name folders appropriately ‐ name folders after the areas of work to which they relate and not after individual researchers or students. This avoids confusion in shared workspaces if a member of staff leaves, and makes the file system easier for new staff or subsequent projects to navigate.
Be consistent - When developing a naming scheme for your folders it’s important that once you’ve decided on a method, you stick to it. If you can, try to agree on a naming scheme from the outset of your research project.
Structure folders hierarchically ‐ start with a limited number of folders for the broader topics, and then create more specific folders within these.
Separate ongoing and completed work ‐ As you start to amass lots of folders and files, it’s a good idea to start thinking about separating your old documents, from those you are currently working on.
Try to keep your ‘My Documents’ folder for files you're actively working on, and every month or so, move the files you're no longer working on to a different folder or location, such as a folder on your desktop, a special archive folder or an external hard drive.
Backup - Ensure that your files, whether they are on your local drive or on a network drive, are backed up.
Review records ‐ assess materials regularly or at the end of a project to ensure files aren’t kept needlessly. Put a reminder in your calendar so you don't forget!

What do I need to consider when creating a file name?

It is useful if your department/project agrees on the following elements of a file name:

Vocabulary – choose a standard vocabulary for file names, so that everyone uses a common language.
Punctuation – decide on conventions for if and when to use punctuation symbols, capitals, hyphens and spaces.
Dates – agree on a logical use of dates so that they display chronologically i.e. YYYY-MM-DD. Order - confirm which element should go first, so that files on the same theme are listed together and can therefore be found easily.
Numbers – specify the amount of digits that will be used in numbering so that files are listed numerically e.g. 01, 002, etc.

How should I name my files, so that I know which document is the most recent version?

Very few documents are drafted by one person in one sitting. More often there will be several people involved in the process and it will occur over an extended period of time. Without proper controls this can quickly lead to confusion as to which version is the most recent. Here is a suggestion of one way to avoid this happening:

Use a 'revision' numbering system. Any major changes to a file can be indicated by whole numbers, for example, v1 would be the first version, v2 the second version. Minor changes can be indicated by increasing the decimal figure for example, v1.01 indicates a minor change has been made to the first version, and v3.01 a minor change has been made to the third version.

When draft documents are sent out for amendment, they should return carry additional information to identify the individual who has made the amendments. Example: a file with the name 20100816_dataman_v1_sj indicates that a colleague (sj) has made amendments to the first version on the 16th August 2010. The lead author would then add those amendments to version v1 and rename the file following the revision numbering system.

Include a 'version control table' each important document, noting changes and their dates alongside the appropriate version number of the document. If helpful, you can include the file names themselves along with (or instead of) the version number.

Agree who will finalise documents, marking them as 'final.'

What are the benefits of sharing my data?

Many researchers fear that by sharing their data they will lose their competitive edge, that others will misinterpret or misuse their data or that their research methods will be open to scrutiny. However, there also benefits to be gained though sharing your data. For example it:

Allows independent validation of results
Increases the impact and visibility of research makes best use of investment by avoiding replication
leads to new collaborations and partnerships
advances research when datasets are combined in new and innovative ways

If you plan for data sharing from the beginning of your project, you can decide on a method of providing access that you are comfortable with.

Are there occasions when I shouldn’t share my data?

Issues of intellectual property rights, commercial potential or of privacy can all affect whether you can or should share your data.

Sensitive and confidential data can, however, often be shared ethically if informed consent for data sharing has been given, subjects' identities are anonymised (if needed) or consideration is given to access restrictions.

These measures should be planned from the beginning of your research to ensure that you are not limiting future opportunities to share your data.

The UK Data Archive has an excellent guide on consent, confidentiality and ethics as part of their Managing and Sharing Data guide, and they provide brief guidance and tool reommendations for Anonymisation.

Sharing data with collaborators

Please note: The University does not authorise or approve the use of DropBox. It should never be used for confidential, personal or sensitive data.

The Centralised Research Data Storage and Collaboration (CRDSC) is a University-hosted web platform that provides a central storage and collaboration space for documents, information and ideas. For example, a CRDSC site can help you:

Coordinate projects, calendars and schedules.
Discuss ideas and review documents or proposals.
Store and share research data with international collaborators

Check out the IT Support tab above, or contact Digital Services via the Centralised Research Data Storage and Collaboration website for advice on using the service to safely collaborate with external partners

Eduroam
Eduroam (education roaming) provides university members with secure wireless internet access at other participating institutions worldwide (including institutions in Europe, Asia-Pacific, and North America).
UK Data Service Impact Case Studies
Real examples of impact achieved through sharing data in a range of sectors

Data preservation

Your data are only truly open if other people can access them, understand them and reuse them. It is recognised good research practice by researchers and institutions to manage and retain data, fulfilling any legal requirements that may exist following the conclusion of research projects. This requires active preservation to ensure that the files continue to be readable over the long term, making this an important feature of the research data lifecycle. You should ensure that the repository you choose has active preservation procedures for digital data curation. The Digital Curation Centre highlight data preservation as a key aspect to consider when planning a new research project, particularly with data that are unique and irreplaceable if destroyed or lost. Without the ability to refer to verifiable data, your research may not be judged as sound.

Digital Repositories

There are numerous Digital Repositories and data centres with varying content types (e.g. articles, data sets, images, etc) and disciplinary foci. The majority of them share data openly with the public, or the research community.

OpenDOAR (Directory of Open Access Repositories) maintains an online list of open access digital repositories, and has a content search tool.

re3data.org is the Registry of Research Data Repositories, providing a global registry of data repositories from different academic disciplines, and its use is particularly recommended in the European Commission’s “Guidelines on Open Access to Scientific Publications and Research Data in Horizon 2020”.

Online stores of discipline or subject-specific data ('data centres') abound, but there is currently no definitive list of these.

Some examples of popular data centres include:

Preserving data at Coventry University

At Coventry University, we have the option of utilising the institutional repository for the storage of data and datasets. A short demonstration video on the 'Adding a Dataset to Pure' tab shows the process of adding data to the repository.

Why would I put my data in a digital repository?

Raise the impact of your research. Digital repositories allow you to make data easily accessible to more people than ever before. The more people who can use your data, the more public good it can do and the more it can do to enrich your field of research. Open online access makes new collaborations and uses of data possible. In some areas (e.g. Archaeological excavation data), the data is often unique and many researchers feel a moral compunction to make it available to others (and, of course, to ensure its long-term preservation).

Raise your research profile. The more other researchers cite your data, the more they will know and admire your work. As the trend toward online open access rises, the prestige associated with data citations is growing. In addition, making some data available can increase the credibility of your analyses.

Keep your data safe and readable in the long term. Many researchers hold on to an old computer from a decade or two ago because it is the only way to access their old files, created in formats that are now obsolete. Once these computers break, the files are essentially lost. Many repositories store and back up your treasured research products and will, if appropriate file formats are used, attempt to move the data into new file formats as the original formats become obsolete. So long as the repository exists, your materials will remain readable and usable.

Your funder may require it. This is more and more common. You can find summary of funders’ open access requirements using the SHERPA/JULIET database. Even if your funder does not require that you deposit your data, a plan to deposit your data may strengthen your bid.

If I published my paper/data in a peer-reviewed journal, can I still deposit it in an open digital repository?
This depends on the journal (especially for papers), but the majority do allow it. Contact your journal for more information, or, you can find summary information on journals’ copyright policies using the SHERPA/RoMEO database.

You can also ask your repository support team for help with this. Coventry University's Research and Scholarly Publications Team is happy to help you find an answer, regardless of your target repository.

Selection- Choosing what to keep

Choosing what to keep and what can be disposed of or deleted is always going to involve a subjective judgement, as nobody knows exactly what information is going to be wanted in the future.

All we can do is think the matter through carefully, abide by the policies we need to (e.g. from funders) and document decisions made and the reasons for them. It won’t be a perfect process, but should at least be a sensible one.

There are some good reasons why selection is worth doing:

Because storage costs money;
Storage requires effort / staff hours;
Storing massive amounts of data complicate finding and access of truly useful stuff.
Because Freedom of Information laws mean that what you keep on file may have to be disclosed, if requested.

How do I know what to keep and what to delete?

These following questions, based on material devised by the Digital Curation Centre, can help you decide what you should keep and what can be deleted:

Does my funder or the University need me to keep this data and / or make it available for a certain amount of time?
Does this data constitute the 'vital records' of a project, organisation or consortium and therefore need to be retained indefinitely?
Do I have the legal and intellectual property rights to keep and re-use this data? If not, can these be negotiated?
Does sufficient documentation and descriptive information (‘metadata’) exist to explain the data, and allow the data or record to be found wherever it ends up being stored?
If I need to pay to keep the data, can I afford it?

Once you've sorted through your files and asked these questions you then need to:

Check your data protection responsibilities.
Prepare documentation for each file.
Find out how to deposit in an institutional or subject-specific repository, as appropriate.

Uploading a Dataset to Pure

Follow these instructions to upload your data to Coventry University's institutional repository.

1. Navigate to [to be shared during the event]

2. Log in with username and password.

3. To add a new dataset, click on the green 'Add new' button in the top right-hand corner.

4. Select 'Dataset' from the submission options.

5. Start adding metadata to the record, including the title of the data and a description. If the data have been collected over a set period of time, add the dates in.

6. Add the names of the people involved in the data collection / creation to the record, including the role that they played in the research activity and the location in which they work.

7. Include the Organisational Unit by which the dataset is managed (this is usually the same as the location in which the researcher works).

8. Under 'Data availability', add the publisher(s) and if available, the Digital Object Identifier.

9. Upload the dataset and associated files in the 'Electronic data' box, any useful information to accompany the files, and the date that the dataset and files were made available. Ensure to include the correct reuse license as required by the funder / university.

10. Select the access options to the dataset - this will depend on the funder's and/or the University's open data requirements so it is important to refer to their documentation.

11. Adding contact details will ensure others can can request further information about the dataset, if required.

12. Include other useful information, if it is appropriate to the research (temporal coverage, geo location, legal/ethical).

13. Ensure the Visibility is set to 'Public - No restriction' and save the record for validation. It will then be checked by the Research and Scholarly Publications Team to ensure that the metadata details fulfil funder / university requirement.

Uploading a Dataset to Pure

Readme File Template A file to add to a metadata record, providing additional information to others who are interested in the research data.
Coventry University Repository Take Down Notice A document detailing how to request the removal of an item from the university's repository
Coventry University Repository Terms of Use A document explaining the terms on which one may make use of the Repository, whether as a guest or a registered user, End User or Authorised Depositor

Documentation and Metadata

To ensure that you understand your own data and that others may find, use and properly cite your data, it helps to add 'documentation' or 'metadata' (data about data) to the documents and datasets you create. This encompasses all the information necessary to interpret, understand and use a given dataset or set of documents.

It is good practice to begin to document your data at the very beginning of your research project and continue to add information as the project progresses. Include procedures for documentation in your data planning. There are a number of ways you can add documentation to your data:

Embedded documentation

Information about a file or dataset can be included within the data or document itself. For digital data sets, this means that the documentation can sit in separate files (for example text files) or be integrated into the data file(s), as a header or at specified locations in the file. Examples include:

Code, field and label descriptions
Descriptive headers or summaries
Transcripts
Recording information in the Document Properties function of a file (Microsoft)

Supporting documentation

This is information in separate files that accompanies data in order to provide context, explanation, or instructions on confidentiality and data use or reuse. Examples include:

Working papers or laboratory books
Questionnaires or interview guides
Final project reports and publications

Catalogue metadata

This is structured information which can be used to identify and locate the data that meet the user's requirements via a web browser or web based catalogue. Catalogue metadata is usually structured according to an international standard and associated with the data by repositories or data centres when materials are deposited with them. Examples are:

Title
Description
Abstract
Creator
Geographic location
Keywords

UK Data Archive - Document your data
A crucial part of making data user-friendly, shareable and with long-lasting usability is to ensure they can be understood and interpreted by any user. This requires clear data description, annotation, contextual information and documentation.
ANDS Metadata: Working Level
This comprehensive 11 page Guide is intended to provide a generic working-level view of the needs, issues, and processes around metadata collection and creation as it relates to research data.

Digital Object Identifiers

Digital Object Identifiers (DOI) are a set of alphanumeric characters which give outputs such as articles, conference papers, datasets and book chapters, a unique online identity. DOIs are usually found on the publication itself or on the publisher's webpage and this unique string of characters provides a stable, persistent identification for the lifetime of the output. Even when the content of an object is updated or the web address is changed, a DOI record will be updated but its link remains the same. As a DOI is a stable point of reference for an output, its link will always work and will never change, once created.

If you are wondering whether you need a DOI, consider that the addition of a persistent identifier to your outputs will help increase the research and the impact of your work. DOIs are a method of identifying specific objects and works accurately, which will aid in the connection of the outputs to their creators / authors, plus associated metadata and documentation.

The Research and Scholarly Publications Team help and encourage researchers to share and register data, make them more stable to locate online and easier to cite, by obtaining a DOI. DOIs can be assigned to reports, theses, datasets, online toolkits, and working / technical papers.

Please ask for further guidance by contacting the Team.

Data Protection

Coventry University Group is committed to processing personal/sensitive personal data in accordance with:

Data Protection Act 2018 (DPA) and General Data Protection Regulation 2016 (GDPR) sets out a legal framework for processing personal data.

You must comply with the DPA 2018 and/or GDPR principles as appropriate, if your research data includes:

Personal data, special categories of data (sensitive data), and data relating to criminal convictions and offences,
Sharing personal and/or sensitive personal data with an external third party e.g. collaborative research, using an external IT system to process the data,
Transferring personal and/or sensitive personal data outside of the European Economic Area (EEA),

GDPR places greater emphasis on organisations to comply with Data Protection legislation where organisation is acting as a Data Controller, who determine the purpose for which and the manner in which personal data are collected and processed (e.g. Coventry University is a Data Controller if your research is in connection with the University). The new Data protection regime also provides enhanced and new rights for individual such as Right to Data Portability, not all rights will apply where personal data is processed for the research project.

Please contact the University’s Information Governance Unit (IGU) who regulates governance and compliance over data protection and privacy matters. Please ensure you initiate the contact with IGU as early as possible in your project if you require advice or assistance on any data protection points or query.

Information Governance Unit - enquiry.igu@coventry.ac.uk

Guidance on information security incidents and digital compliance is available on the Digital Services:Information Security page (internal page).

To report a data breach incident - Please report the incident via the Data Breach Form and notify IGU via databreach.igu@coventry.ac.uk.

Useful Websites

I nformation Commissioner's Office

The DCC - Five Things You Need to Know About RDM and the Law: DCC Checklist on Legal Aspects of RDM

JISC Research Data Management Toolkit - Data protection regulation

Funder Data Plan Requirements

Funders expect data plans to cover how data will be collected or created, managed, shared and preserved. Plans should include information on expected / potential difficulties that may arise during a research project, along with causes and possible measures to overcome these difficulties.

UK Research and Innovation (UKRI) have set expectations on the routine management and sharing of research data, known as Common Principles. These common principles provide a framework for the individual Research Council policies on data policy.

The Digital Curation Centre (DCC) provide an overview of the coverage of individual funders' policies for publication and data, and the support that they provide for researchers. Full details are available directly from the individual funders' pages:

Arts and Humanities Research Council: Data Management Plan - Text for Funding Guide

Biotechnology and Biological Sciences Research Council: Data Management Plan Application Guidance

British Heart Foundation: How to apply for a research grant

Cancer Research: Practical guidance for researchers on writing data sharing plans

EPSRC: EPSRC Policy Framework on Research Data

ESRC: ESRC Research Data Policy

European Commission Horizon 2020: Data Management, Guidelines on FAIR Data Management in Horizon 2020

MRC: Data Sharing

NERC: Data Management Plan Guidance, Data Policy

STFC: Data Management Review Guidance

The Royal Society: Research Grants, Data Sharing and Mining

Wellcome Trust: How to complete an outputs management plan

Creating a Data Management Plan

A good DMP will usually cover the following themes, but will vary in exact details and requirements dependent on the funder that is being applied to:

The types of data will be created, including formats, volumes, and any standards or capture methods.
How ethics, intellectual property and data protection will be addressed and managed.
Plans for resource accessibility, sharing agreements and reuse of data, including timescales for public release.
Details on short term storage, including back-up strategies to prevent data loss.
Strategy for long-term storage, use, and sustainability for your data, data derivatives, and other research outputs.
Details of resources required.

Much of data management is simply good research practice that you will be doing already. Data plans are just a way of articulating or evidencing that you've thought about how to create, store, backup, share and preserve your data. The DCC has produced an interactive online tool to help researchers create data management plans: DMPonline The website has a record of major UK/European funder requirements, so it can also tailor the template to your particular funder.

F.A.I.R. Data

F.A.I.R. Data Principles are a set of principles to guide researchers in making their research data findable, accessible, interoperable and reusable (Wilkinson et al. 2016), directing data producers and publishers to promote maximum use of research data. The Principles also highlight the importance of data to be machine-readable, as humans rely on computers to search for and deal with increasing volumes of data, in addition to data complexity.

Following the FAIR Principles would be seen as good research practice by all Research Funders, particularly beneficiaries of Horizon 2020 funding. Data Management Plans for European Commission projects must address how datasets will be created, if these data can be made accessible and how they will be curated, stored and preserved. Further details can be found in the following documents:

H2020 Programme: Guidelines on FAIR Data Management in Horizon 2020

European Research Council (ERC): Guidelines on Implementation of Open Access to Scientific Publications and Research Data

FAIR

Go Fair expand on the granular details of the F.A.I.R. Principles (CC-BY 4.0):

Findable

The first step in (re)using data is to find them. Metadata and data should be easy to find for both humans and computers. Machine-readable metadata are essential for automatic discovery of datasets and services, so this is an essential component of the FAIRification process.

F1. (Meta)data are assigned a globally unique and persistent identifier

F2. Data are described with rich metadata

F3. Metadata clearly and explicitly include the identifier of the data they describe

F4. (Meta)data are registered or indexed in a searchable resource

Accessible

Once the user finds the required data, she/he needs to know how can they be accessed, possibly including authentication and authorisation.

A1. (Meta)data are retrievable by their identifier using a standardised communications protocol

A1.1 The protocol is open, free, and universally implementable

A1.2 The protocol allows for an authentication and authorisation procedure, where necessary

A2. Metadata are accessible, even when the data are no longer available

Interoperable

The data usually need to be integrated with other data. In addition, the data need to interoperate with applications or workflows for analysis, storage, and processing.

I1. (Meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation.

I2. (Meta)data use vocabularies that follow FAIR principles

I3. (Meta)data include qualified references to other (meta)data

Reusable

The ultimate goal of FAIR is to optimise the reuse of data. To achieve this, metadata and data should be well-described so that they can be replicated and/or combined in different settings.

R1. Meta(data) are richly described with a plurality of accurate and relevant attributes

R1.1. (Meta)data are released with a clear and accessible data usage license

R1.2. (Meta)data are associated with detailed provenance

R1.3. (Meta)data meet domain-relevant community standards

The principles refer to three types of entities: data (or any digital object), metadata (information about that digital object), and infrastructure. For instance, principle F4 defines that both metadata and data are registered or indexed in a searchable resource (the infrastructure component).

Go Fair - FAIR Principles
Fostering the coherent development of the global Internet of FAIR Data & Services (IFDS), with the main focus on early developments in the European Open Science Cloud (EOSC).
Implementing FAIR Data Principles: The Role of Libraries
The FAIR Data Principles are essential for libraries who want to foster and extend research data services. For libraries which do not already actively promote and include the FAIR Data Principles in their work, LIBER’s Research Data Management Working Group has produced a factsheet with tips on getting started.
The FAIR Guiding Principles for scientific data management and stewardship (CC-BY 4.0)
Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3:160018 doi: 10.1038/sdata.2016.18 (2016)

IT Support for Data Management

Coventry University Information Technology Service has processes in place to ensure: the completeness and accuracy of the backed up data; backup copies of data are stored in a secure manner; data can be restored from a required backup within a reasonable time from authorisation from a data custodian; control of media rotation; secure storage; copies of data are taken on schedule; the secure disposal of data and magnetic media when they are no longer required.

Many departments and research groups provide networked storage and SharePoint spaces for collaborative work. File size limitations, back-up capabilities, and remote access can vary. Further details are available via the File Storage & Sharing page in the Digital Services Catalogue. Find this page under the Communication and Collaboration menu.

Portable storage media such as CDs, DVDs, memory sticks (also known as USB sticks, flash drives, thumb drives, memory keys) are more risky are vulnerable to loss and damage. It is important not to rely on them as your only copy of important data.

They are very convenient though, and useful for:

temporary copies / moving files e.g. taking a presentation to a conference.
secondary or back-up copies
files only one person at a time needs access to.
data you can afford to lose.

Data sticks or CD/DVDs must be encrypted before posting them to external collaborators.

Centralised Research Data Storage and Collaboration

Digital Services host a centralised storage facility for research data in on-going research projects, Centralised Research Data Storage and Collaboration (CRDSC). Storage is mirrored to a secure ISO27001 off-campus site and subject to a Disaster Recovery Plan with secure nightly backups taken. CRDSC project spaces provide the following features for researchers and their collaborators:

Small Document Libraries which support files of up to 2GB and support SharePoint's out of the box features such as version control.
Large Document Libraries which are connected to a network share and can support large files over 2GB and large data sets of multiple terabytes.
Access for external partners to all of the content via the SharePoint platform.

For further details and to request a project space, navigate to the CRDSC pages.

Finding Data

You may be creating and collecting entirely new data for your project, but you can often draw on a wealth of data already available to complement or enrich your own research. Given the proper attention to Intellectual Property Rights (IPR), Data Protection and ethics, you may be able to process existing raw data to create entirely new research outputs.

Benefits of reusing data

There are many reasons why you may wish to reuse data:

Reusing data increases work efficiency, as both time and resources are saved. If data are already available and suitable for reuse, less money and time are required to collect new information.
Using data that are already available can help to avoid duplication of data collection.
Using other researchers' data can promote opportunities to expand on their research and broaden knowledge.
Reusing data that already exists takes less effort to collect than engaging in collecting new data.

When reusing existing data, you must ensure that you use the data in the way that the data owner has specified. A method which authors and creators of outputs use to provide a clear, regulated method of providing permissions for reuse of works is through Creative Commons licenses. Those interested in reusing the data then follow the license conditions.

The following table visually explains Creative Commons Licenses and their permissions.

Creative Commons Licenses explained - at a glance

Creative Commons Licenses explained - at a glance. from Xplore - your web agency

Locating data repositories

The following links will help you locate data repositories that may be of use to you.

The Directory of Open Access Repositories - OpenDOAR
As well as providing a simple repository list, OpenDOAR lets you search for repositories or search repository contents.
Discover - the UK Data Archive Search Tool
Search and browse data collections, support guides, case studies, and related publications.
re3data - Registry of Research Data repositories
re3data.org is a global registry of research data repositories that covers research data repositories from different academic disciplines.
figshare
A repository where users can make their research outputs available in a citable, shareable and discoverable manner.
Zenodo
Built and developed by researchers, to ensure that everyone can join in Open Science.
Dryad
The Dryad Digital Repository is a curated resource that makes the data underlying scientific publications discoverable, freely reusable, and citable. Dryad provides a general-purpose home for a wide diversity of datatypes.
Mendeley Data
Mendeley Data is an open research data repository, where researchers can upload and share their research data. Datasets can be shared privately amongst individuals, as well as published to share with the world.
DANS
DANS offers the following (certified) services:
DataverseNL : during your research you can save and share research data via DataverseNL .
EASY : after your research you can permanently store and share research data via the online archiving system EASY .
NARCIS : other research data, such as your research projects and publications, can be shared via science portal NARCIS .
In addition to these services, DANS guides other archives, research institutes and research financiers in questions relating to data management, certification and subjects such as FAIR, open access and software sustainability.
DataHub
Datahub provide various solutions to publish and deploy data with power and simplicity.

Data Tree
A free online course with all you need to know for research data management, along with ways to engage and share data with business, policymakers, media and the wider public.
FOSTER Open Science Portal
The FOSTER portal is an e-learning platform that brings together the best training resources addressed to those who need to know more about Open Science, or need to develop strategies and skills for implementing Open Science practices in their daily workflows.
JISC Guides
JISC produce a range of guides that summarise complex legal information, including:
"How and why you should manage your research data: a guide for researchers"; "Data protection and research data"; and "Meeting the requirements of the EPSRC research data policy".
MIT Data Management guides
Detailed advice on planning, storing and sharing your research data
UK Data Archive's Managing and Sharing Data guide
Best practice guidance produced by the UK Data Archive, and endorsed by many organisations including JISC and ESRC.
UK Data Archive RDM advice
A thorough look at many aspects of Research Data Management, with detailed answers to a wide range of FAQ

Other RDM resources and links to policies

Research Data Management and Sharing Standard

CU Data Breach Policy

Coventry University Open Access Policy

CU Data Management Policy
CU Data Management Policy written by the IPU, which aims to identify the roles and responsibilities for ensuring that the University Group both achieves and maintains, through the adoption of ‘good practice’, Legal Compliance with, and adherence to, Data Protection Laws in the various Jurisdictions within which it operates. (Link is to CU staff portal only)

JISC Research Data Blog This blog is written by the Jisc team that work on research data activity, it is largely concerned with activity relating to research data management and use.
University of Edinburgh Research Data Blog

MANTRA RDM training
MANTRA is a free online course for those who manage digital data as part of their research project. Developed by Edina and the University of Edinburgh
Research Data Management and Sharing
Free online course developed by The University of North Carolina at Chapel Hill, and The University of Edinburgh. Provided by Coursera.
Data Tree
A free online course with all you need to know for research data management, along with ways to engage and share data with business, policymakers, media and the wider public.

Ethics at Coventry University

This page provides general guidance rather than legal advice - please follow the University weblinks provided for further support and advice.

Intellectual Property Rights (IPR) (e.g. copyright, patents, etc) affect the way both you and others can use your research outputs.

Failure to clarify rights at the start of the research process can lead to unexpected limitations to:

your research,
its dissemination,
future related research projects, and
associated profit or credit.

It can also cause you legal trouble.

Further information on IPR can be found on the University's IPR webpages

Frequently Asked Questions

Are research data or data derivatives protected by copyright law?

Copyright law sometimes protects data and other research products (provided that you share them with the proper copyright statement or end-user agreement), but it depends on the nature of your data or files.

The University has a Copyright webpage, which provides information and contacts for who to consult on copyright questions in various situations (e.g. research grants and funding, commercialisation and intellectual property, etc).

A seminar was held in 2011 (hosted by CRASHH and the Incremental Project). Andrew Charlesworth (Centre for IT & Law, University of Bristol) gave a presentation that addressed some copyright issues:

'Intellectual Property Rights and Research Data - Focus on copyright' [32 mins 6 secs].

He also participated in a short interview on the same subject [2 mins 34 secs].

What are my intellectual property rights with regard to research data at Coventry University?

This depends on whether you are a student, post-doc, PI/project director, your relationship with the University, your role in the project, and your agreement with other parties (funders, study participants, corporate partners, etc). Advice can be sought via the University's webpages and by contacting colleagues listed below, under the 'Who can help me with Copyright and IPR?' question.

Can I use materials that I find online?

It depends on how those materials are licensed. IPR is usually in play, even if you don't see a "©" or 'all rights reserved' notice. When in doubt, contact the University Copyright Officer (contact information in FAQ below) for advice, or ask the website administrator or publisher who distributed the content for permission directly.

The Web2Rights project has produced a useful IPR & Legal Issues Toolkit for the web.

How can I make it easier for others to re-use the materials that I produce?

One relatively simple way to make it easier for others to re-use tools, data, or other content that you produce is to add a Creative Commons license.

For example ‘By-Attribution, Non-Commercial’ is a common Creative Commons license – when you mark your file, image, or information with this, it means that anyone can use your information in any way they like, so long as they attribute it to you and don’t use it for commercial purposes. Creative Commons licenses are often used for materials released online, but you can also include these in printed materials if you don't have a publisher who owns the rights. For additional information and Creative Commons license options, visit the www.creativecommons.org.

To license something with a Creative Commons license, you don't need to file any paperwork -- just publish (in print or on the web) your materials along with a notification that you are using a particular license.

IMPORTANT NOTE: Creative Commons licenses are 'irrevocable' so don't add a Creative Commons license unless you are sure that (1) you have the right to publish this information, and (2) you won't want to re-voke it later on for any reason.

Who can help me with Copyright and IPR?

For information on Copyright, please contact:

Phil Brabban

University Librarian and Group Director of Learning Resources, LIB

Telephone: 024 7688 7519

E-mail: p.brabban@coventry.ac.uk

For general questions on IPR or to discuss Intellectual Property Disclosure Forms, contact:

Mandy Tipple

Business Development Support Office

Mobile: 07974 98 4387

E-mail: ipr@coventry.ac.uk

Director of IP Services

Brian More

Mobile: 07974 98 4928

E-mail: ipr@coventry.ac.uk

For questions touching on commercialisation, contact:

Tim Francis

IPR Commercialisation Manager

Mobile: 07557 42 5047

E-mail: ipr@coventry.ac.uk

What rights do other people have to request my work - i.e. Freedom of Information Act (FOI)?

The Freedom of Information Act of 2000 (FOIA) gives all members of the public the right to request any information produced with public money, but there are some exemptions.

For information about FOI at Coventry, see the CU FOI page.

Further Reading

Web2Rights IPR & Legal Issues Toolkit Information on intellectually property rights pertaining to Web 2.0 internet resources.

Alex Ball has created a presentation for the Digital Curation Centre/University of Bath on Derestricting Datasets: How to License Research Data

ICO Guide to Data Protection
The Information Commissioners Office, is the organisation responsible for Data Protection and Freedom of Information. They have produced a useful guide.

As members of a publicly-funded university, you may receive requests for information under the Freedom of Information Act 2000 (FOI) or Environmental Information Regulations 2004 (EIR).

Deadline
Once the University has received an information request, it has 20 working days to respond to an FOI request and up to 40 for an EIR request. Both FOI and EIR include a number of exemptions and exceptions respectively against disclosure. This is because the legislation recognised that not all official information ought to be disclosed. For example to protect information such as confidential, sensitive data or personal information. If you are unsure about disclosure, consult the University's FOI officer foia @coventry.ac.uk.

CU Freedom of Information guidance
Information Commissioner's Office Guide to freedom of information
The Information Commissioner’s Office (ICO) is the UK’s independent authority set up to uphold information rights in the public interest, promoting openness by public bodies and data privacy for individuals.
Information Commissioner's Office Guide to the Environmental Information Regulations
This guide is for those who work for a public authority and have day-to-day responsibility for environmental information. It explains how to apply the Act by giving practical examples and answering frequently asked questions.

Glossary

Advance Online Publication - Some publishers enable articles to be published online as soon as they have been fully copy-edited and proof-checked, ahead of the final, ‘printed’ version. This version of the article is in exactly the same format as they appear in the final issue except for page numbering. Any embargo periods pertaining to Open Access start from this release date. Also known as Early or First online publication.

Article Processing Charge (APC) - Fee which may be payable to the publisher to publish via the gold open access route. When an article is published in a traditional subscription journal, the author pays an APC to make their individual article freely available from the journal website, without restriction or charge to the reader.

Bibliographic Record - The bibliographic description of a digital publication. Search engines crawl the internet to find documents and, depending on the quality of the metadata, they list the 'hits'. The high-quality metadata for items deposited in repositories enables the documents to be easily discoverable. Also known as Publication record or Metadata.

CC-BY Licence - Creative Commons Attribution Licence. This is the most liberal of the CC licences. As long as the original author(s) receives attribution, this allows anyone to copy, distribute or transmit the research, adapt the research and make commercial use of the research. RCUK requires this licence is used if the gold open access route is selected. The Wellcome Trust encourage its use, and will cover the costs of any APC where an article is published under this licence.

COAF - Charity Open Access Fund - Comprised of six medical research charities - Arthritis Research UK, Breast Cancer Campaign, the British Heart Foundation, Cancer Research UK, Leukaemia & Lymphoma Research, and the Wellcome Trust. Research funded by any of these charities must meet their Open Access requirements. See COAF (Wellcome Trust).

Corresponding Authors - The author responsible for manuscript correction, correspondence during submission, handling of revisions and re-submission of the revised manuscript. On acceptance of the manuscript, the corresponding author is responsible for co-ordinating any application for payment of a Gold Open Access Article Processing Charge (APC).
Creative Commons Licences - Creative Commons licences can be used in open access publishing to help authors retain copyright while allowing others to copy, distribute, and make use of their work. There are several different Creative Commons licences, which allow different types of re-use. See the Creative Commons website.
Curve open - CURVE Open is the University's repository for educational resources and open access items other than research publications. The aim of this open access institutional repository is to showcase University research and teaching, increasing accessibility to, and raising the visibility of our authors work.
DOI - Digital Object Identifier - A unique identifier for an online document, used by most online journal publishers. As the DOI is unique to the publication, linking to an online document by its DOI provides more stable linking than simply referring to it by its URL.

Embargo Period - An embargo in academic publishing is a period during which access to a research publication self-archived in an open access repository (Green open access) is restricted. The purpose of this is usually to protect the revenue of publishers who rely on subscription payments to cover the costs of publication.

Europe PubMed Central (Europe PMC) - A life sciences and biomedical research subject repository. The Wellcome Trust, the Medical Research Council (MRC) and most other UK biomedical funders require copies of funded articles to be deposited in Europe PMC within 6 months of publication. The USA-based PubMed Central is the repository containing global content

Gold Open Access - The full text of the article is instantly available to anyone without a subscription or viewing fee from the publisher's website. The author may need to pay an "article processing charge" (APC) to the publisher.

Green Open Access - Author publishes in a traditional, subscription based journal and a copy of the research (usually the author’s final, peer-reviewed manuscript – sometimes referred to as a post-print) is deposited in either an institutional or subject repository, usually at the point of publication. No APC is paid to the publisher. Following any embargo period set by the publisher the manuscript is then made free to access. The published final version of the journal sits behind a subscription pay wall on the journal website, while the "post-print" copy is available to anyone from the repository.

Hybrid Open Access - When an article is published in a traditional subscription journal, but where the author pays an APC to make their individual article freely available from the journal website, without restriction or charge to the reader. This means that some articles in that journal will only be available to subscribers whereas others (where the author has paid an APC) will be freely available to everyone

Institutional Repository - Online digital archive of an institution’s research publications. Previously, CU hosted CURVE Open, however in Spring 2017 there will be a managed transition to a PURE based repository for publications.

Open Access - Open access is the practice of providing free, unlimited online access to scholarly works and research outputs in a digital format, with limited restrictions on re-use. A key driver behind OA has been to make publicly-funded research accessible to tax-payers.
Pre-print - This is usually defined as the author's final draft of a paper before peer-review. It is also often referred to as the author's submitted manuscript. Many publishers allow authors to place the pre-print in a repository. However, pre-print versions do not normally meet funder requirements.

Post-print - Refers to the final draft author manuscript, as accepted for publication, including modifications based on referees' suggestions but before it has undergone copy-editing and proof correction. It is often referred to as the author's accepted manuscript. The post-print version is the one that should ideally be deposited in the CU institutional repository in order to meet REF and funder requirements.

Published PDF - The formatted PDF file that appears in the journal. This version will be the publisher's copy-edited PDF with final page numbers, typesetting and journal branding included. Many publishers will not allow you to self-archive the published version unless you have paid an APC to make the paper openly available immediately (gold route).

Publisher Agreement - When you publish your paper you will probably sign a 'publisher agreement’. This document states your rights as an author, so it is always worthwhile keeping a copy. On the publisher agreement it should state whether you can make your article available on CU intuitional repository.

RCUK (Research Councils UK) Mandates - RCUK policy is that all research funded by them should be freely and immediately available. The RCUK preference is for gold OA, but they do support a mixed approach; the decision on which OA route to follow is taken by the individual author/institution. If the gold route is not available or appropriate, RCUK requires green deposit for a funded research output within 0-12 months of publication (see embargo). The AHRC require funded research-level theses to be OA within 12 months.

REF - Research Excellence Framework - The Research Excellence Framework is the system for assessing the quality of research in UK higher education institutions (HEIs). The next REF is in 2021. To be eligible for the next REF, staff will have to meet open access requirements. These must be met at the time papers are accepted for publication. To access the HEFCE policy for REF 2021 click here.

Wellcome Trust Mandates - The Wellcome Trust supports unrestricted access to publications wholly or partly funded by them. The outputs must be made available in PubMed Central or Europe PubMed Central within 6 months of final publications. They will provide grant-holders (via their institution) with additional funding to cover OA charges where appropriate.

RDM Appointments Available

Contact Us

Research & Scholarly Publications
FL320, Lanchester Library
Coventry University
Frederick Lanchester Building
Gosford Street
Coventry, United Kingdom
CV1 5DD
Telephone: 024 7765 7568

Email:
Open Access and Institutional Repository - oa.lib@coventry.ac.uk

Research Data Management - rdm.lib@coventry.ac.uk

Centre for Future Transport and Cities: Research Data Management

Research Data Management

Research Data Management

Research Data Lifecycle

Data Management for Funding Bids

Creating your data

Choosing Formats

What formats are best for preserving files in the long term?

What image format should I use?

Organising your data

Naming and Organising Files

What do I need to consider when creating a file name?

How should I name my files, so that I know which document is the most recent version?

What are the benefits of sharing my data?

Are there occasions when I shouldn’t share my data?

Sharing data with collaborators

Data preservation

Digital Repositories

Preserving data at Coventry University

Why would I put my data in a digital repository?

Selection- Choosing what to keep

How do I know what to keep and what to delete?

Uploading a Dataset to Pure

Uploading a Dataset to Pure

Documentation and Metadata

Embedded documentation

Supporting documentation

Catalogue metadata

Digital Object Identifiers

Data Protection

Funder Data Plan Requirements

Creating a Data Management Plan

F.A.I.R. Data

Findable

Accessible

Interoperable

Reusable

IT Support for Data Management

Centralised Research Data Storage and Collaboration

Finding Data

Benefits of reusing data

Locating data repositories

Other RDM resources and links to policies

This page provides general guidance rather than legal advice - please follow the University web​links provided for further support and advice.​

Glossary

RDM Appointments Available

Contact Us

This page provides general guidance rather than legal advice - please follow the University weblinks provided for further support and advice.