Publish your COVID-19 research data to make it available for the rest of the research community. The data should be deposited in a public repository together with descriptive metadata. For many biological datatypes, there are international databases that can be considered de facto standards.
Submitting data
The European Bioinformatics Institute (EBI) hosts many different international data repositories which should be used if appropriate. For further information, see their COVID-19 Data Portal data submission page.
Here are our data submission guidelines for each specific data type:
-
Genomics & transcriptomics data
We suggest that raw virus sequence data as well as assembled and annotated genomes are submitted to ENA. See documentation about submission at SARS-CoV-2 submission. Before submission of raw sequence data (e.g. shotgun sequencing) it is necessary to remove contaminating human reads.
Host (human) sequence data requires restricted access, and ELIXIR Greece is building a local federated version of the European Genome-phenome Archive (EGA), allowing for the publication of sensitive personal data within a legal framework. Until local EGA is available, the dataset should remain in the secure analysis environment.
-
Protein data
For a curated list of relevant proteomics repositories see FAIRsharing using the query ’proteomics’.
We recommend to use the PRIDE repository provided by the ProteomeXchange Consortium. The repository admits protein and peptide identification/quantification data with the accompanying mass spectra evidence and any other related data types. Submission is done using the PX Submission Tool.
-
Imaging data
Depending on the type of image data you have, different public repositories are available, please see the table at BioImage Archive.
-
Biochemistry
We suggest that users submit data to ChEMBL which is a manually curated database of bioactive molecules with drug-like properties run by EMBL-EBI. It brings together chemical, bioactivity and genomic data to aid the translation of genomic information into effective new drugs.
Depositing data in a public repository
By depositing data in a public repository, you accept that the data will be published and may be used and redistributed according to specific terms set by yourself or the repository in question. Data deposition is considered important and you may get credit and acknowledgement for sharing data. A publication often results in getting a unique identifier for your dataset, which is often required for a journal to accept a manuscript for publication. Data sharing is often required by funders.