Globus Deposition and Folder Structure
How to structure your deposition folder and send it to us
We are using Globus to recieve data. You can log in with an institutional login, GitHub, Google, or ORCID. Or, you can sign up for an individual account.
By using Globus, we ensure that each depositor has an individual folder containing their most recent deposition. We do not accept data by email, as it is often unclear which is the latest dataset that someone has sent.
If you are unable to access Globus, please contact us for support at chembl-deposition@ebi.ac.uk.
Folder Structure Rules
We recieve a lot of datasets and we want to clearly distinguish your data from similar sets.
Folder names should describe your instution, group, and what the data is.
If you are submitting multiple datasets at once, each subfolder name should describe each particular dataset.
No special characters like spaces, full stops
.
or slashes/
in folder names. These can break UNIX filesystems.Underscores
_
and dashes-
are our preferred 'space' character.Save your tab-separated files with the file ending
.tsv
not.txt
. It is substantially easier to inspect your files when our software knows they are tab-separated.Do not include any files other the data files and an optional
README
file in the data folder.
Good example
Cambridge_Smith_Lab_T_gondii_09-09-2025/
Cambridge_Smith_Lab_T_gondii_09-09-2025/Binding_IC50
Cambridge_Smith_Lab_T_gondii_09-09-2025/Cell_Death
Cambridge_Smith_Lab_T_gondii_09-09-2025/Cell_Death/REFERENCE.tsv
Bad example:
ChEMBL Deposition/
ChEMBL Deposition/Assay.1
Smith_Lab_T_gondii_09/09/2025
Last updated