Globus Deposition and Folder Structure

How to structure your deposition folder and send it to us

We use Globus to receive data. You can log in with an institutional login, GitHub, Google, or ORCID, or you can sign up for an individual account.

By using Globus, we ensure that each depositor has an individual folder containing their most recent deposition. We do not accept data by email, as it is often unclear which is the latest dataset that someone has sent.

If you are unable to access Globus, please contact us for support at chembl-deposition@ebi.ac.uk.

Folder Structure Rules

  • We receive a lot of datasets and we want to clearly distinguish your data from similar sets.

  • Folder names should describe your institution, your group, and what the data is.

  • If you are submitting multiple datasets at once, each subfolder name should describe each particular dataset.

  • Do not use special characters such as spaces, full stops (.) or slashes (/) in folder names. These can cause problems on UNIX filesystems.

  • Underscores (_) and dashes (-) are our preferred substitutes for spaces.

  • Save your tab-separated files with the file ending .tsv, not .txt. It is substantially easier to inspect your files when our software knows they are tab-separated.

  • Do not include any files other than the data files and an optional README file in the data folder.

Good example:

Cambridge_Smith_Lab_T_gondii_09-09-2025/
Cambridge_Smith_Lab_T_gondii_09-09-2025/Binding_IC50
Cambridge_Smith_Lab_T_gondii_09-09-2025/Cell_Death
Cambridge_Smith_Lab_T_gondii_09-09-2025/Cell_Death/REFERENCE.tsv 

Bad example:

ChEMBL Deposition/
ChEMBL Deposition/Assay.1
Smith_Lab_T_gondii_09/09/2025
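
If you would like to check folder names against the rules above before uploading, a small script along the following lines may help. This is only a sketch and is not part of any ChEMBL or Globus tooling: the function name folder_name_problems is hypothetical, and treating every character other than letters, digits, underscores and dashes as "special" is an assumption, since the rules only name spaces, full stops and slashes explicitly.

```python
import re

# Characters that the rules above name explicitly as not allowed.
FORBIDDEN = {" ": "space", ".": "full stop", "/": "slash"}

# Assumption: any character other than letters, digits, underscores and
# dashes also counts as "special". The rules do not give a complete list.
# The three characters in FORBIDDEN are excluded here so that they are
# only reported once.
OTHER_SPECIAL = re.compile(r"[^A-Za-z0-9_\- ./]")


def folder_name_problems(name):
    """Return a list of problems found in a single folder name.

    An empty list means the name passes these (assumed) checks.
    """
    problems = []
    for char, label in FORBIDDEN.items():
        if char in name:
            problems.append(f"contains a {label}")
    extra = sorted(set(OTHER_SPECIAL.findall(name)))
    if extra:
        problems.append("contains other special characters: " + " ".join(extra))
    return problems


if __name__ == "__main__":
    # Folder names taken from the good and bad examples above.
    for candidate in [
        "Cambridge_Smith_Lab_T_gondii_09-09-2025",
        "Binding_IC50",
        "ChEMBL Deposition",
        "Assay.1",
        "Smith_Lab_T_gondii_09/09/2025",
    ]:
        found = folder_name_problems(candidate)
        print(candidate, "->", found if found else "OK")
```

The function checks one folder name at a time, not a full path, so each level of your folder structure should be passed to it separately. It does not check file names, which legitimately contain a full stop in the .tsv ending.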
