Depositor-Defined Identifiers
Identifiers for key entities such as compounds and assays are defined within the deposition files by using Depositor-Defined Identifiers (DDIs). These identifiers are created and maintained by depositors themselves, and provide a way for depositors to:
Give a meaningful unique identifier to each compound and assay in their datasets.
Update sets later with additional data.
In a basic data submission DDIs are used for compounds/substances, assays and references, and are called CIDX, AIDX and RIDX, respectively.
DDIs are also used for internal reference between records in a dataset, when more complex data is being submitted. ACT_ID, TEOID, REGID and SAMID are used to group together supplementary data. More information on the links between tables can be found in the Deposition Files section.
Types of Depositor-Defined ID
*IDX | Description | Primary File |
AIDX | Assay ID | ASSAY |
CIDX | Compound ID | COMPOUND_RECORD |
RIDX | Reference ID | REFERENCE |
ACT_ID | ACTivity ID | ACTIVITY |
TEOID | TEst Occasion ID | ACTIVITY |
REGID | REcord Grouping ID | ACTIVITY_SUPP |
SAMID | Supplementary Activity Mapping ID | ACTIVITY_SUPP |
Depositors may use a string of between 1 and 200 visible UniCode characters to define these identifiers. All deposited files should bu UTF-8 encoded.
DDIs are owned by the depositor; a given identifier is defined by the depositor when a deposition is first made.
Subsequent use of the same identifier in future ChEMBL loads will result in the updating or overwriting of the data that was associated with this identifier when it was first loaded
Last updated