ChEMBL Data Deposition Guide

Simplified input data schema



This diagram is a minimal example of the data schema. It shows only the fields that are mandatory (in bold) or that are required to link data together within or between files. Fields marked with a * are primary keys.

RIDX is not mandatory in some files because it is not needed when loading additional data to existing assays. In every other case, RIDX is mandatory in any file that includes it. If you omit your RIDX, records cannot be linked to your references and data loading will fail.
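
The RIDX rule above can be checked before submission. The following is a minimal sketch, not part of the ChEMBL loader: the function name `find_unlinked_ridx`, the `ridx_required` flag, and the sample identifiers are all hypothetical, and it assumes each file has already been parsed into a list of dictionaries keyed by field name.

```python
def find_unlinked_ridx(records, reference_ridx, ridx_required=True):
    """Return (row_number, ridx) pairs that cannot be linked to a reference.

    records        -- rows from a deposition file, one dict per row (assumed layout)
    reference_ridx -- set of RIDX values declared in your references
    ridx_required  -- set to False when adding data to existing assays,
                      where a missing RIDX is acceptable
    """
    problems = []
    for row_number, record in enumerate(records, start=1):
        ridx = (record.get("RIDX") or "").strip()
        if not ridx:
            # A missing RIDX is only a problem when the field is mandatory.
            if ridx_required:
                problems.append((row_number, None))
            continue
        if ridx not in reference_ridx:
            # A RIDX that matches no declared reference cannot be linked.
            problems.append((row_number, ridx))
    return problems


# Hypothetical sample data, for illustration only.
references = [{"RIDX": "MyPaper_2024"}]
assays = [
    {"AIDX": "Assay_1", "RIDX": "MyPaper_2024"},
    {"AIDX": "Assay_2", "RIDX": "Unknown_ref"},
    {"AIDX": "Assay_3", "RIDX": ""},
]

known = {row["RIDX"] for row in references}
problems = find_unlinked_ridx(assays, known)
```

Here `problems` lists the second row (an RIDX that matches no reference) and the third row (an empty RIDX). Passing `ridx_required=False` would report only the second row.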

Some fields accept only certain valid identifiers, and there are additional fields that are not mandatory. Both are outlined in the section Field names and data types.