COMPOUND_RECORD.tsv

COMPOUND_RECORD files provide links between a record ID and a compound ID. If a CIDX-RIDX combination does not yet exist in the COMPOUND_RECORDS table for this src_id, then one will be created. If one already exists, then this existing one will be updated with the COMPOUND_NAME, etc, present in the incoming file. The CIDX is then referenced in ACTIVITY files, to identify which compounds were used.

Please ensure that the compounds listed in this file exactly match the structures in the COMPOUND_CTAB.sdf file; this includes any salts, hydrates etc. ChEMBL has a compound hierarchyarrow-up-right which connects parent compounds to salts, but it is imperative that the compounds in COMPOUND_RECORD.tsv and COMPOUND_CTAB.sdf files exactly match the compounds used experimentally.

You must provide at least one of COMPOUND_NAME, COMPOUND_KEY or COMPOUND_SOURCE in order to make the compound searchable.

Header

Description

Existence

Data Type

CIDX

The CIDX set by the depositor - a primary key

Mandatory

Any character up to a length of 200.

Should be a meaningful unique identifier for each Compound, not just a number.

RIDX

The RIDX cited by the depositor. MUST be owned by the same depositor

Mandatory

Any character up to a length of 200

COMPOUND_KEY

The local synonym used for this compound in the RIDX referenced

Mandatory

Any character up to a length of 250

COMPOUND_NAME

The name used for this compound in the RIDX referenced

Mandatory

Any character up to a length of 4000

COMPOUND_SOURCE

The source of this compound in the RIDX referenced

Optional

Any character up to a length of 400

Last updated