Example dataset
An example dataset, showing the structure of all required files and advice on avoiding common issues.
A simple set of example data files, suitable for use as a template, is available below:
The dataset below is a truncated example showing only the first SDF record and first three rows of each file type. The full files are available in the "Simple Example Dataset" download. Mandatory fields are in bold.
PLEASE NOTE: For ease of viewing, the column headers are shown as rows in this example. When submitting data to ChEMBL, rotate the layout by 90 degrees so that the field names become the column headers and each row beneath them is a single data entry.
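The rotation described above can be sketched in Python as follows. This is a minimal illustration only: the `displayed` mapping and the `to_tsv` helper are hypothetical names, the values are copied from the compound rows in this example, and tab-separated output is assumed.

```python
import csv
import io

# Field-per-row layout as displayed in this example. The values are copied
# from the compound rows that follow; COMPOUND_SOURCE is empty for each entry.
displayed = {
    "CIDX": ["MMV010764", "MMV026468", "MMV011229"],
    "RIDX": ["Pathogen_Box_Bloggs"] * 3,
    "COMPOUND_NAME": ["MMV010764", "MMV026468", "MMV011229"],
    "COMPOUND_KEY": ["MMV010764", "MMV026468", "MMV011229"],
    "COMPOUND_SOURCE": ["", "", ""],
}


def to_tsv(fields):
    """Rotate a field -> values mapping into tab-separated text with a header row."""
    lengths = {len(values) for values in fields.values()}
    if len(lengths) != 1:
        raise ValueError("every field needs the same number of entries")
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow(fields.keys())            # field names become the column headers
    writer.writerows(zip(*fields.values()))   # one line per data entry
    return buffer.getvalue()


tsv_text = to_tsv(displayed)
print(tsv_text)
```

The length check is there because a field with a missing value would otherwise silently shift or drop entries when the layout is rotated; empty cells should be written as empty strings instead.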
COMPOUND_RECORD.tsv
CIDX | MMV010764 | MMV026468 | MMV011229
RIDX | Pathogen_Box_Bloggs | Pathogen_Box_Bloggs | Pathogen_Box_Bloggs
COMPOUND_NAME | MMV010764 | MMV026468 | MMV011229
COMPOUND_KEY | MMV010764 | MMV026468 | MMV011229
COMPOUND_SOURCE
ASSAY.tsv
AIDX | PB_FECH | PB_HMBS
RIDX | Pathogen_Box_Bloggs | Pathogen_Box_Bloggs
ASSAY_DESCRIPTION | Compound was evaluated for the inhibition of human FECH at 10uM | Compound was evaluated for the inhibition of human HMBS at 100uM
ASSAY_TYPE | B | B
ASSAY_ORGANISM | Homo sapiens | Homo sapiens
ASSAY_STRAIN
ASSAY_TAX_ID | 9606 | 9606
ASSAY_TISSUE
ASSAY_CELL_TYPE
ASSAY_SUBCELLULAR_FRACTION
ASSAY_SOURCE
TARGET_TYPE | PROTEIN | PROTEIN
TARGET_NAME | FECH | HMBS
TARGET_ACCESSION | P22830 | P08397
TARGET_ORGANISM | Homo sapiens | Homo sapiens
TARGET_TAX_ID | 9606 | 9606
ASSAY_PARAM.tsv
AIDX | PB_FECH | PB_HMBS
TYPE | CONC | CONC
RELATION | = | =
VALUE | 10 | 100
UNITS | uM | uM
TEXT_VALUE
COMMENTS
ACTIVITY.tsv
RIDX | Pathogen_Box_Bloggs | Pathogen_Box_Bloggs | Pathogen_Box_Bloggs
CRIDX | Pathogen_Box_Bloggs | Pathogen_Box_Bloggs | Pathogen_Box_Bloggs
CRIDX_DOCID
CRIDX_CHEMBLID
CIDX | MMV161996 | MMV202458 | MMV676395
SRC_ID_CIDX
AIDX | 1 | 1 | 1
TYPE | Inhibition | Inhibition | Inhibition
ACTION_TYPE | ANTAGONIST | ANTAGONIST | ANTAGONIST
TEXT_VALUE
RELATION | = | = | =
VALUE | 0 | 1 | 61
UPPER_VALUE
UNITS | % | % | %
SD_PLUS
SD_MINUS
ACTIVITY_COMMENT | Not active | Not active | Active
ACT_ID | PB_FECH_MMV161996 | PB_FECH_MMV202458 | PB_FECH_MMV676395
TEOID
ACTIVITY_PROPERTIES.tsv
ACT_ID | PB_FECH_MMV161996
TYPE | HILL_SLOPE
RELATION | =
VALUE
UNITS
TEXT_VALUE | 1.1
COMMENTS
RESULTS_FLAG
ACTIVITY_SUPP.tsv
REGID | 0 | 1 | 2 | 2
SAMID | 0 | 1 | 2 | 2
TYPE | Treatment | Treatment | Treatment | "Side Effect (Y,N)"
RELATION | = | = | = | =
TEXT_VALUE | Y
UNITS
VALUE
COMMENTS | 1 | 1 | 1 | "Y,N"
ACTIVITY_SUPP_MAP.tsv
ACT_ID | PB_FECH_MMV161996 | PB_FECH_MMV202458 | PB_FECH_MMV676395 | PB_FECH_MMV676395
SAMID | 0 | 1 | 2 | 2
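One common source of problems in multi-file datasets is an identifier that appears in one file but not in the file it is meant to link to. The sketch below checks the ACT_ID and SAMID pairs from the example against the ACT_ID values in the activity rows and the SAMID values in the supplementary rows. It assumes that identically named columns must match across files, which is inferred from the shared column names in this example rather than stated as a rule here; `unmatched_keys` is a hypothetical helper, not part of any ChEMBL tooling.

```python
# ACT_ID and SAMID values copied from the example rows above.
activity_act_ids = ["PB_FECH_MMV161996", "PB_FECH_MMV202458", "PB_FECH_MMV676395"]
activity_supp_samids = ["0", "1", "2", "2"]
supp_map = [
    ("PB_FECH_MMV161996", "0"),
    ("PB_FECH_MMV202458", "1"),
    ("PB_FECH_MMV676395", "2"),
    ("PB_FECH_MMV676395", "2"),
]


def unmatched_keys(supp_map_rows, known_act_ids, known_samids):
    """Return the sorted (ACT_ID, SAMID) pairs that reference an unknown identifier.

    Assumption: every ACT_ID and SAMID in the mapping rows should also occur
    in the activity rows and the supplementary rows respectively.
    """
    act_ids = set(known_act_ids)
    samids = set(known_samids)
    return sorted(
        {
            (act_id, samid)
            for act_id, samid in supp_map_rows
            if act_id not in act_ids or samid not in samids
        }
    )


problems = unmatched_keys(supp_map, activity_act_ids, activity_supp_samids)
print(problems or "all ACT_ID and SAMID references resolve")
```

Identifiers are compared as strings, so a value such as `1` read as an integer from one file and as text from another would need to be normalised first.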
REFERENCE.tsv
RIDX | Pathogen_Box_Bloggs
PUBMED_ID
DOI | 10.1016/S0960-894X(01)81052-0
PATENT_ID
JOURNAL_NAME
YEAR | 2011
VOLUME
ISSUE
FIRST_PAGE
LAST_PAGE
REF_TYPE | Dataset
TITLE | Pathogen Box Compounds Screening
AUTHORS | Bloggs, J; Smith, M
ABSTRACT | 400 compounds from the Pathogen box were screened for inhibitory activity against two human proteins.