# RIDX

## RIDX Requirements

* Depositor-defined Reference IDs (RIDXs) are defined in the REFERENCE deposition file.
* An RIDX may refer to the results from a given publication, or a single unpublished dataset. &#x20;
* An RIDX should be a string that is meaningful to the depositor.
* RIDXs must be unique within a source. Where a source contains submissions from multiple datasets or sites, using a unique RIDX for each one will make it possible to distinguish between them in ChEMBL. &#x20;
* An entry in the REFERENCE file may include URLs, or simply text descriptions of the collection of data.
* RIDXs map 1:1 to an internal DOC\_ID field. This is enforced by DOC\_ID being the Primary Key on the DOCS table.  A DOC\_ID can never be shared between SRC\_IDs.
* **RIDX is referred to as CRIDX in the ACTIVITIES file.**  It should be the same as the depositor defined identifier RIDX in the REFERENCE, COMPOUND\_RECORD and ASSAY files; this is for historical programmatic reasons.

#### Suggested format:&#x20;

Smith\_KinaseXYZ\_wave1\
(abbreviation of the group leader, subject area, specific identifier)

This RIDX is easy to differentiate from others, and easy to track through the deposition files. It makes any data issues during loading easier to resolve, and makes your data easier to search when loaded.

## **Relations to compounds and assays**

* Each Compound IDX (CIDX) or Assay IDX (AIDX) defined by a depositor can only be assigned to a SINGLE RIDX. The same RIDX may be assigned to multiple CIDXs and AIDXs.&#x20;
* A depositor may only assign RIDXs defined by themselves, and may only assign them to their own CIDXs and AIDXs.&#x20;

## Default DOC\_IDs and RIDXs <a href="#default_doc_ids" id="default_doc_ids"></a>

All SRC\_IDs are associated with a single 'default' DOC\_ID, which is created alongside the new source. The DOCS table requires that a unique constraint of a ‘SRC\_ID / RIDX’ combination be enforced. Because multi-field unique constraints in Oracle cannot accept ‘null’ in any field, the string ‘default’ is used for the RIDX field when no RIDX has been specified.&#x20;

If a depositor decides to not associate an RIDX to one of their CIDXs, AIDXs or Activities by leaving the field blank, the loader will automatically assign the default RIDX and DOC\_ID for this SRC\_ID. Using an RIDX which does not match any other RIDX defined in the Source would have the same effect as leaving the RIDX blank. The default DOC\_ID can be manually edited by the ChEMBL administrator in consultation with the depositor if this was the result of a data error rather than intentional. &#x20;

Attempting to create or edit an RIDX called ‘default’ will be met with an error by the loader. Its use to confirm that the Activity data set should be assigned to ‘default’ (by putting ‘default’ in the RIDX field in the ACTIVITY file) is permitted. &#x20;


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://chembl.gitbook.io/chembl-data-deposition-guide/deposition-overview/depositor-defined-identifiers/ridx.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
