UniChem 2.0

Reasons for data omission


Last updated 2 years ago


During the loading process, UniChem omits some records. A record is not loaded if any of the following applies:

| Rule | Description |
|------|-------------|
| 1 | There is a mismatch between the InChI and the InChIKey. (*) |
| 2 | The source does not provide a Standard InChI. (**) |
| 3 | UniChem cannot generate an InChIKey from the Standard InChI provided by the source. |
| 4 | The source does not provide an ID for the structure. |
| 5 | The Standard InChI supplied is more than 2000 characters long. |
| 6 | The auxiliary data is absent, or more than 4000 characters long. (***) |

* = In other words, the InChIKey that UniChem calculates from the InChI provided by the source does not exactly match the InChIKey provided by the source. Note, however, that if the InChIKey is completely absent (i.e. not provided by the source), UniChem calculates an InChIKey and loads the record with this calculated key (although UniChem notes the absence in a 'comment').

** = However, if the source provides only Molfiles rather than Standard InChIs, UniChem generates the Standard InChIs instead. Currently, this has been done for only a limited number of sources (see the sources page for which ones).

*** = This applies only to the limited number of sources that require auxiliary information. Go here to find out what auxiliary information is.
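Data providers who want to pre-check a file before submission might find it useful to see the six rules expressed as code. The following is a minimal sketch, not UniChem's actual loader: the `Record` fields, the `omission_rule` function, the `needs_aux` flag, and the order in which the rules are checked are all assumptions made for illustration. The InChIKey generator is passed in as a callable (`make_key`) so that any InChI toolkit can be plugged in, and the Molfile fallback described in the second footnote is not modelled.

```python
from dataclasses import dataclass
from typing import Callable, Optional

MAX_INCHI_LEN = 2000  # rule 5 limit, taken from the table above
MAX_AUX_LEN = 4000    # rule 6 limit, taken from the table above


@dataclass
class Record:
    """Hypothetical shape of one submitted record (field names are assumptions)."""
    src_compound_id: Optional[str]
    inchi: Optional[str]
    inchikey: Optional[str] = None
    aux_data: Optional[str] = None


def omission_rule(
    record: Record,
    make_key: Callable[[str], Optional[str]],
    needs_aux: bool = False,
) -> Optional[int]:
    """Return the number of the first rule the record breaks, or None if it passes.

    make_key is any function that turns an InChI into an InChIKey and
    returns None (or an empty string) on failure. The checking order is
    an assumption; the table does not specify one.
    """
    # Rule 4: the source must supply an ID for the structure.
    if not record.src_compound_id:
        return 4
    # Rule 2: a Standard InChI is required; these start with "InChI=1S/".
    if not record.inchi or not record.inchi.startswith("InChI=1S/"):
        return 2
    # Rule 5: the Standard InChI must not exceed 2000 characters.
    if len(record.inchi) > MAX_INCHI_LEN:
        return 5
    # Rule 3: an InChIKey must be derivable from the InChI.
    calculated = make_key(record.inchi)
    if not calculated:
        return 3
    # Rule 1: a supplied key must match the calculated one exactly.
    # An absent key is not an error; the calculated key would be used.
    if record.inchikey and record.inchikey != calculated:
        return 1
    # Rule 6: only for sources that require auxiliary data.
    if needs_aux and (not record.aux_data or len(record.aux_data) > MAX_AUX_LEN):
        return 6
    return None
```

A caller would run `omission_rule(record, make_key)` on each record and set aside any record for which the result is not `None`, using the returned number to look up the reason in the table.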