Structure

Physi-Chem Properties

Molecular Weight:  151.99
Volume:  122.839
LogP:  0.144
LogD:  2.787
LogS:  0.136
# Rotatable Bonds:  3
TPSA:  74.6
# H-Bond Aceptor:  4
# H-Bond Donor:  2
# Rings:  0
# Heavy Atoms:  5

MedChem Properties

QED Drug-Likeness Score:  0.565
Synthetic Accessibility Score:  3.026
Fsp3:  0.5
Lipinski Rule-of-5:  Accepted
Pfizer Rule:  Accepted
GSK Rule:  Accepted
BMS Rule:  2
Golden Triangle Rule:  Rejected
Chelating Alert:  0
PAINS Alert:  0

ADMET Properties (ADMETlab2.0)

ADMET: Absorption

Caco-2 Permeability:  -5.919
MDCK Permeability:  0.004402408376336098
Pgp-inhibitor:  0.0
Pgp-substrate:  0.002
Human Intestinal Absorption (HIA):  0.005
20% Bioavailability (F20%):  0.002
30% Bioavailability (F30%):  0.001

ADMET: Distribution

Blood-Brain-Barrier Penetration (BBB):  0.201
Plasma Protein Binding (PPB):  47.17683029174805%
Volume Distribution (VD):  0.219
Pgp-substrate:  45.79224395751953%

ADMET: Metabolism

CYP1A2-inhibitor:  0.005
CYP1A2-substrate:  0.062
CYP2C19-inhibitor:  0.024
CYP2C19-substrate:  0.048
CYP2C9-inhibitor:  0.003
CYP2C9-substrate:  0.905
CYP2D6-inhibitor:  0.01
CYP2D6-substrate:  0.115
CYP3A4-inhibitor:  0.007
CYP3A4-substrate:  0.004

ADMET: Excretion

Clearance (CL):  6.033
Half-life (T1/2):  0.887

ADMET: Toxicity

hERG Blockers:  0.004
Human Hepatotoxicity (H-HT):  0.239
Drug-inuced Liver Injury (DILI):  0.429
AMES Toxicity:  0.012
Rat Oral Acute Toxicity:  0.193
Maximum Recommended Daily Dose:  0.008
Skin Sensitization:  0.399
Carcinogencity:  0.036
Eye Corrosion:  0.998
Eye Irritation:  0.99
Respiratory Toxicity:  0.147

Download Data

Data Type Select
General Info & Identifiers & Properties  
Structure MOL file  
Source Organisms  
Biological Activities  
Similar NPs/Drugs  

  Natural Product: NPC321569

Natural Product ID:  NPC321569
Common Name*:   2-Chlorobutanedioic Acid
IUPAC Name:   2-chlorobutanedioic acid
Synonyms:  
Standard InCHIKey:  QEGKXSHUKXMDRW-UHFFFAOYSA-N
Standard InCHI:  InChI=1S/C4H5ClO4/c5-2(4(8)9)1-3(6)7/h2H,1H2,(H,6,7)(H,8,9)
SMILES:  C(C(C(=O)O)Cl)C(=O)O
Synthetic Gene Cluster:   n.a.
ChEMBL Identifier:   CHEMBL3186215
PubChem CID:   27655
Chemical Classification**:  
  • CHEMONTID:0000000 [Organic compounds]
    • [CHEMONTID:0000012] Lipids and lipid-like molecules
      • [CHEMONTID:0003909] Fatty Acyls
        • [CHEMONTID:0000262] Fatty acids and conjugates
          • [CHEMONTID:0000483] Halogenated fatty acids

*Note: the InCHIKey will be temporarily assigned as the "Common Name" if no IUPAC name or alternative short name is available.
**Note: the Chemical Classification was calculated by NPClassifier Version 1.5. Reference: PMID:34662515.

  Species Source

Organism ID Organism Name Taxonomy Level Family SuperKingdom Isolation Part Collection Location Collection Time Reference
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. DOI[10.1172/JCI16309]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. DOI[10.1371/journal.pone.0115359]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[10557354]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[11034610]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[11419736]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[11530998]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[1175644]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[12391014]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[12812989]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[12840027]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[12878451]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[15084647]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[1521032]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[15230696]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[15314235]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[16112079]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[16770722]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[1687010]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[17116739]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[17190852]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[17875433]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[18311922]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[18544912]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[18799520]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[19425150]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[19961175]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[20506249]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[20601097]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. faeces n.a. PMID[20708442]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. bile n.a. PMID[20708442]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. urine n.a. PMID[20708442]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[20876113]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[21798258]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[2268561]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[22711758]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23315938]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23717534]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23752203]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23810710]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23811455]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23868375]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23919613]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[24101735]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[24399466]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[24494566]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[24558969]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[24816727]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[25114169]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[25181601]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[25293588]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[25644343]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[26236990]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[27471436]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[3179836]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[347637]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[4696527]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[5432584]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[6121420]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[6780563]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[8600370]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[8987136]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[9192820]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[9800648]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. Database[MetaboLights]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. Database[UNPD]

☑ Note for Reference:
In addition to directly collecting NP source organism data from primary literature (where reference will provided as NCBI PMID or DOI links), NPASS also integrated them from below databases:
UNPD: Universal Natural Products Database [PMID: 23638153].
StreptomeDB: a database of streptomycetes natural products [PMID: 33051671].
TM-MC: a database of medicinal materials and chemical compounds in Northeast Asian traditional medicine [PMID: 26156871].
TCM@Taiwan: a Traditional Chinese Medicine database [PMID: 21253603].
TCMID: a Traditional Chinese Medicine database [PMID: 29106634].
TCMSP: The traditional Chinese medicine systems pharmacology database and analysis platform [PMID: 24735618].
HerDing: a herb recommendation system to treat diseases using genes and chemicals [PMID: 26980517].
MetaboLights: a metabolomics database [PMID: 27010336].
FooDB: a database of constituents, chemistry and biology of food species [www.foodb.ca].

  NP Quantity Composition/Concentration

Organism ID NP ID Organism Material Preparation Organism Part NP Quantity (Standard) NP Quantity (Minimum) NP Quantity (Maximum) Quantity Unit Reference

☑ Note for Reference:
In addition to directly collecting NP quantitative data from primary literature (where reference will provided as NCBI PMID or DOI links), NPASS also integrated NP quantitative records for specific NP domains (e.g., NPS from foods or herbs) from domain-specific databases. These databases include:
DUKE: Dr. Duke's Phytochemical and Ethnobotanical Databases.
PHENOL EXPLORER: is the first comprehensive database on polyphenol content in foods [PMID: 24103452], its homepage can be accessed at here.
FooDB: a database of constituents, chemistry and biology of food species [www.foodb.ca].

  Biological Activity

Target ID Target Type Target Name Target Organism Activity Type Activity Relation Value Unit Reference
NPT152 Individual Protein Nuclear factor erythroid 2-related factor 2 Homo sapiens Potency n.a. 2761.2 nM PubChem BioAssay data set
NPT2 Others Unspecified Potency n.a. 69357.6 nM PubChem BioAssay data set
NPT2 Others Unspecified Potency n.a. 3098.1 nM PubChem BioAssay data set
NPT2 Others Unspecified Potency n.a. 156.6 nM PubChem BioAssay data set
NPT2 Others Unspecified Potency n.a. 109.9 nM PubChem BioAssay data set
NPT2 Others Unspecified Potency n.a. 22116.2 nM PubChem BioAssay data set
NPT2 Others Unspecified Potency n.a. 43761.7 nM PubChem BioAssay data set

☑ Note for Activity Records:
☉ The quantitative biological activities were primarily integrated from ChEMBL (Version-30) database and were also directly collected from PubMed literature. PubMed PMID was provided as the reference link for each activity record.

  Chemically structural similarity: I. Similar Active Natural Products in NPASS

Top-200 similar NPs were calculated against the active-NP-set (includes 4,3285 NPs with experimentally-derived bioactivity available in NPASS)

Similarity level is defined by Tanimoto coefficient (Tc) between two molecules. Tc lies between [0, 1] where '1' indicates the highest similarity. What is Tanimoto coefficient

●  The left chart: Distribution of similarity level between NPC321569 and all remaining natural products in the NPASS database.
●  The right table: Most similar natural products (Tc>=0.56 or Top200).

Similarity Score Similarity Level Natural Product ID
0.7647 Intermediate Similarity NPC236709
0.6471 Remote Similarity NPC158994
0.6341 Remote Similarity NPC1037
0.6316 Remote Similarity NPC198126
0.6111 Remote Similarity NPC149209
0.6111 Remote Similarity NPC278758
0.6053 Remote Similarity NPC180423
0.5854 Remote Similarity NPC7814
0.5814 Remote Similarity NPC317945
0.5778 Remote Similarity NPC240109
0.5714 Remote Similarity NPC316685
0.5682 Remote Similarity NPC128713
0.5652 Remote Similarity NPC121018
0.5652 Remote Similarity NPC19044
0.5652 Remote Similarity NPC24751
0.5652 Remote Similarity NPC100742
0.5652 Remote Similarity NPC192402
0.5652 Remote Similarity NPC97444
0.5641 Remote Similarity NPC174368
0.5641 Remote Similarity NPC122768
0.5641 Remote Similarity NPC61066
0.5641 Remote Similarity NPC104195
0.5641 Remote Similarity NPC292641
0.5641 Remote Similarity NPC151140
0.561 Remote Similarity NPC125575
0.561 Remote Similarity NPC108238
0.561 Remote Similarity NPC18224
0.561 Remote Similarity NPC328710

  Chemically structural similarity: II. Similar Clinical/Approved Drugs

Similarity level is defined by Tanimoto coefficient (Tc) between two molecules.

●  The left chart: Distribution of similarity level between NPC321569 and all drugs/candidates.
●  The right table: Most similar clinical/approved drugs (Tc>=0.56 or Top200).

Similarity Score Similarity Level Drug ID Developmental Stage
0.7647 Intermediate Similarity NPD8594 Approved
0.7647 Intermediate Similarity NPD8593 Approved
0.6765 Remote Similarity NPD8592 Approved
0.6765 Remote Similarity NPD8591 Approved
0.65 Remote Similarity NPD8596 Approved
0.6486 Remote Similarity NPD8619 Approved
0.6486 Remote Similarity NPD8617 Approved
0.6341 Remote Similarity NPD8597 Approved
0.6316 Remote Similarity NPD8618 Approved
0.5854 Remote Similarity NPD8857 Approved
0.575 Remote Similarity NPD8595 Approved
0.5652 Remote Similarity NPD8600 Approved
0.5652 Remote Similarity NPD8605 Approved
0.5652 Remote Similarity NPD8602 Approved
0.5652 Remote Similarity NPD8601 Approved
0.5652 Remote Similarity NPD8599 Approved
0.5652 Remote Similarity NPD8604 Approved
0.5652 Remote Similarity NPD8603 Approved
0.5652 Remote Similarity NPD8598 Approved

  Bioactivity similarity: Similar Natural Products in NPASS

Bioactivity similarity was calculated based on bioactivity descriptors of compounds. The bioactivity descriptors were calculated by a recently developed AI algorithm Chemical Checker (CC) [Nature Biotechnology, 38:1087–1096, 2020; Nature Communications, 12:3932, 2021], which evaluated bioactivity similarities at five levels:
A: chemistry similarity;
B: biological targets similarity;
C: networks similarity;
D: cell-based bioactivity similarity;
E: similarity based on clinical data.

Those 5 categories of CC bioactivity descriptors were calculated and then subjected to manifold projection using UMAP algorithm, to project all NPs on a 2-Dimensional space. The current NP was highlighted with a small circle in the 2-D map. Below figures: left-to-right, A-to-E.

A: chemistry similarity
B: biological targets similarity
C: networks similarity
D: cell-based bioactivity similarity
E: similarity based on clinical data