Structure

Physicochemical Properties

Molecular Weight:  176.04
Volume:  153.072
LogP:  -1.977
LogD:  -0.747
LogS:  -1.122
# Rotatable Bonds:  5
TPSA:  129.72
# H-Bond Acceptors:  7
# H-Bond Donors:  5
# Rings:  0
# Heteroatoms:  7

MedChem Properties

QED Drug-Likeness Score:  0.417
Synthetic Accessibility Score:  2.401
Fsp3:  0.4
Lipinski Rule-of-5:  Accepted
Pfizer Rule:  Accepted
GSK Rule:  Accepted
BMS Rule:  0
Golden Triangle Rule:  Rejected
Chelating Alert:  0
PAINS Alert:  0
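
The Lipinski verdict above can be cross-checked against the physicochemical values listed earlier. The sketch below is illustrative only: it uses the classic Rule-of-5 thresholds, and the cut-off of "fewer than two violations" for an "Accepted" verdict is an assumption here, not something this record states. The names `props` and `lipinski_violations` are hypothetical.

```python
# Values copied from the property tables in this record.
props = {"mw": 176.04, "logp": -1.977, "hba": 7, "hbd": 5}


def lipinski_violations(p):
    """Count violations of the classic Rule-of-5 thresholds."""
    checks = [
        p["mw"] > 500,   # molecular weight above 500
        p["logp"] > 5,   # LogP above 5
        p["hba"] > 10,   # more than 10 H-bond acceptors
        p["hbd"] > 5,    # more than 5 H-bond donors
    ]
    return sum(checks)


violations = lipinski_violations(props)
# Assumption: fewer than two violations is reported as "Accepted".
verdict = "Accepted" if violations < 2 else "Rejected"
print(violations, verdict)
```

With the values in this record, no threshold is exceeded, which is consistent with the "Accepted" entry.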

ADMET Properties (ADMETlab2.0)

ADMET: Absorption

Caco-2 Permeability:  -6.441
MDCK Permeability:  0.0032203218434005976
Pgp-inhibitor:  0.0
Pgp-substrate:  0.184
Human Intestinal Absorption (HIA):  0.015
20% Bioavailability (F20%):  0.032
30% Bioavailability (F30%):  0.076

ADMET: Distribution

Blood-Brain-Barrier Penetration (BBB):  0.408
Plasma Protein Binding (PPB):  12.776578903198242%
Volume Distribution (VD):  0.277
Pgp-substrate:  77.0459213256836%

ADMET: Metabolism

CYP1A2-inhibitor:  0.002
CYP1A2-substrate:  0.031
CYP2C19-inhibitor:  0.037
CYP2C19-substrate:  0.036
CYP2C9-inhibitor:  0.031
CYP2C9-substrate:  0.781
CYP2D6-inhibitor:  0.024
CYP2D6-substrate:  0.124
CYP3A4-inhibitor:  0.011
CYP3A4-substrate:  0.002

ADMET: Excretion

Clearance (CL):  2.77
Half-life (T1/2):  0.704

ADMET: Toxicity

hERG Blockers:  0.004
Human Hepatotoxicity (H-HT):  0.051
Drug-induced Liver Injury (DILI):  0.687
AMES Toxicity:  0.019
Rat Oral Acute Toxicity:  0.004
Maximum Recommended Daily Dose:  0.005
Skin Sensitization:  0.279
Carcinogenicity:  0.034
Eye Corrosion:  0.003
Eye Irritation:  0.087
Respiratory Toxicity:  0.028

  Natural Product: NPC322206

Natural Product ID:  NPC322206
Common Name*:   2-(Carbamoylamino)Butanedioic Acid
IUPAC Name:   2-(carbamoylamino)butanedioic acid
Synonyms:  
Standard InChIKey:  HLKXYZVTANABHZ-UHFFFAOYSA-N
Standard InChI:  InChI=1S/C5H8N2O5/c6-5(12)7-2(4(10)11)1-3(8)9/h2H,1H2,(H,8,9)(H,10,11)(H3,6,7,12)
SMILES:  C(C(C(=O)O)NC(=O)N)C(=O)O
Synthetic Gene Cluster:   n.a.
ChEMBL Identifier:   CHEMBL1161506
PubChem CID:   279
Chemical Classification**:  
  • [CHEMONTID:0000000] Organic compounds
    • [CHEMONTID:0000264] Organic acids and derivatives
      • [CHEMONTID:0000265] Carboxylic acids and derivatives
        • [CHEMONTID:0000013] Amino acids, peptides, and analogues
          • [CHEMONTID:0000347] Amino acids and derivatives
            • [CHEMONTID:0000060] Alpha amino acids and derivatives
              • [CHEMONTID:0004317] Aspartic acid and derivatives

*Note: the InChIKey will be temporarily assigned as the "Common Name" if no IUPAC name or alternative short name is available.
**Note: the Chemical Classification was calculated by NPClassifier Version 1.5. Reference: PMID:34662515.
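
The identifiers above are enough to recover the elemental composition. As a minimal sketch (not part of NPASS), the snippet below reads the molecular formula from the formula layer of the Standard InChI and derives atom counts and a monoisotopic mass. The helper name `formula_counts` and the `MONOISOTOPIC` table are hypothetical; the isotope masses are standard values rounded to eight decimals.

```python
import re

# Standard InChI copied from this record.
inchi = ("InChI=1S/C5H8N2O5/c6-5(12)7-2(4(10)11)1-3(8)9/"
         "h2H,1H2,(H,8,9)(H,10,11)(H3,6,7,12)")

# Monoisotopic masses of the most abundant isotopes (only elements needed here).
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.00307401, "O": 15.99491462}


def formula_counts(inchi_string):
    """Return {element: count} parsed from the InChI formula layer."""
    formula = inchi_string.split("/")[1]  # e.g. "C5H8N2O5"
    pairs = re.findall(r"([A-Z][a-z]?)(\d*)", formula)
    return {element: int(n or 1) for element, n in pairs}


counts = formula_counts(inchi)
heavy_atoms = sum(n for el, n in counts.items() if el != "H")
heteroatoms = sum(n for el, n in counts.items() if el not in ("C", "H"))
mono_mass = sum(MONOISOTOPIC[el] * n for el, n in counts.items())

print(counts, heavy_atoms, heteroatoms, round(mono_mass, 2))
```

For this compound the formula gives 12 non-hydrogen atoms, 7 of which are heteroatoms (N and O), and a monoisotopic mass that rounds to the 176.04 listed under Molecular Weight.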

  Species Source

Organism ID Organism Name Taxonomy Level Family SuperKingdom Isolation Part Collection Location Collection Time Reference
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. DOI[10.1172/JCI16309]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. DOI[10.1371/journal.pone.0115359]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[10557354]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[11034610]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[11419736]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[11530998]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[1175644]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[12391014]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[12812989]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[12840027]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[12878451]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[15084647]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[1521032]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[15230696]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[15314235]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[16112079]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[16770722]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[1687010]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[17116739]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[17190852]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[17875433]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[18311922]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[18544912]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[18799520]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[19425150]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[19961175]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[20506249]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[20601097]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. faeces n.a. PMID[20708442]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. bile n.a. PMID[20708442]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. urine n.a. PMID[20708442]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[20876113]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[21798258]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[2268561]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[22711758]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23315938]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23717534]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23752203]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23810710]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23811455]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23868375]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[23919613]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[24101735]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[24399466]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[24494566]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[24558969]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[24816727]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[25114169]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[25181601]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[25293588]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[25644343]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[26236990]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[27471436]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[3179836]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[347637]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[4696527]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[5432584]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[6121420]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[6780563]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[8600370]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[8987136]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[9192820]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. PMID[9800648]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. Database[MetaboLights]
NPO20338 Mus musculus Species Muridae Eukaryota n.a. n.a. n.a. Database[UNPD]

☑ Note for Reference:
In addition to directly collecting NP source organism data from primary literature (where references are provided as NCBI PMID or DOI links), NPASS also integrated them from the databases below:
UNPD: Universal Natural Products Database [PMID: 23638153].
StreptomeDB: a database of streptomycetes natural products [PMID: 33051671].
TM-MC: a database of medicinal materials and chemical compounds in Northeast Asian traditional medicine [PMID: 26156871].
TCM@Taiwan: a Traditional Chinese Medicine database [PMID: 21253603].
TCMID: a Traditional Chinese Medicine database [PMID: 29106634].
TCMSP: The traditional Chinese medicine systems pharmacology database and analysis platform [PMID: 24735618].
HerDing: a herb recommendation system to treat diseases using genes and chemicals [PMID: 26980517].
MetaboLights: a metabolomics database [PMID: 27010336].
FooDB: a database of constituents, chemistry and biology of food species [www.foodb.ca].

  NP Quantity Composition/Concentration

Organism ID NP ID Organism Material Preparation Organism Part NP Quantity (Standard) NP Quantity (Minimum) NP Quantity (Maximum) Quantity Unit Reference

☑ Note for Reference:
In addition to directly collecting NP quantitative data from primary literature (where references are provided as NCBI PMID or DOI links), NPASS also integrated NP quantitative records for specific NP domains (e.g., NPs from foods or herbs) from domain-specific databases. These databases include:
DUKE: Dr. Duke's Phytochemical and Ethnobotanical Databases.
PHENOL EXPLORER: the first comprehensive database on polyphenol content in foods [PMID: 24103452].
FooDB: a database of constituents, chemistry and biology of food species [www.foodb.ca].

  Biological Activity

Target ID Target Type Target Name Target Organism Activity Type Activity Relation Value Unit Reference
NPT2672 Individual Protein Dihydroorotase Homo sapiens Km = 50000.0 nM PMID[525584]
NPT367 Cell Line MDA-N Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT368 Cell Line SN12C Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT369 Cell Line ACHN Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT370 Cell Line NCI-H23 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT372 Cell Line HOP-92 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT371 Cell Line UO-31 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT90 Cell Line DU-145 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT373 Cell Line SK-MEL-5 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT111 Cell Line K562 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT375 Cell Line Malme-3M Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT376 Cell Line A498 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT377 Cell Line OVCAR-3 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT112 Cell Line MOLT-4 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT378 Cell Line NCI/ADR-RES Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT379 Cell Line HOP-62 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT381 Cell Line OVCAR-8 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT380 Cell Line U-251 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT383 Cell Line SNB-19 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT382 Cell Line OVCAR-5 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT82 Cell Line MDA-MB-231 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT384 Cell Line TK-10 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT385 Cell Line SR Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT455 Cell Line NCI-H522 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT323 Cell Line SW-620 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT386 Cell Line KM12 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT387 Cell Line M14 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT388 Cell Line NCI-H322M Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT389 Cell Line RPMI-8226 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT456 Cell Line OVCAR-4 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT390 Cell Line LOX IMVI Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT457 Cell Line BT-549 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT147 Cell Line SK-MEL-2 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT391 Cell Line HCC 2998 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT81 Cell Line A549 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT392 Cell Line SNB-75 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT148 Cell Line HCT-15 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT393 Cell Line HCT-116 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT395 Cell Line SF-268 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT394 Cell Line EKVX Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT306 Cell Line PC-3 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT397 Cell Line NCI-H460 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT396 Cell Line T47D Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT398 Cell Line UACC-62 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT308 Cell Line CAKI-1 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT399 Cell Line SF-295 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT400 Cell Line MDA-MB-435 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT458 Cell Line IGROV-1 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT402 Cell Line Hs-578T Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT401 Cell Line 786-0 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT403 Cell Line UACC-257 Homo sapiens GI50 n.a. 93325.43 nM PMID[525585]
NPT404 Cell Line CCRF-CEM Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT139 Cell Line HT-29 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT405 Cell Line NCI-H226 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT170 Cell Line SK-MEL-28 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT406 Cell Line RXF 393 Homo sapiens GI50 n.a. 100000.0 nM PMID[525585]
NPT20555 ORGANISM SARS-CoV-2 Severe acute respiratory syndrome coronavirus 2 IC50 > 20000.0 nM PMID[525586]
NPT20555 ORGANISM SARS-CoV-2 Severe acute respiratory syndrome coronavirus 2 IC50 > 19952.62 nM PMID[525586]

☑ Note for Activity Records:
☉ The quantitative biological activities were primarily integrated from ChEMBL (Version-30) database and were also directly collected from PubMed literature. PubMed PMID was provided as the reference link for each activity record.
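
The activity values in the table are reported in nM. For comparison across records it can help to convert them to µM or to a negative-log molar scale (e.g. pGI50). The sketch below is an illustrative conversion only; the function names are hypothetical, and nothing here is stated by NPASS about how the values were originally derived.

```python
import math


def nm_to_um(value_nm):
    """Convert a concentration from nM to µM."""
    return value_nm / 1000.0


def p_activity(value_nm):
    """Negative base-10 logarithm of the concentration in mol/L."""
    return -math.log10(value_nm * 1e-9)


# Values copied from the Biological Activity table above.
for value_nm in (100000.0, 93325.43, 19952.62):
    print(value_nm, nm_to_um(value_nm), round(p_activity(value_nm), 2))
```

On this scale the 100000.0 nM entries correspond to 100 µM, i.e. a p-value of 4.0.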

  Chemically structural similarity: I. Similar Active Natural Products in NPASS

Top-200 similar NPs were calculated against the active-NP set (which includes 43,285 NPs with experimentally derived bioactivity data available in NPASS).

Similarity level is defined by the Tanimoto coefficient (Tc) between two molecules. Tc lies in the range [0, 1], where 1 indicates the highest similarity.
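
As a minimal sketch of how a Tanimoto coefficient is computed, the snippet below treats each molecule as a set of "on" fingerprint bits and divides the size of the intersection by the size of the union. The two bit sets are made-up examples, not real fingerprints of NPC322206 or any NPASS entry, and the fingerprint type used by NPASS is not specified in this record.

```python
def tanimoto(bits_a, bits_b):
    """Tanimoto coefficient between two collections of on-bit indices."""
    a, b = set(bits_a), set(bits_b)
    union = a | b
    if not union:
        # Convention chosen for this sketch: two empty fingerprints score 0.
        return 0.0
    return len(a & b) / len(union)


# Hypothetical on-bit indices for two molecules.
fp_query = {1, 4, 7, 9, 15}
fp_other = {1, 4, 9, 15, 22, 30}

print(round(tanimoto(fp_query, fp_other), 4))  # 4 shared bits / 7 total bits
```

Identical bit sets give Tc = 1, and sets with no bits in common give Tc = 0.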

●  The left chart: Distribution of similarity level between NPC322206 and all remaining natural products in the NPASS database.
●  The right table: Most similar natural products (Tc>=0.56 or Top200).

Similarity Score Similarity Level Natural Product ID
0.8276 Intermediate Similarity NPC10915
0.7846 Intermediate Similarity NPC325534
0.75 Intermediate Similarity NPC118429
0.7458 Intermediate Similarity NPC202525
0.7377 Intermediate Similarity NPC327831
0.7321 Intermediate Similarity NPC285322
0.7321 Intermediate Similarity NPC208793
0.7213 Intermediate Similarity NPC322091
0.7213 Intermediate Similarity NPC60672
0.7049 Intermediate Similarity NPC245768
0.7015 Intermediate Similarity NPC103130
0.7015 Intermediate Similarity NPC226453
0.6949 Remote Similarity NPC153370
0.6912 Remote Similarity NPC278881
0.6857 Remote Similarity NPC81647
0.6842 Remote Similarity NPC198301
0.6812 Remote Similarity NPC327985
0.678 Remote Similarity NPC136159
0.6716 Remote Similarity NPC273037
0.6667 Remote Similarity NPC227850
0.6567 Remote Similarity NPC177191
0.6552 Remote Similarity NPC40511
0.6528 Remote Similarity NPC133183
0.6351 Remote Similarity NPC328457
0.6349 Remote Similarity NPC200550
0.6349 Remote Similarity NPC155156
0.6333 Remote Similarity NPC195448
0.6333 Remote Similarity NPC309658
0.6316 Remote Similarity NPC326212
0.6316 Remote Similarity NPC237525
0.6316 Remote Similarity NPC121517
0.6316 Remote Similarity NPC66043
0.6316 Remote Similarity NPC168375
0.6316 Remote Similarity NPC326992
0.6269 Remote Similarity NPC245346
0.6269 Remote Similarity NPC11433
0.6269 Remote Similarity NPC302003
0.625 Remote Similarity NPC221764
0.625 Remote Similarity NPC196359
0.625 Remote Similarity NPC135539
0.625 Remote Similarity NPC78312
0.6176 Remote Similarity NPC64250
0.6176 Remote Similarity NPC268927
0.6176 Remote Similarity NPC276928
0.6167 Remote Similarity NPC329263
0.6164 Remote Similarity NPC107224
0.6154 Remote Similarity NPC315780
0.6102 Remote Similarity NPC291186
0.6102 Remote Similarity NPC167986
0.6066 Remote Similarity NPC297220
0.6053 Remote Similarity NPC185084
0.6032 Remote Similarity NPC328378
0.6029 Remote Similarity NPC112224
0.6029 Remote Similarity NPC327895
0.6029 Remote Similarity NPC43169
0.6029 Remote Similarity NPC93861
0.6026 Remote Similarity NPC122471
0.6 Remote Similarity NPC132307
0.6 Remote Similarity NPC325097
0.6 Remote Similarity NPC126925
0.597 Remote Similarity NPC325985
0.597 Remote Similarity NPC313263
0.5938 Remote Similarity NPC125736
0.5909 Remote Similarity NPC321118
0.5909 Remote Similarity NPC316889
0.5902 Remote Similarity NPC327698
0.5902 Remote Similarity NPC118459
0.5882 Remote Similarity NPC278209
0.5873 Remote Similarity NPC140872
0.5873 Remote Similarity NPC93081
0.5806 Remote Similarity NPC185755
0.5806 Remote Similarity NPC136476
0.5806 Remote Similarity NPC213876
0.5806 Remote Similarity NPC49952
0.5789 Remote Similarity NPC216443
0.5781 Remote Similarity NPC17244
0.5781 Remote Similarity NPC316231
0.5781 Remote Similarity NPC112890
0.5781 Remote Similarity NPC324825
0.5763 Remote Similarity NPC53449
0.5758 Remote Similarity NPC197087
0.5758 Remote Similarity NPC190184
0.5758 Remote Similarity NPC329495
0.5735 Remote Similarity NPC38463
0.5714 Remote Similarity NPC316168
0.5714 Remote Similarity NPC321536
0.5692 Remote Similarity NPC315977
0.5692 Remote Similarity NPC137958
0.5692 Remote Similarity NPC273330
0.5682 Remote Similarity NPC320057
0.5652 Remote Similarity NPC283786
0.5634 Remote Similarity NPC143722
0.5616 Remote Similarity NPC315744
0.5614 Remote Similarity NPC116709
0.5614 Remote Similarity NPC272614
0.5614 Remote Similarity NPC21290

  Chemically structural similarity: II. Similar Clinical/Approved Drugs

Similarity level is defined by Tanimoto coefficient (Tc) between two molecules.

●  The left chart: Distribution of similarity level between NPC322206 and all drugs/candidates.
●  The right table: Most similar clinical/approved drugs (Tc>=0.56 or Top200).

Similarity Score Similarity Level Drug ID Developmental Stage
0.7846 Intermediate Similarity NPD8952 Approved
0.75 Intermediate Similarity NPD9014 Approved
0.7321 Intermediate Similarity NPD8610 Approved
0.7015 Intermediate Similarity NPD9045 Approved
0.7015 Intermediate Similarity NPD9046 Phase 3
0.7015 Intermediate Similarity NPD9047 Approved
0.7015 Intermediate Similarity NPD9048 Approved
0.6949 Remote Similarity NPD8614 Approved
0.6849 Remote Similarity NPD366 Approved
0.6786 Remote Similarity NPD8609 Approved
0.6714 Remote Similarity NPD9233 Phase 3
0.6714 Remote Similarity NPD9232 Phase 2
0.6714 Remote Similarity NPD9231 Phase 3
0.662 Remote Similarity NPD9451 Clinical (unspecified phase)
0.65 Remote Similarity NPD4829 Discontinued
0.6389 Remote Similarity NPD9452 Clinical (unspecified phase)
0.6351 Remote Similarity NPD5382 Phase 2
0.6333 Remote Similarity NPD8621 Approved
0.6269 Remote Similarity NPD8865 Approved
0.625 Remote Similarity NPD8868 Approved
0.6212 Remote Similarity NPD8864 Clinical (unspecified phase)
0.6102 Remote Similarity NPD8804 Approved
0.6102 Remote Similarity NPD8805 Approved
0.6027 Remote Similarity NPD1429 Clinical (unspecified phase)
0.6 Remote Similarity NPD8798 Approved
0.5909 Remote Similarity NPD8545 Approved
0.5873 Remote Similarity NPD8808 Approved
0.5873 Remote Similarity NPD8809 Approved
0.5806 Remote Similarity NPD8982 Approved
0.5797 Remote Similarity NPD9433 Approved
0.5781 Remote Similarity NPD9044 Approved
0.5781 Remote Similarity NPD9230 Discontinued
0.5758 Remote Similarity NPD8785 Approved
0.5745 Remote Similarity NPD6120 Clinical (unspecified phase)
0.5735 Remote Similarity NPD8949 Discontinued
0.5714 Remote Similarity NPD8209 Phase 2
0.5714 Remote Similarity NPD8208 Clinical (unspecified phase)
0.5692 Remote Similarity NPD8872 Phase 3
0.5692 Remote Similarity NPD8871 Approved
0.5641 Remote Similarity NPD9432 Discontinued
0.5625 Remote Similarity NPD9224 Clinical (unspecified phase)
0.5614 Remote Similarity NPD8210 Phase 3
0.5614 Remote Similarity NPD8211 Approved

  Bioactivity similarity: Similar Natural Products in NPASS

Bioactivity similarity was calculated based on bioactivity descriptors of compounds. The bioactivity descriptors were calculated by a recently developed AI algorithm Chemical Checker (CC) [Nature Biotechnology, 38:1087–1096, 2020; Nature Communications, 12:3932, 2021], which evaluated bioactivity similarities at five levels:
A: chemistry similarity;
B: biological targets similarity;
C: networks similarity;
D: cell-based bioactivity similarity;
E: similarity based on clinical data.

These five categories of CC bioactivity descriptors were calculated and then projected onto a two-dimensional space with the UMAP algorithm, giving one 2-D map of all NPs per category. The current NP is highlighted with a small circle in each map.

Figures, left to right: panels A to E, corresponding to the five levels listed above.