NPASS Compound

Natural Product: NPC260619

Natural Product ID	NPC260619
Common Name (the InChIKey is temporarily assigned as the common name when no IUPAC name or alternative short name is available)	OILXMJHPFNGGTO-VFEOXTDESA-N
IUPAC Name	n.a.
Synonyms
Synthetic Gene Cluster	n.a.
ChEMBL Identifier	n.a.
PubChem CID	n.a.
Chemical Classification	[CHEMONTID:0000000] Organic compounds > [CHEMONTID:0000012] Lipids and lipid-like molecules > [CHEMONTID:0000258] Steroids and steroid derivatives > [CHEMONTID:0003567] Ergostane steroids > [CHEMONTID:0001403] Ergosterols and derivatives

The Chemical Classification was calculated by ClassyFire, a software tool for chemical taxonomy calculation. Reference: DOI:10.1186/s13321-016-0174-y.

Chemical Representations

RDKit 3D 75 78 0 0 0 0 0 0 0 0999 V2000 -6.8987 0.9078 0.3498 C 0 0 0 0 0 0 0 0 0 0 0 0 -6.5858 -0.5221 0.4552 C 0 0 0 0 0 0 0 0 0 0 0 0 -7.8747 -1.3369 0.2668 C 0 0 0 0 0 0 0 0 0 0 0 0 -5.5562 -1.1146 -0.4127 C 0 0 1 0 0 0 0 0 0 0 0 0 -5.8230 -0.9875 -1.8897 C 0 0 0 0 0 0 0 0 0 0 0 0 -4.1378 -0.8313 -0.0438 C 0 0 0 0 0 0 0 0 0 0 0 0 -3.7741 0.1244 0.7862 C 0 0 0 0 0 0 0 0 0 0 0 0 -2.3934 0.4775 1.2043 C 0 0 2 0 0 0 0 0 0 0 0 0 -2.2632 1.9746 0.9839 C 0 0 0 0 0 0 0 0 0 0 0 0 -1.3835 -0.3351 0.5172 C 0 0 1 0 0 0 0 0 0 0 0 0 -1.3097 -0.1927 -0.9516 C 0 0 0 0 0 0 0 0 0 0 0 0 -0.0992 0.6692 -1.2016 C 0 0 0 0 0 0 0 0 0 0 0 0 0.6059 0.7899 0.1390 C 0 0 2 0 0 0 0 0 0 0 0 0 2.0552 0.7924 -0.1313 C 0 0 1 0 0 0 0 0 0 0 0 0 2.6003 2.1871 0.0754 C 0 0 0 0 0 0 0 0 0 0 0 0 4.0698 2.2143 0.1527 C 0 0 0 0 0 0 0 0 0 0 0 0 4.7899 1.1340 0.2661 C 0 0 0 0 0 0 0 0 0 0 0 0 6.3077 1.1955 0.3433 C 0 0 0 0 0 0 0 0 0 0 0 0 6.8020 0.4234 -0.8528 C 0 0 2 0 0 0 0 0 0 0 0 0 5.7261 -0.1588 -1.7053 C 0 0 0 0 0 0 0 0 0 0 0 0 4.6846 -0.9499 -0.9054 C 0 0 0 0 0 0 0 0 0 0 0 0 4.2758 -0.2673 0.3339 C 0 0 1 0 0 0 0 0 0 0 0 0 4.9719 -0.8963 1.5499 C 0 0 0 0 0 0 0 0 0 0 0 0 2.8161 -0.2342 0.6564 C 0 0 2 0 0 0 0 0 0 0 0 0 2.1345 -1.5486 0.4295 C 0 0 0 0 0 0 0 0 0 0 0 0 0.7158 -1.6119 0.8464 C 0 0 0 0 0 0 0 0 0 0 0 0 0.0400 -0.2678 1.0095 C 0 0 2 0 0 0 0 0 0 0 0 0 0.0498 0.0096 2.4900 C 0 0 0 0 0 0 0 0 0 0 0 0 7.7879 -0.5091 -0.5233 O 0 0 0 0 0 0 0 0 0 0 0 0 -6.9200 1.3997 1.3562 H 0 0 0 0 0 0 0 0 0 0 0 0 -6.2796 1.5035 -0.3453 H 0 0 0 0 0 0 0 0 0 0 0 0 -7.9546 1.0570 -0.0308 H 0 0 0 0 0 0 0 0 0 0 0 0 -6.2851 -0.7566 1.5206 H 0 0 0 0 0 0 0 0 0 0 0 0 -8.7360 -0.8548 0.7280 H 0 0 0 0 0 0 0 0 0 0 0 0 -7.6941 -2.3814 0.6119 H 0 0 0 0 0 0 0 0 0 0 0 0 -8.0053 -1.4189 -0.8464 H 0 0 0 0 0 0 0 0 0 0 0 0 -5.6461 -2.2386 -0.2369 H 0 0 0 0 0 0 0 0 0 0 0 0 -6.0842 0.0137 -2.2146 H 0 0 0 0 0 0 0 0 0 0 0 0 -4.8559 -1.2547 -2.4089 H 0 0 0 0 0 0 0 0 0 0 0 0 -6.5590 -1.7237 -2.2542 H 0 0 0 0 0 0 0 0 0 0 0 0 -3.3286 -1.4581 
-0.4535 H 0 0 0 0 0 0 0 0 0 0 0 0 -4.6370 0.7019 1.1964 H 0 0 0 0 0 0 0 0 0 0 0 0 -2.3965 0.3831 2.3192 H 0 0 0 0 0 0 0 0 0 0 0 0 -1.6551 2.4717 1.7858 H 0 0 0 0 0 0 0 0 0 0 0 0 -3.2925 2.4007 1.1703 H 0 0 0 0 0 0 0 0 0 0 0 0 -1.9985 2.2596 -0.0272 H 0 0 0 0 0 0 0 0 0 0 0 0 -1.6721 -1.4339 0.7006 H 0 0 0 0 0 0 0 0 0 0 0 0 -2.2123 0.2511 -1.4447 H 0 0 0 0 0 0 0 0 0 0 0 0 -1.1675 -1.1762 -1.4417 H 0 0 0 0 0 0 0 0 0 0 0 0 0.5339 0.1574 -1.9540 H 0 0 0 0 0 0 0 0 0 0 0 0 -0.3407 1.6552 -1.6432 H 0 0 0 0 0 0 0 0 0 0 0 0 0.2777 1.7830 0.5651 H 0 0 0 0 0 0 0 0 0 0 0 0 2.2857 0.5559 -1.2128 H 0 0 0 0 0 0 0 0 0 0 0 0 2.2182 2.6335 1.0365 H 0 0 0 0 0 0 0 0 0 0 0 0 2.2753 2.8752 -0.7316 H 0 0 0 0 0 0 0 0 0 0 0 0 4.5613 3.1787 0.1124 H 0 0 0 0 0 0 0 0 0 0 0 0 6.5798 0.6484 1.2587 H 0 0 0 0 0 0 0 0 0 0 0 0 6.6651 2.2227 0.3878 H 0 0 0 0 0 0 0 0 0 0 0 0 7.3318 1.1836 -1.5004 H 0 0 0 0 0 0 0 0 0 0 0 0 6.2043 -0.9015 -2.3850 H 0 0 0 0 0 0 0 0 0 0 0 0 5.2361 0.5816 -2.3681 H 0 0 0 0 0 0 0 0 0 0 0 0 5.2104 -1.9121 -0.6327 H 0 0 0 0 0 0 0 0 0 0 0 0 3.9044 -1.2212 -1.6103 H 0 0 0 0 0 0 0 0 0 0 0 0 5.9141 -1.3757 1.2894 H 0 0 0 0 0 0 0 0 0 0 0 0 4.2739 -1.6783 1.9327 H 0 0 0 0 0 0 0 0 0 0 0 0 5.0408 -0.1314 2.3472 H 0 0 0 0 0 0 0 0 0 0 0 0 2.7876 0.1197 1.7169 H 0 0 0 0 0 0 0 0 0 0 0 0 2.1567 -1.7122 -0.6886 H 0 0 0 0 0 0 0 0 0 0 0 0 2.7137 -2.4089 0.8403 H 0 0 0 0 0 0 0 0 0 0 0 0 0.6446 -2.2116 1.7777 H 0 0 0 0 0 0 0 0 0 0 0 0 0.0980 -2.1929 0.0981 H 0 0 0 0 0 0 0 0 0 0 0 0 -0.0125 1.0572 2.7714 H 0 0 0 0 0 0 0 0 0 0 0 0 -0.7876 -0.6291 2.9142 H 0 0 0 0 0 0 0 0 0 0 0 0 0.9266 -0.4756 3.0232 H 0 0 0 0 0 0 0 0 0 0 0 0 8.3468 -0.6820 -1.3438 H 0 0 0 0 0 0 0 0 0 0 0 0 1 2 1 0 2 3 1 0 2 4 1 0 4 5 1 0 4 6 1 0 6 7 2 0 7 8 1 0 8 9 1 0 8 10 1 0 10 11 1 0 11 12 1 0 12 13 1 0 13 14 1 0 14 15 1 0 15 16 1 0 16 17 2 0 17 18 1 0 18 19 1 0 19 20 1 0 20 21 1 0 21 22 1 0 22 23 1 1 22 24 1 0 24 25 1 0 25 26 1 0 26 27 1 0 27 28 1 1 19 29 1 0 27 10 1 0 27 13 1 0 24 14 1 0 22 17 1 0 1 30 1 0 1 31 1 0 1 32 1 
0 2 33 1 0 3 34 1 0 3 35 1 0 3 36 1 0 4 37 1 1 5 38 1 0 5 39 1 0 5 40 1 0 6 41 1 0 7 42 1 0 8 43 1 1 9 44 1 0 9 45 1 0 9 46 1 0 10 47 1 6 11 48 1 0 11 49 1 0 12 50 1 0 12 51 1 0 13 52 1 1 14 53 1 6 15 54 1 0 15 55 1 0 16 56 1 0 18 57 1 0 18 58 1 0 19 59 1 6 20 60 1 0 20 61 1 0 21 62 1 0 21 63 1 0 23 64 1 0 23 65 1 0 23 66 1 0 24 67 1 1 25 68 1 0 25 69 1 0 26 70 1 0 26 71 1 0 28 72 1 0 28 73 1 0 28 74 1 0 29 75 1 0 M END

Standard InChIKey	OILXMJHPFNGGTO-VFEOXTDESA-N
Standard InChI	InChI=1S/C28H46O/c1-18(2)19(3)7-8-20(4)24-11-12-25-23-10-9-21-17-22(29)13-15-27(21,5)26(23)14-16-28(24,25)6/h7-9,18-20,22-26,29H,10-17H2,1-6H3/b8-7+/t19-,20-,22-,23+,24+,25+,26+,27-,28+/m0/s1
SMILES	CC(C)[C@@H](C)/C=C/[C@H](C)[C@H]1CC[C@@H]2[C@H]3CC=C4C[C@H](CC[C@]4(C)[C@@H]3CC[C@]12C)O
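As a quick consistency check, the formula layer of the InChI can be parsed into element counts and summed into a monoisotopic mass. The sketch below is illustrative only: the helper names are hypothetical, the isotope masses are standard reference values rather than anything stated on this page, and only the first two InChI layers are used.

```python
import re

# Monoisotopic masses (u) of the most abundant isotopes; standard reference
# values, not taken from this record.
MONOISOTOPIC_MASS = {"C": 12.0, "H": 1.00782503, "O": 15.99491462}


def formula_from_inchi(inchi: str) -> dict[str, int]:
    """Return element counts from the formula layer of a standard InChI."""
    formula = inchi.split("/")[1]  # second layer, e.g. "C28H46O"
    counts: dict[str, int] = {}
    for symbol, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[symbol] = counts.get(symbol, 0) + (int(number) if number else 1)
    return counts


def monoisotopic_mass(counts: dict[str, int]) -> float:
    """Sum isotope masses weighted by element counts."""
    return sum(MONOISOTOPIC_MASS[element] * n for element, n in counts.items())


# Only the version and formula layers of the InChI above are needed here.
INCHI_PREFIX = "InChI=1S/C28H46O"

counts = formula_from_inchi(INCHI_PREFIX)
mass = monoisotopic_mass(counts)
print(counts)          # {'C': 28, 'H': 46, 'O': 1}
print(round(mass, 2))  # 398.35
```

The result agrees with the molecular weight of 398.35 reported under Calculated Properties, which suggests that field is a monoisotopic mass.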

Calculated Properties

Physicochemical Properties

Molecular Weight:	398.35	Volume:	462.136 (Van der Waals volume)
Density:	0.862	LogP:	6.362 (logarithm of the n-octanol/water partition coefficient)
logD7.4:	4.77 (logarithm of the n-octanol/water distribution coefficient at pH 7.4)	LogS:	-5.959 (logarithm of the aqueous solubility)
Rotatable Bonds:	4.0	Rigid Bonds:	21.0
TPSA:	20.23 (topological polar surface area)	H-Bond Acceptor:	1.0
H-Bond Donor:	1.0	Rings:	4.0
Heteroatoms:	1.0
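The listed density value (0.862) is numerically consistent with the molecular weight divided by the Van der Waals volume. This is an observation from the numbers in this record, not a definition documented on the page, so the sketch below should be read as a sanity check only.

```python
# Values copied from the property table above.
molecular_weight = 398.35
vdw_volume = 462.136

# Assumption: density is taken as molecular weight divided by Van der Waals volume.
density = molecular_weight / vdw_volume
print(round(density, 3))  # 0.862
```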

MedChem Properties

QED Drug-Likeness Score:	0.489	GASA:	1.0 (probability of being difficult to synthesize, ranging from 0 to 1)
Synthetic Accessibility Score:	4.483	Fsp³:	0.857
MCE-18:	69.577 (MCE-18 stands for medicinal chemistry evolution; a value of 45 or above is considered suitable)	Lipinski Rule-of-5:	Rejected
Pfizer Rule:	Accepted	GSK Rule:	Accepted
Golden Triangle Rule:	Rejected	BMS Rule:	0
Chelating Alert:	0	PAINS Alert:	0
Colloidal aggregators:	0.988	Fluc inhibitor:	0.0 (probability of being an FLuc inhibitor, from 0 to 1)
Blue fluorescence:	0.003 (probability of blue fluorescence, from 0 to 1)	Green fluorescence:	0.0 (probability of green fluorescence, from 0 to 1)
Reactive compounds:	0.615	Promiscuous compounds:	0.026
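The Fsp³ value above can be reproduced by hand from the structure: Fsp³ is the fraction of carbon atoms that are sp³-hybridized. The sketch below uses atom indices from the molblock in this record (atoms 1 to 28 are carbon; the two order-2 bonds are 6-7 and 16-17). The function name is hypothetical, and the rule "any carbon in a multiple bond is not sp³" is a simplification that is adequate for this molecule.

```python
def fsp3(carbon_atoms: set[int], multiple_bonds: list[tuple[int, int]]) -> float:
    """Fraction of carbons that take part in no double or triple bond."""
    unsaturated = {
        atom for bond in multiple_bonds for atom in bond if atom in carbon_atoms
    }
    return (len(carbon_atoms) - len(unsaturated)) / len(carbon_atoms)


carbons = set(range(1, 29))        # atoms 1-28 are carbon in the molblock
double_bonds = [(6, 7), (16, 17)]  # the two bonds with order 2 in the molblock

print(round(fsp3(carbons, double_bonds), 3))  # 24 / 28 = 0.857
```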

ADMET Properties (ADMETlab3.0)

ADMET: Absorption

Caco-2 Permeability:	-5.203	MDCK Permeability:	-4.891
Pgp-inhibitor:	0.792	Pgp-substrate:	0.005
PAMPA:	0.008 (the experimental Peff data were log-transformed to logPeff; molecules with logPeff below 2.0 were classified as low-permeability, Category 0, and those above 2.5 as high-permeability, Category 1)	Human Intestinal Absorption (HIA):	0.0
20% Bioavailability (F20%):	0.936	30% Bioavailability (F30%):	0.94
50% Bioavailability (F50%):	0.999

ADMET: Distribution

Blood-Brain-Barrier Penetration (BBB):	0.001	MRP1:	0.127
Plasma Protein Binding (PPB):	98.477%	Volume Distribution (VD):	-0.087
Fu:	1.263% (fraction unbound in plasma)	OATP1B1 inhibitor:	1.0
OATP1B3 inhibitor:	1.0	BCRP inhibitor:	0.69
BSEP inhibitor:	1.0

ADMET: Metabolism

CYP1A2-inhibitor:	0.0	CYP1A2-substrate:	0.003
CYP2C19-inhibitor:	0.0	CYP2C19-substrate:	0.028
CYP2C9-inhibitor:	0.0	CYP2C9-substrate:	0.019
CYP2D6-inhibitor:	0.001	CYP2D6-substrate:	0.16
CYP3A4-inhibitor:	0.942	CYP3A4-substrate:	1.0
CYP2B6-substrate:	0.0	CYP2C8-inhibitor:	1.0
HLM stability:	0.909 (human liver microsomal stability. Category 0: stable, HLM > 30 min; Category 1: unstable, HLM ≤ 30 min. The value is the probability of instability; values closer to 1 indicate a higher likelihood of instability)

ADMET: Excretion

Clearance (CL):	13.753	Half-life (T1/2):	0.539

ADMET: Toxicity

hERG Blockers:	0.105	hERG Blockers (10um):	0.358
Human Hepatotoxicity (H-HT):	0.62	Drug-induced Liver Injury (DILI):	0.218
AMES Toxicity:	0.054	Rat Oral Acute Toxicity:	0.119
Maximum Recommended Daily Dose:	0.486	Skin Sensitization:	0.919
Carcinogenicity:	0.796	Eye Corrosion:	0.129
Eye Irritation:	0.863	Respiratory Toxicity:	0.614
Drug-induced Neurotoxicity:	0.074	Ototoxicity:	0.645
Hematotoxicity:	0.36	Drug-induced Nephrotoxicity:	0.513
Genotoxicity:	0.027	RPMI-8226 Immunotoxicity:	0.081
A549 Cytotoxicity:	0.309	Hek293 Cytotoxicity:	0.544
BCF:	2.795 (bioconcentration factor, used to assess secondary poisoning potential and risks to human health via the food chain; unit: -log10[(mg/L)/(1000*MW)])	IGC50:	4.25 (48-hour Tetrahymena pyriformis IGC50; unit: -log10[(mg/L)/(1000*MW)])
LC50DM:	4.956 (48-hour Daphnia magna LC50; unit: -log10[(mg/L)/(1000*MW)])	LC50FM:	4.899 (96-hour fathead minnow LC50; unit: -log10[(mg/L)/(1000*MW)])
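The unit -log10[(mg/L)/(1000*MW)] is a negative log of a molar concentration, so a predicted value can be converted back to mg/L by inverting the expression. The following is a minimal sketch with hypothetical function names; it takes the molecular weight (398.35) and the IGC50 value (4.25) from this record.

```python
import math


def to_mg_per_l(value: float, mw: float) -> float:
    """Invert -log10[(mg/L) / (1000 * MW)] to recover a concentration in mg/L."""
    return 10 ** (-value) * 1000 * mw


def from_mg_per_l(conc_mg_per_l: float, mw: float) -> float:
    """Forward transform: mg/L to -log10 of the molar concentration."""
    return -math.log10(conc_mg_per_l / (1000 * mw))


MW = 398.35
igc50_mg_per_l = to_mg_per_l(4.25, MW)
print(round(igc50_mg_per_l, 1))  # about 22.4 mg/L
```

The same conversion applies to the LC50DM and LC50FM values, which share this unit.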

Species Source

Organism ID	Organism Name	Taxonomy Level	Family	SuperKingdom	Isolation Part	Collection Location	Collection Time	Reference
NPO18007	Agrimonia pilosa	Species	Rosaceae	Eukaryota	n.a.	n.a.	n.a.	PMID[22281186]
NPO21305	Carpesium macrocephalum	Species	Asteraceae	Eukaryota	Whole Plant	n.a.	n.a.	PMID[26394911]
NPO18007	Agrimonia pilosa	Species	Rosaceae	Eukaryota	n.a.	n.a.	n.a.	PMID[27588326]
NPO14041	Dendrobium findlayanum	Species	Orchidaceae	Eukaryota	n.a.	n.a.	n.a.	PMID[29338260]
NPO21305	Carpesium macrocephalum	Species	Asteraceae	Eukaryota	n.a.	n.a.	n.a.	PMID[39407585]
NPO22452	Leucanthemum vulgare	Species	Asteraceae	Eukaryota	n.a.	n.a.	n.a.	Database[COCONUT]
NPO14041	Dendrobium findlayanum	Species	Orchidaceae	Eukaryota	n.a.	n.a.	n.a.	Database[COCONUT]
NPO18007	Agrimonia pilosa	Species	Rosaceae	Eukaryota	n.a.	n.a.	n.a.	Database[COCONUT]
NPO18007	Agrimonia pilosa	Species	Rosaceae	Eukaryota	n.a.	n.a.	n.a.	Database[HerDing]
NPO18007	Agrimonia pilosa	Species	Rosaceae	Eukaryota	n.a.	n.a.	n.a.	Database[TCMID]
NPO22452	Leucanthemum vulgare	Species	Asteraceae	Eukaryota	n.a.	n.a.	n.a.	Database[TCMID]
NPO14041	Dendrobium findlayanum	Species	Orchidaceae	Eukaryota	n.a.	n.a.	n.a.	Database[TCMID]
NPO18007	Agrimonia pilosa	Species	Rosaceae	Eukaryota	n.a.	n.a.	n.a.	Database[TCM_Taiwan]
NPO18007	Agrimonia pilosa	Species	Rosaceae	Eukaryota	n.a.	n.a.	n.a.	Database[TM-MC]
NPO18007	Agrimonia pilosa	Species	Rosaceae	Eukaryota	n.a.	n.a.	n.a.	Database[UNPD]
NPO22452	Leucanthemum vulgare	Species	Asteraceae	Eukaryota	n.a.	n.a.	n.a.	Database[UNPD]
NPO14041	Dendrobium findlayanum	Species	Orchidaceae	Eukaryota	n.a.	n.a.	n.a.	Database[UNPD]
NPO21305	Carpesium macrocephalum	Species	Asteraceae	Eukaryota	n.a.	n.a.	n.a.	Database[UNPD]
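Because the same organism appears once per supporting reference, it is often convenient to collapse the table into one entry per organism. The sketch below does this for a tab-separated excerpt; the excerpt keeps only three of the columns and a handful of the rows above, and the variable names are illustrative.

```python
import csv
import io
from collections import defaultdict

# Excerpt of the Species Source table (trimmed to three columns, four rows).
EXCERPT = (
    "Organism ID\tOrganism Name\tReference\n"
    "NPO18007\tAgrimonia pilosa\tPMID[22281186]\n"
    "NPO21305\tCarpesium macrocephalum\tPMID[26394911]\n"
    "NPO18007\tAgrimonia pilosa\tDatabase[COCONUT]\n"
    "NPO22452\tLeucanthemum vulgare\tDatabase[UNPD]\n"
)

# Map each organism name to the list of references that support it.
references_by_organism: dict[str, list[str]] = defaultdict(list)
for row in csv.DictReader(io.StringIO(EXCERPT), delimiter="\t"):
    references_by_organism[row["Organism Name"]].append(row["Reference"])

for organism, refs in references_by_organism.items():
    print(organism, refs)
```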

Note for Reference:
In addition to collecting NP source organism data directly from the primary literature (where references are provided as NCBI PMID or DOI links), NPASS also integrates such data from the following databases:
☉ UNPD: Universal Natural Products Database [PMID: 23638153].
☉ StreptomeDB: a database of streptomycetes natural products [PMID: 33051671].
☉ TM-MC: a database of medicinal materials and chemical compounds in Northeast Asian traditional medicine [PMID: 26156871].
☉ TCM@Taiwan: a Traditional Chinese Medicine database [PMID: 21253603].
☉ TCMID: a Traditional Chinese Medicine database [PMID: 29106634].
☉ TCMSP: The traditional Chinese medicine systems pharmacology database and analysis platform [PMID: 24735618].
☉ HerDing: a herb recommendation system to treat diseases using genes and chemicals [PMID: 26980517].
☉ MetaboLights: a metabolomics database [PMID: 27010336].
☉ FooDB: a database of constituents, chemistry and biology of food species [www.foodb.ca].

NP Quantity Composition/Concentration

Organism ID	Organism Name	Organism Material Preparation	Organism Part	NP Quantity (Standard)	NP Quantity (Minimum)	NP Quantity (Maximum)	Quantity Unit	Reference

Note for Reference:
In addition to collecting NP quantitative data directly from the primary literature (where references are provided as NCBI PMID or DOI links), NPASS also integrates NP quantitative records for specific NP domains (e.g., NPs from foods or herbs) from domain-specific databases. These databases include:
☉ DUKE: Dr. Duke's Phytochemical and Ethnobotanical Databases.
☉ PHENOL EXPLORER: the first comprehensive database on polyphenol content in foods [PMID: 24103452].
☉ FooDB: a database of constituents, chemistry and biology of food species [www.foodb.ca].

Biological Activity

Molecular-level activity

Target ID	Target Type	Target Name	Target Organism	Activity Type	Activity Relation	Value	Unit	Reference

In vitro activity

Target ID	Target Type	Target Name	Target Organism	Activity Type	Activity Relation	Value	Unit	Reference

In vivo activity

Target ID	Target Type	Target Name	Target Organism	Activity Type	Activity Relation	Value	Unit	Reference

Experimental ADME

Experiment Model	Experiment Tissue	ADME Type	ADME Relation	ADME Value	ADME Unit	Reference

Experimental Toxicity

Quantitative toxicity

Experiment Model	Experiment Organism	Toxicity Type	Toxicity Relation	Toxicity Value	Toxicity Unit	Reference

Common Abbreviations:
LC: Lethal Concentration; LD: Lethal Dose; LT: Lethal Time; NOAEL: No-observed-adverse-effect Level; BMDL: Benchmark Dose Lower Confidence Limit; BMD: Benchmark Dose; BMC: Benchmark Concentration; LOAEL: Lowest Observed Adverse Effect Level; RfD: Reference Dose; RfC: Reference Concentration; MRL: Minimal Risk Level; MEG: Maximum Exposure Guideline; PAC: Protective Action Criteria

Categorical toxicity labels

Hepatotoxicity	Carcinogenicity	Mutagenicity	Cardiotoxicity	Respiratory Toxicity	Eye Irritation	Endocrine Disruption

Note for Reference:
In addition to collecting NP quantitative toxicity data directly from the primary literature (where references are provided as NCBI PMID or DOI links), NPASS also integrates NP toxicity records from domain-specific databases. These databases include:
☉ ToxValDB: a curated database that compiles quantitative toxicity values for chemicals from diverse public sources to support toxicological research and risk assessment.
☉ TOXRIC: a comprehensive, free-to-access, online database providing toxicological/feature data. The toxicity labels are retrieved from this database. [PMID: 36400569]

Chemical structural similarity

Similar Active Natural Products in NPASS

The top 200 similar NPs were calculated against the active-NP set (approximately 50,000 NPs with experimentally derived bioactivity available in NPASS).

Similarity is measured using the Tanimoto coefficient (Tc), which compares the binary fingerprints of two molecules. Tc is calculated as the number of '1' bits shared by both fingerprints divided by the number of '1' bits set in either fingerprint. It ranges from 0 to 1, with 1 indicating the highest similarity.
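The definition of Tc can be illustrated with a small sketch in which each fingerprint is represented as the set of positions of its '1' bits. The fingerprints here are made-up toy values, not the fingerprints NPASS uses, and returning 0.0 for two empty fingerprints is a convention chosen for this example.

```python
def tanimoto(fp_a: set[int], fp_b: set[int]) -> float:
    """Tanimoto coefficient of two fingerprints given as sets of on-bit positions."""
    union = fp_a | fp_b
    if not union:
        return 0.0  # convention chosen here for two all-zero fingerprints
    return len(fp_a & fp_b) / len(union)


# Toy fingerprints (hypothetical on-bit positions).
fp_query = {1, 4, 7, 9}
fp_other = {1, 4, 8, 9, 12}

print(tanimoto(fp_query, fp_other))  # 3 shared bits / 6 bits in the union = 0.5
```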

● The left chart: Distribution of similarity level between NPC260619 and all remaining natural products in the NPASS database.
● The right table: Most similar natural products (Tc>=0.5 or Top200).

Tc range	Number of natural products
0-0.1	23035
0.1-0.2	22938
0.2-0.3	3456
0.3-0.4	272
0.4-0.5	82
0.5-0.6	27
0.6-0.7	28
0.7-0.8	3
0.8-0.85	0
0.85-0.9	1
0.9-0.95	1
0.95-1	0

Similarity Score	Similarity Level	Natural Product ID
0.913	High Similarity	NPC154330
0.8936	High Similarity	NPC113733
0.7917	Intermediate Similarity	NPC162742
0.7917	Intermediate Similarity	NPC304309
0.7917	Intermediate Similarity	NPC470228
0.7692	Intermediate Similarity	NPC33913
0.7451	Intermediate Similarity	NPC230301
0.74	Intermediate Similarity	NPC22105
0.74	Intermediate Similarity	NPC34019
0.74	Intermediate Similarity	NPC107059
0.74	Intermediate Similarity	NPC600590
0.7255	Intermediate Similarity	NPC136188
0.7255	Intermediate Similarity	NPC28657
0.7255	Intermediate Similarity	NPC474216
0.7115	Intermediate Similarity	NPC198968
0.7115	Intermediate Similarity	NPC285893
0.7115	Intermediate Similarity	NPC134847
0.7059	Intermediate Similarity	NPC221758
0.6981	Remote Similarity	NPC241290
0.6981	Remote Similarity	NPC164840
0.6981	Remote Similarity	NPC484739
0.6981	Remote Similarity	NPC209944
0.6981	Remote Similarity	NPC264245
0.6981	Remote Similarity	NPC155986
0.6852	Remote Similarity	NPC328714
0.6852	Remote Similarity	NPC321381
0.6727	Remote Similarity	NPC472265
0.6727	Remote Similarity	NPC318495
0.6727	Remote Similarity	NPC59453
0.6667	Remote Similarity	NPC51014
0.6607	Remote Similarity	NPC603646
0.6491	Remote Similarity	NPC243985
0.6491	Remote Similarity	NPC473943
0.6491	Remote Similarity	NPC280710
0.6491	Remote Similarity	NPC240650
0.6491	Remote Similarity	NPC155011
0.64	Remote Similarity	NPC96319
0.6379	Remote Similarity	NPC474164
0.6379	Remote Similarity	NPC47761
0.6379	Remote Similarity	NPC488870
0.6296	Remote Similarity	NPC328313
0.6071	Remote Similarity	NPC234193
0.6066	Remote Similarity	NPC601043
0.6066	Remote Similarity	NPC605412
0.6034	Remote Similarity	NPC58063
0.6034	Remote Similarity	NPC477522
0.5893	Remote Similarity	NPC76879
0.5862	Remote Similarity	NPC1272
0.5862	Remote Similarity	NPC470614
0.5818	Remote Similarity	NPC151519
0.5818	Remote Similarity	NPC307965
0.5818	Remote Similarity	NPC18603
0.5818	Remote Similarity	NPC491013
0.5714	Remote Similarity	NPC20688
0.5645	Remote Similarity	NPC474189
0.5424	Remote Similarity	NPC87604
0.5397	Remote Similarity	NPC474349
0.5312	Remote Similarity	NPC176012
0.5231	Remote Similarity	NPC5985
0.5224	Remote Similarity	NPC235126
0.5224	Remote Similarity	NPC309493
0.5224	Remote Similarity	NPC242419
0.5217	Remote Similarity	NPC158088
0.5156	Remote Similarity	NPC474970
0.5152	Remote Similarity	NPC91604
0.5147	Remote Similarity	NPC147835
0.5147	Remote Similarity	NPC253645
0.5147	Remote Similarity	NPC85001
0.5147	Remote Similarity	NPC95920
0.5143	Remote Similarity	NPC3715
0.5088	Remote Similarity	NPC81306
0.5082	Remote Similarity	NPC474207
0.5072	Remote Similarity	NPC486119
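The similarity levels in the table follow the scores in a way that can be captured by two cutoffs. The cutoffs used below (0.85 and 0.7) are assumptions inferred from the rows above and from the histogram bin edges; the page does not state them, and the table lists only scores of 0.5 and above, so the label for lower scores is not confirmed.

```python
# Assumed cutoffs, inferred from the table rows and histogram bins.
HIGH_CUTOFF = 0.85
INTERMEDIATE_CUTOFF = 0.7


def similarity_level(tc: float) -> str:
    """Map a Tanimoto coefficient to the level label used in the table."""
    if tc >= HIGH_CUTOFF:
        return "High Similarity"
    if tc >= INTERMEDIATE_CUTOFF:
        return "Intermediate Similarity"
    return "Remote Similarity"


for score in (0.913, 0.8936, 0.7917, 0.7059, 0.6981, 0.5072):
    print(score, similarity_level(score))
```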

Similar Clinical/Approved Drugs

Similarity level is defined by Tanimoto coefficient (Tc) between two molecules.

● The left chart: Distribution of similarity level between NPC260619 and all drugs/candidates.
● The right table: Most similar clinical/approved drugs (Tc>=0.5 or Top200).

Tc range	Number of drugs/candidates
0-0.1	7615
0.1-0.2	1335
0.2-0.3	169
0.3-0.4	19
0.4-0.5	7
0.5-0.6	1
0.6-0.7	1
0.7-0.8	3
0.8-0.85	0
0.85-0.9	0
0.9-0.95	0
0.95-1	0

Similarity Score	Similarity Level	Drug ID	Developmental Stage
0.7451	Intermediate Similarity	NPD7339	Approved
0.7255	Intermediate Similarity	NPD6942	Phase 4
0.7059	Intermediate Similarity	NPD4786	Phase 1
0.64	Remote Similarity	NPD3701	Pre-clinical
0.5818	Remote Similarity	NPD3667	Phase 4

Bioactivity similarity

Similar Natural Products in NPASS

Bioactivity similarity was calculated from the bioactivity descriptors of compounds. The descriptors were computed with Chemical Checker (CC), a recently developed AI algorithm [Nature Biotechnology, 38:1087–1096, 2020; Nature Communications, 12:3932, 2021], which evaluates bioactivity similarity at five levels:
☉ A: chemistry similarity;
☉ B: biological targets similarity;
☉ C: networks similarity;
☉ D: cell-based bioactivity similarity;
☉ E: similarity based on clinical data.
The five categories of CC bioactivity descriptors were calculated and then projected onto a 2-dimensional space with the UMAP algorithm, so that all NPs appear on one map per category. The current NP is highlighted with a small circle in each 2-D map. The figures below run left to right, A to E.

[Figures: UMAP maps for A (chemistry similarity), B (biological targets similarity), C (networks similarity), D (cell-based bioactivity similarity), and E (similarity based on clinical data).]