Javascript must be enabled to continue!
Adapting Document Similarity Measures for Ligand-Based Virtual Screening
View through CrossRef
Quantifying the similarity of molecules is considered one of the major tasks in virtual screening. There are many similarity measures that have been proposed for this purpose, some of which have been derived from document and text retrieving areas as most often these similarity methods give good results in document retrieval and can achieve good results in virtual screening. In this work, we propose a similarity measure for ligand-based virtual screening, which has been derived from a text processing similarity measure. It has been adopted to be suitable for virtual screening; we called this proposed measure the Adapted Similarity Measure of Text Processing (ASMTP). For evaluating and testing the proposed ASMTP we conducted several experiments on two different benchmark datasets: the Maximum Unbiased Validation (MUV) and the MDL Drug Data Report (MDDR). The experiments have been conducted by choosing 10 reference structures from each class randomly as queries and evaluate them in the recall of cut-offs at 1% and 5%. The overall obtained results are compared with some similarity methods including the Tanimoto coefficient, which are considered to be the conventional and standard similarity coefficients for fingerprint-based similarity calculations. The achieved results show that the performance of ligand-based virtual screening is better and outperforms the Tanimoto coefficients and other methods.
Title: Adapting Document Similarity Measures for Ligand-Based Virtual Screening
Description:
Quantifying the similarity of molecules is considered one of the major tasks in virtual screening.
There are many similarity measures that have been proposed for this purpose, some of which have been derived from document and text retrieving areas as most often these similarity methods give good results in document retrieval and can achieve good results in virtual screening.
In this work, we propose a similarity measure for ligand-based virtual screening, which has been derived from a text processing similarity measure.
It has been adopted to be suitable for virtual screening; we called this proposed measure the Adapted Similarity Measure of Text Processing (ASMTP).
For evaluating and testing the proposed ASMTP we conducted several experiments on two different benchmark datasets: the Maximum Unbiased Validation (MUV) and the MDL Drug Data Report (MDDR).
The experiments have been conducted by choosing 10 reference structures from each class randomly as queries and evaluate them in the recall of cut-offs at 1% and 5%.
The overall obtained results are compared with some similarity methods including the Tanimoto coefficient, which are considered to be the conventional and standard similarity coefficients for fingerprint-based similarity calculations.
The achieved results show that the performance of ligand-based virtual screening is better and outperforms the Tanimoto coefficients and other methods.
Related Results
Theoretical study of laser-cooled SH<sup>–</sup> anion
Theoretical study of laser-cooled SH<sup>–</sup> anion
The potential energy curves, dipole moments, and transition dipole moments for the <inline-formula><tex-math id="M13">\begin{document}${{\rm{X}}^1}{\Sigma ^ + }$\end{do...
Revisiting near-threshold photoelectron interference in argon with a non-adiabatic semiclassical model
Revisiting near-threshold photoelectron interference in argon with a non-adiabatic semiclassical model
<sec> <b>Purpose:</b> The interaction of intense, ultrashort laser pulses with atoms gives rise to rich non-perturbative phenomena, which are encoded within th...
Generation of appropriate protein structures for virtual screening using AlphaFold3 predicted protein–ligand complexes
Generation of appropriate protein structures for virtual screening using AlphaFold3 predicted protein–ligand complexes
AbstractIn early drug discovery, virtual screening—a computational method for selecting candidate compounds—helps reduce development costs. Traditionally, structure-based virtual s...
Similarity Search with Data Missing
Similarity Search with Data Missing
Similarity search is a fundamental research problem with broad applications in various research fields, including data mining, information retrieval, and machine learning. The core...
Improved Cosine Similarity Measures for q-Rung Orthopair Fuzzy Sets
Improved Cosine Similarity Measures for q-Rung Orthopair Fuzzy Sets
In this paper, we introduce some novel cosine similarity measures for \(q\)-rung orthopair fuzzy sets (\(q\)-ROFSs), which capture both direction and magnitude aspects of fuzzy set...
Transformation of recording features in an electronic environment
Transformation of recording features in an electronic environment
The article deals with one of the main theoretical problems of document science related to the definition of document features. This problem is also of applied importance, since wh...
Ukrainian Embroidery as a Type of Document
Ukrainian Embroidery as a Type of Document
The purpose of the article is to determine the general and specific features of Ukrainian embroidery as a type of carrier of documented information.
The methodology. We chose the ...

