
MoTSE: an interpretable task similarity estimator for small molecular property prediction tasks

Abstract

Understanding the molecular properties (e.g., physical, chemical, or physiological characteristics and biological activities) of small molecules plays an essential role in biomedical research. The growing number of datasets has enabled the development of data-driven computational methods, especially machine learning-based methods, for molecular property prediction tasks. Because experimental labels are costly to obtain, the datasets of individual tasks generally contain limited amounts of data, which has motivated the use of transfer learning to boost performance on these tasks. Our analyses revealed that simultaneously considering similar tasks, rather than randomly chosen ones, can significantly improve the performance of transfer learning in this field. To estimate task similarity accurately, we propose an effective and interpretable computational tool named Molecular Tasks Similarity Estimator (MoTSE). By extracting task-related local and global knowledge from pretrained graph neural networks (GNNs), MoTSE projects individual tasks into a latent space and measures the distance between the embedded vectors to estimate task similarity, which in turn improves molecular property prediction. We validated that the task similarity estimated by MoTSE can serve as useful guidance for designing a more accurate transfer learning strategy for molecular property prediction. Experimental results showed that such a strategy greatly outperformed baseline methods, including training from scratch and multitask learning. Moreover, MoTSE provides interpretability for the estimated task similarity by visualizing the important loci in the molecules identified by the attribution method employed in MoTSE.
In summary, MoTSE provides an accurate method for estimating the similarity of molecular property prediction tasks for effective transfer learning, with good interpretability of the chemical or biological insights that underlie the task similarity.
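The step of measuring distances between embedded task vectors and using them to pick a transfer-learning source can be sketched as follows. This is only an illustration, not the MoTSE implementation: the abstract does not state the distance measure, so cosine similarity is an assumption here, and the task names, the embedding values, and the helper `rank_source_tasks` are all hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def rank_source_tasks(target, embeddings):
    """Return (task, similarity) pairs for every other task, most similar first.

    Hypothetical helper: `embeddings` maps a task name to its latent vector.
    """
    target_vec = embeddings[target]
    scores = [
        (name, cosine_similarity(target_vec, vec))
        for name, vec in embeddings.items()
        if name != target
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Made-up task embeddings; in practice these would be derived from
# pretrained models, which this sketch does not attempt to reproduce.
task_embeddings = {
    "solubility": [0.9, 0.1, 0.2],
    "lipophilicity": [0.8, 0.2, 0.3],
    "toxicity": [0.1, 0.9, 0.4],
}

ranking = rank_source_tasks("solubility", task_embeddings)
best_source = ranking[0][0]  # candidate source task to transfer from
```

Under these assumptions, the top-ranked task is the one whose pretrained model would be chosen for fine-tuning on the target task; any other distance measure could be substituted for `cosine_similarity` without changing the ranking logic.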

Cold Spring Harbor Laboratory

Han Li, Xinyi Zhao, Shuya Li, Fangping Wan, Dan Zhao, Jianyang Zeng

2021


Related Results

Abstract: Task knowledge can be encoded hierarchically such that complex tasks can be built by associating simpler tasks. This associative organization supports generalization to fac...

Similarity Search with Data Missing

Similarity search is a fundamental research problem with broad applications in various research fields, including data mining, information retrieval, and machine learning. The core...

On the Efficiency of the newly Proposed Convex Olanrewaju-Olanrewaju Lo-oλγ(|θ|) Penalized Regression-Type Estimator via GLMs Technique.

In this article, we proposed a novel convex penalized regression-type estimator, termed Olanrewaju-Olanrewaju penalized regression-type estimator, denoted by Lo-oλγ(|θ|) for ultra...

Expanding dual-task research by a triple-task

Multitasking research in the laboratory is dominated by extremely simplistic dual-task paradigms. Although dual-tasks allow for some variations, they do not compare well to more co...

K-L Estimator: Dealing with Multicollinearity in the Logistic Regression Model

Multicollinearity negatively affects the efficiency of the maximum likelihood estimator (MLE) in both the linear and generalized linear models. The Kibria and Lukman estimator (KLE...

Effect of property management on property price: a case study in HK

Purpose: It has been said that people's expectations of their living space have increased. They have a higher requirement not only for the facilities it provides, but also fo...

Fast and effective molecular property prediction with transferability map

Abstract: Effective transfer learning for molecular property prediction has shown considerable strength in addressing insufficient labeled molecules. Many existing methods e...

Property rights in martial law

The article is devoted to the study of property rights in martial law, the definition of «forced alienation of property» and «seizure of property», reveals their characteristics. ...
