Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Relationship Between Prediction Accuracy and Feature Importance Reliability: an Empirical and Theoretical Study

View through CrossRef
Abstract There is significant interest in using neuroimaging data to predict behavior. The predictive models are often interpreted by the computation of feature importance, which quantifies the predictive relevance of an imaging feature. Tian and Zalesky (2021) suggest that feature importance estimates exhibit low split-half reliability, as well as a trade-off between prediction accuracy and feature importance reliability across parcellation resolutions. However, it is unclear whether the trade-off between prediction accuracy and feature importance reliability is universal. Here, we demonstrate that, with a sufficient sample size, feature importance (operationalized as Haufe-transformed weights) can achieve fair to excellent split-half reliability. With a sample size of 2600 participants, Haufe-transformed weights achieve average intra-class correlation coefficients of 0.75, 0.57 and 0.53 for cognitive, personality and mental health measures respectively. Haufe-transformed weights are much more reliable than original regression weights and univariate FC-behavior correlations. Original regression weights are not reliable even with 2600 participants. Intriguingly, feature importance reliability is strongly positively correlated with prediction accuracy across phenotypes. Within a particular behavioral domain, there is no clear relationship between prediction performance and feature importance reliability across regression models. Furthermore, we show mathematically that feature importance reliability is necessary, but not sufficient, for low feature importance error. In the case of linear models, lower feature importance error is mathematically related to lower prediction error. Therefore, higher feature importance reliability might yield lower feature importance error and higher prediction accuracy. Finally, we discuss how our theoretical results relate with the reliability of imaging features and behavioral measures. Overall, the current study provides empirical and theoretical insights into the relationship between prediction accuracy and feature importance reliability.
Title: Relationship Between Prediction Accuracy and Feature Importance Reliability: an Empirical and Theoretical Study
Description:
Abstract There is significant interest in using neuroimaging data to predict behavior.
The predictive models are often interpreted by the computation of feature importance, which quantifies the predictive relevance of an imaging feature.
Tian and Zalesky (2021) suggest that feature importance estimates exhibit low split-half reliability, as well as a trade-off between prediction accuracy and feature importance reliability across parcellation resolutions.
However, it is unclear whether the trade-off between prediction accuracy and feature importance reliability is universal.
Here, we demonstrate that, with a sufficient sample size, feature importance (operationalized as Haufe-transformed weights) can achieve fair to excellent split-half reliability.
With a sample size of 2600 participants, Haufe-transformed weights achieve average intra-class correlation coefficients of 0.
75, 0.
57 and 0.
53 for cognitive, personality and mental health measures respectively.
Haufe-transformed weights are much more reliable than original regression weights and univariate FC-behavior correlations.
Original regression weights are not reliable even with 2600 participants.
Intriguingly, feature importance reliability is strongly positively correlated with prediction accuracy across phenotypes.
Within a particular behavioral domain, there is no clear relationship between prediction performance and feature importance reliability across regression models.
Furthermore, we show mathematically that feature importance reliability is necessary, but not sufficient, for low feature importance error.
In the case of linear models, lower feature importance error is mathematically related to lower prediction error.
Therefore, higher feature importance reliability might yield lower feature importance error and higher prediction accuracy.
Finally, we discuss how our theoretical results relate with the reliability of imaging features and behavioral measures.
Overall, the current study provides empirical and theoretical insights into the relationship between prediction accuracy and feature importance reliability.

Related Results

Domination of Polynomial with Application
Domination of Polynomial with Application
In this paper, .We .initiate the study of domination. polynomial , consider G=(V,E) be a simple, finite, and directed graph without. isolated. vertex .We present a study of the Ira...
Domination of polynomial with application
Domination of polynomial with application
In this paper, .We .initiate the study of domination. polynomial , consider G=(V,E) be a simple, finite, and directed graph without. isolated. vertex .We present a study of the Ira...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Outburst prediction and influencing factors analysis based on Boruta-Apriori and BO-SVM algorithms
Outburst prediction and influencing factors analysis based on Boruta-Apriori and BO-SVM algorithms
The influencing factors of coal and gas outburst are complex, and now the accuracy and efficiency of outburst prediction are not high. In order to obtain the effective features fro...
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Smart manufacturing has been developed since the introduction of Industry 4.0. It consists of resource sharing and networking, predictive engineering, and material and data analyti...
From features to functions : leveraging protein feature architectures in comparative genomics
From features to functions : leveraging protein feature architectures in comparative genomics
When analyzing genomic data, one of the key challenges is the annotation of new genes. The toolkit for incorporating newly discovered proteins into a comprehensive evolutionary and...
Sygeplejevidenskab myte eller virkelighed?
Sygeplejevidenskab myte eller virkelighed?
ENGLISH SUMMARY: The aim of this study is to explore the feasibility and productivity of a model for description and analysis of what Bourdieu has called “the genesis and structure...
Interpreting and evaluating digital soil mapping prediction uncertainties
Interpreting and evaluating digital soil mapping prediction uncertainties
There is an implicit quality associated with all soil maps which depends on a range of factors including the mapping algorithm, the extent and quality of the calibration data, and ...

Back to Top