Javascript must be enabled to continue!
MV-ComBat and MV-CovBat: Multivariate Frameworks for Joint Harmonization of Multi-Metric Neuroimaging Data
View through CrossRef
Abstract
Aggregating neuroimaging data across sites and studies is increasingly common, yet site- and scanner-related batch effects can obscure meaningful biological variation and introduce spurious associations. Although ComBat and its extensions are widely used, they are primarily designed for single-metric (univariate) harmonization. In practice, neuroimaging studies often involve multiple biologically coupled metrics (e.g., cortical thickness, surface area, and gray-matter volume) measured across multiple features (e.g., regional values), with shared covariance structure both within and across metrics. Applying univariate ComBat independently to each metric ignores these dependencies and can leave residual batch effects in cross-metric covariance. Using data from the NIH Acute to Chronic Pain Signatures (A2CPS) program, we show that batch effects occur not only in means and variances but also in covariance across cortical regions and metrics—relationships that univariate ComBat does not fully remove. We propose MV-ComBat, a multivariate extension of ComBat that jointly harmonizes multiple metrics by borrowing strength across them. Both empirical Bayes (EB) and Bayesian Markov Chain Monte Carlo (MCMC) implementations of MV-ComBat effectively reduce batch effects. In our experiments, EB is more robust to measurement error, whereas MCMC more accurately recovers cross-metric correlations when priors are well specified. Recognizing that batch effects can also affect feature-level covariance, CovBat was recently introduced as an extension of ComBat that harmonizes both first- and second-order moments across sites. We extend CovBat to the multivariate framework as MV-CovBat, which performs a second-stage latent-space harmonization to address covariance-related batch effects across features and metrics. Simulations confirm that MV-ComBat improves correlation recovery and biological signal preservation relative to univariate ComBat, particularly for moderate-to-strong effects, and that MV-CovBat further improves separation of true biological variation from batch effects when independence assumptions are violated. Together, these methods provide a flexible and unified framework for harmonizing complex, multi-metric neuroimaging data in large-scale, multi-site studies.
Title: MV-ComBat and MV-CovBat: Multivariate Frameworks for Joint Harmonization of Multi-Metric Neuroimaging Data
Description:
Abstract
Aggregating neuroimaging data across sites and studies is increasingly common, yet site- and scanner-related batch effects can obscure meaningful biological variation and introduce spurious associations.
Although ComBat and its extensions are widely used, they are primarily designed for single-metric (univariate) harmonization.
In practice, neuroimaging studies often involve multiple biologically coupled metrics (e.
g.
, cortical thickness, surface area, and gray-matter volume) measured across multiple features (e.
g.
, regional values), with shared covariance structure both within and across metrics.
Applying univariate ComBat independently to each metric ignores these dependencies and can leave residual batch effects in cross-metric covariance.
Using data from the NIH Acute to Chronic Pain Signatures (A2CPS) program, we show that batch effects occur not only in means and variances but also in covariance across cortical regions and metrics—relationships that univariate ComBat does not fully remove.
We propose MV-ComBat, a multivariate extension of ComBat that jointly harmonizes multiple metrics by borrowing strength across them.
Both empirical Bayes (EB) and Bayesian Markov Chain Monte Carlo (MCMC) implementations of MV-ComBat effectively reduce batch effects.
In our experiments, EB is more robust to measurement error, whereas MCMC more accurately recovers cross-metric correlations when priors are well specified.
Recognizing that batch effects can also affect feature-level covariance, CovBat was recently introduced as an extension of ComBat that harmonizes both first- and second-order moments across sites.
We extend CovBat to the multivariate framework as MV-CovBat, which performs a second-stage latent-space harmonization to address covariance-related batch effects across features and metrics.
Simulations confirm that MV-ComBat improves correlation recovery and biological signal preservation relative to univariate ComBat, particularly for moderate-to-strong effects, and that MV-CovBat further improves separation of true biological variation from batch effects when independence assumptions are violated.
Together, these methods provide a flexible and unified framework for harmonizing complex, multi-metric neuroimaging data in large-scale, multi-site studies.
Related Results
An MRI multi‐scanner neuroimaging data harmonization study using RAVEL and ComBat
An MRI multi‐scanner neuroimaging data harmonization study using RAVEL and ComBat
AbstractBackgroundLarge‐scale multi‐site neuroimaging studies provide higher power for statistical analyses. However, these aggregated datasets are susceptible to unwanted variabil...
Effect of data harmonization of multicentric dataset in ASD/TD classification
Effect of data harmonization of multicentric dataset in ASD/TD classification
Abstract
Machine Learning (ML) is nowadays an essential tool in the analysis of Magnetic Resonance Imaging (MRI) data, in particular in the identification of brain correlat...
A comparative study of mappings in metric space and controlled metric space
A comparative study of mappings in metric space and controlled metric space
The objective of this paper is to present a comparative study of mapping in Metric Space and Controlled Metric Space. The study provides the structure, gap analysis and application...
ComBat Harmonization: Empirical Bayes versus Fully Bayes Approaches
ComBat Harmonization: Empirical Bayes versus Fully Bayes Approaches
Abstract
Studying small effects or subtle neuroanatomical variation requires large-scale sample size data. As a result, comb...
Differential Diagnosis of Neurogenic Thoracic Outlet Syndrome: A Review
Differential Diagnosis of Neurogenic Thoracic Outlet Syndrome: A Review
Abstract
Thoracic outlet syndrome (TOS) is a complex and often overlooked condition caused by the compression of neurovascular structures as they pass through the thoracic outlet. ...
An Event Based Topic Learning Pipeline for Neuroimaging Literature Mining
An Event Based Topic Learning Pipeline for Neuroimaging Literature Mining
Abstract
Neuroimaging text mining extracts knowledge from neuroimaging text and has received widespread attention. Topic learning is an important research focus of neuroima...
Expansion mapping in controlled metric space and extended B-metric space
Expansion mapping in controlled metric space and extended B-metric space
This paper delves into the intricate study of expansion mappings within the frameworks of controlled metric spaces and extended B-metric spaces. Expansion mappings, known for their...
Riemannian Curvature of a Sliced Contact Metric Manifold
Riemannian Curvature of a Sliced Contact Metric Manifold
Contact geometry become a more important issue in the mathematical world with the works which had done in the 19th century. Many mathematicians have made studies on contact manifol...

