Javascript must be enabled to continue!
Variable selection and causal treatment effect estimation based on interval-censored failure time data
View through CrossRef
[EMBARGOED UNTIL 6/1/2023] Variable selection has been discussed under many contexts and especially a great deal of literature has been established in the failure time context with constant coefficients. However, the time-varying effect sometimes could show more insight of how the influence changing through time. For example, the treatment effect may vanish because of mutation of virus. In addition, to identify important variables or covariates, a desired feature of a variable selection method is to distinguish time-varying coefficients from time-independent ones, which also presents an additional challenge. Nevertheless, only limited research exists on the variable selection for time-varying effect. Existing methods focused on right-censored data or generalized linear model. In Chapter 2 and Chapter 3, we discuss simultaneous parameter estimation and variable selection for interval censored data with time-varying effects, which can simultaneously select between time-dependent and time-independent covariate effects. To implement the proposed procedure, an EM algorithm is developed, and a simulation study is conducted and suggests that the proposed method works well in practical situations. What's more, the augmented Lagrangian method is used in implementation [Bertsekas, 1996] to deal with the compositional covariates in microbiome data. Finally, its usefulness is illustrated by the real data that motivated these studies. Another focus of this dissertation is the treatment effect estimation. In the presence of censoring, standard methods of summarizing the treatment effect estimates, Kaplan-Meier curves (survival function), the logrank test, etc., are not proper in observational studies as they all based on randomized experimental designs. For causal inference on survival outcomes, the commonly used causal estimands are: restricted average survival time, survival probability, survival quantile, and the marginal hazard ratio. But for the commonly used marginal hazard ratio in survival data analysis, it does not fit into Rubin's causal model framework because the observed baseline covariate balance is not guaranteed after the first failure happened in the sample. The susceptible subjects tend to experience failure events earlier, which will introduce selection bias problem to the analysis. For this reason, our target estimand is restricted average survival time, which is the difference between restricted mean survival time (RMST) defined on the potential survival time in treated and control groups. In Chapter 4, we propose a method for causal inference on interval-censored data in observational studies utilizing the pseudo observation approach. The pseudo observation for the interval-censored data is based on two methods. One is jack-knife method where the pseudo restricted mean survival time is calculated with the method proposed in Zhang et al. [2020]. Another approach uses the fast approximation of the jack-knife pseudo observations proposed by Bouaziz [2021]. With the calculated pseudo-observations, we propose to use IPW method to adjust the confounding effect arose in observation studies, where it tends to have different distributions of treatment assignment in treated and control group.
Title: Variable selection and causal treatment effect estimation based on interval-censored failure time data
Description:
[EMBARGOED UNTIL 6/1/2023] Variable selection has been discussed under many contexts and especially a great deal of literature has been established in the failure time context with constant coefficients.
However, the time-varying effect sometimes could show more insight of how the influence changing through time.
For example, the treatment effect may vanish because of mutation of virus.
In addition, to identify important variables or covariates, a desired feature of a variable selection method is to distinguish time-varying coefficients from time-independent ones, which also presents an additional challenge.
Nevertheless, only limited research exists on the variable selection for time-varying effect.
Existing methods focused on right-censored data or generalized linear model.
In Chapter 2 and Chapter 3, we discuss simultaneous parameter estimation and variable selection for interval censored data with time-varying effects, which can simultaneously select between time-dependent and time-independent covariate effects.
To implement the proposed procedure, an EM algorithm is developed, and a simulation study is conducted and suggests that the proposed method works well in practical situations.
What's more, the augmented Lagrangian method is used in implementation [Bertsekas, 1996] to deal with the compositional covariates in microbiome data.
Finally, its usefulness is illustrated by the real data that motivated these studies.
Another focus of this dissertation is the treatment effect estimation.
In the presence of censoring, standard methods of summarizing the treatment effect estimates, Kaplan-Meier curves (survival function), the logrank test, etc.
, are not proper in observational studies as they all based on randomized experimental designs.
For causal inference on survival outcomes, the commonly used causal estimands are: restricted average survival time, survival probability, survival quantile, and the marginal hazard ratio.
But for the commonly used marginal hazard ratio in survival data analysis, it does not fit into Rubin's causal model framework because the observed baseline covariate balance is not guaranteed after the first failure happened in the sample.
The susceptible subjects tend to experience failure events earlier, which will introduce selection bias problem to the analysis.
For this reason, our target estimand is restricted average survival time, which is the difference between restricted mean survival time (RMST) defined on the potential survival time in treated and control groups.
In Chapter 4, we propose a method for causal inference on interval-censored data in observational studies utilizing the pseudo observation approach.
The pseudo observation for the interval-censored data is based on two methods.
One is jack-knife method where the pseudo restricted mean survival time is calculated with the method proposed in Zhang et al.
[2020].
Another approach uses the fast approximation of the jack-knife pseudo observations proposed by Bouaziz [2021].
With the calculated pseudo-observations, we propose to use IPW method to adjust the confounding effect arose in observation studies, where it tends to have different distributions of treatment assignment in treated and control group.
Related Results
Regression analysis of interval-censored failure time data with non proportional hazards models
Regression analysis of interval-censored failure time data with non proportional hazards models
[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT AUTHOR'S REQUEST.] Interval-censored failure time data arises when the failure time of interest is known only to lie within an i...
Causal discovery and prediction: methods and algorithms
Causal discovery and prediction: methods and algorithms
(English) This thesis focuses on the discovery of causal relations and on the prediction of causal effects. Regarding causal discovery, this thesis introduces a novel and generic m...
A Practical Guide to Causal Inference in Three-Wave Panel Studies
A Practical Guide to Causal Inference in Three-Wave Panel Studies
Causal inference from observational data poses considerable challenges. This guide explains an approach to estimating causal effects using panel data focussing on the three-wave pa...
Interval censoring
Interval censoring
Interval-censored failure time data occur in many medical investigations as well as other studies such as demographical and sociological studies. They include the usual right-censo...
The Effect of Product Quality and Service Quality on Customer Satisfaction at SLV Room Boutique
The Effect of Product Quality and Service Quality on Customer Satisfaction at SLV Room Boutique
The purpose of the study was to determine the effect of product quality and service quality on customer satisfaction at the SLV Room Boutique. The population in this study were con...
Asymptotical problems of sequential interval and point estimation
Asymptotical problems of sequential interval and point estimation
The accuracy of interval estimation systems is usually measured using interval lengths for given covering probabilities. The confidence intervals are the intervals of a fixed width...
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Smart manufacturing has been developed since the introduction of Industry 4.0. It consists of resource sharing and networking, predictive engineering, and material and data analyti...

