Javascript must be enabled to continue!
Semiparametric methods for regression analysis of panel count data and mixed panel count data
View through CrossRef
[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT AUTHOR'S REQUEST.] Recurrent event data and panel count data are two common types of data that have been studied extensively in event history studies in literature. By recurrent event data, we mean that subjects are observed continuously in the follow-up study and thus occurrence times of recurrent events of interest are available. For panel count data, subjects are monitored periodically at discrete observation times and thus only numbers of recurrent events between two subsequent observations are recorded. In addition, one may face mixed panel count data in practice, which are the mixture of recurrent event data and panel count data. They arise when each study subject may be observed continuously during the whole study period, continuously over some study periods and at some time points otherwise, or only at some discrete time points. That is, these mixed data provide complete or incomplete information on the recurrent event process over different time periods for different subjects. It is well-known that in panel count data, the observation process may carry information on the underlying recurrent event process and the censoring may also be dependent in practice. Under such circumstance, the first part of this dissertation will discuss regression analysis of panel count data with informative observations and drop-outs. For the problem, a general means model is presented that can allow both additive and multiplicative effects of covariates on the underlying recurrent event process. In addition, the proportional rates model and the accelerated failure time model are employed to describe the covariate effects on the observation process and the dropout or follow-up process, respectively. For estimation of regression parameters, some estimating equation-based procedures are developed and the asymptotic properties of the proposed estimators are established. In addition, a resampling approach is proposed for the estimation of the covariance matrix of the proposed estimator and a model checking procedure is also provided. The results from an extensive simulation study indicate that the proposed methodology works well for practical situations and it is applied to a motivated set of real data from the Childhood Cancer Survivor Study (CCSS) given in Section 1.1.2.2. In the second part of this dissertation, we will consider regression analysis of mixed panel count data. One major problem in the statistical inference on the mixed data is to combine these two different types of data structures. Since panel count data can be viewed as interval-censored recurrent event data with exact occurrence times of events of interest unobserved or missing, they may be augmented by filling in those missing data by imputation. Then the mixed data can be converted to recurrent event data on which the existing statistical inference method can be easily implemented. Motivated by this, a multiple imputation-based estimation approach is proposed. A simulation study is conducted to study the finite-sample properties of the proposed methodology and it shows that the proposed method is more efficient than the existing method. Also, an illustrative example from the CCSS is provided. The third part of this dissertation still considers regression analysis of mixed panel count data but in the presence of a dependent terminal event, which precludes further occurrence of either recurrent events of interest or observations. For this problem, we present a marginal modeling approach which acknowledges the fact that there will be no more recurrent events after the terminal event and leaves the correlation structure unspecified. To estimate the parameters of interest, an estimating equation-based procedure is developed and the inverse probability of survival weighting technique is used. Asymptotic properties of proposed estimators are also established and finite-sample properties are assessed in a simulation study. We again apply this proposed methodology to the CCSS. In the last part of this dissertation, we will discuss some work directions of the future research.
Title: Semiparametric methods for regression analysis of panel count data and mixed panel count data
Description:
[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT AUTHOR'S REQUEST.
] Recurrent event data and panel count data are two common types of data that have been studied extensively in event history studies in literature.
By recurrent event data, we mean that subjects are observed continuously in the follow-up study and thus occurrence times of recurrent events of interest are available.
For panel count data, subjects are monitored periodically at discrete observation times and thus only numbers of recurrent events between two subsequent observations are recorded.
In addition, one may face mixed panel count data in practice, which are the mixture of recurrent event data and panel count data.
They arise when each study subject may be observed continuously during the whole study period, continuously over some study periods and at some time points otherwise, or only at some discrete time points.
That is, these mixed data provide complete or incomplete information on the recurrent event process over different time periods for different subjects.
It is well-known that in panel count data, the observation process may carry information on the underlying recurrent event process and the censoring may also be dependent in practice.
Under such circumstance, the first part of this dissertation will discuss regression analysis of panel count data with informative observations and drop-outs.
For the problem, a general means model is presented that can allow both additive and multiplicative effects of covariates on the underlying recurrent event process.
In addition, the proportional rates model and the accelerated failure time model are employed to describe the covariate effects on the observation process and the dropout or follow-up process, respectively.
For estimation of regression parameters, some estimating equation-based procedures are developed and the asymptotic properties of the proposed estimators are established.
In addition, a resampling approach is proposed for the estimation of the covariance matrix of the proposed estimator and a model checking procedure is also provided.
The results from an extensive simulation study indicate that the proposed methodology works well for practical situations and it is applied to a motivated set of real data from the Childhood Cancer Survivor Study (CCSS) given in Section 1.
1.
2.
2.
In the second part of this dissertation, we will consider regression analysis of mixed panel count data.
One major problem in the statistical inference on the mixed data is to combine these two different types of data structures.
Since panel count data can be viewed as interval-censored recurrent event data with exact occurrence times of events of interest unobserved or missing, they may be augmented by filling in those missing data by imputation.
Then the mixed data can be converted to recurrent event data on which the existing statistical inference method can be easily implemented.
Motivated by this, a multiple imputation-based estimation approach is proposed.
A simulation study is conducted to study the finite-sample properties of the proposed methodology and it shows that the proposed method is more efficient than the existing method.
Also, an illustrative example from the CCSS is provided.
The third part of this dissertation still considers regression analysis of mixed panel count data but in the presence of a dependent terminal event, which precludes further occurrence of either recurrent events of interest or observations.
For this problem, we present a marginal modeling approach which acknowledges the fact that there will be no more recurrent events after the terminal event and leaves the correlation structure unspecified.
To estimate the parameters of interest, an estimating equation-based procedure is developed and the inverse probability of survival weighting technique is used.
Asymptotic properties of proposed estimators are also established and finite-sample properties are assessed in a simulation study.
We again apply this proposed methodology to the CCSS.
In the last part of this dissertation, we will discuss some work directions of the future research.
Related Results
New Semiparametric Method for Predicting High-Cost Patients
New Semiparametric Method for Predicting High-Cost Patients
Summary
Motivated by the Medical Expenditure Panel Survey containing data from individuals’ medical providers and employers across the United States, we propose a ne...
Multilevel Analysis of Determinants of Cattle deaths in Ethiopia
Multilevel Analysis of Determinants of Cattle deaths in Ethiopia
Abstract
Background
The Ethiopian economy is highly dependent on agriculture. Despite being more subsistence, agricultural production plays an important role in the econom...
Estimation of Network Parameters in Semiparametric Stochastic Perceptron
Estimation of Network Parameters in Semiparametric Stochastic Perceptron
It was reported (Kabashima and Shinomoto 1992) that estimators of a binary decision boundary show asymptotically strange behaviors when the probability model is ill-posed or semipa...
Platelet count patterns and patient outcomes in sepsis at a tertiary care center
Platelet count patterns and patient outcomes in sepsis at a tertiary care center
Abstract
Acute physiology and chronic health evaluation II (APACHE-II) scoring system is used to classify disease severity of patients in the intensive care unit. Howev...
A Deep Learning Semiparametric Regression for Adjusting Complex Confounding Structures
A Deep Learning Semiparametric Regression for Adjusting Complex Confounding Structures
Deep Treatment Learning (deepTL), a robust yet efficient deep learning-based semiparametric regression approach, is proposed to adjust the complex confounding structures in compara...
Semiparametric analysis of complex longitudinal data
Semiparametric analysis of complex longitudinal data
Event history data consist of the longitudinal records of event occurrence times. Recurrent event data and panel count data are two common types of event history data that occur in...
A More Accurate Estimation of Semiparametric Logistic Regression
A More Accurate Estimation of Semiparametric Logistic Regression
Growing interest in genomics research has called for new semiparametric models based on kernel machine regression for modeling health outcomes. Models containing redundant predicto...
Reticulocyte Count and Platelet Count as Predictors of Morphological Remission/Hemopoitic Recovery in Acute Lymphoblastic Leukemia (ALL) after Induction Chemotherapy
Reticulocyte Count and Platelet Count as Predictors of Morphological Remission/Hemopoitic Recovery in Acute Lymphoblastic Leukemia (ALL) after Induction Chemotherapy
Objectives: To determine the predictive values of reticulocyte and platelet count for remission in cases of acute lymphoblastic leukemia after induction therapy.
Â
Materials and Me...

