Semiparametric methods for regression analysis of panel count data and mixed panel count data

[ACCESS RESTRICTED TO THE UNIVERSITY OF MISSOURI AT AUTHOR'S REQUEST.] Recurrent event data and panel count data are two common types of data that have been studied extensively in the event history literature. With recurrent event data, subjects are observed continuously during follow-up, so the occurrence times of the recurrent events of interest are available. With panel count data, subjects are monitored periodically at discrete observation times, so only the numbers of recurrent events between successive observations are recorded. In practice one may also face mixed panel count data, which are a mixture of recurrent event data and panel count data. They arise when a study subject may be observed continuously during the whole study period, continuously over some periods and only at discrete time points otherwise, or only at discrete time points. That is, mixed data provide complete or incomplete information on the recurrent event process over different time periods for different subjects.

It is well known that in panel count data the observation process may carry information about the underlying recurrent event process, and that censoring may also be dependent in practice. For this setting, the first part of this dissertation discusses regression analysis of panel count data with informative observations and drop-outs. A general means model is presented that allows both additive and multiplicative effects of covariates on the underlying recurrent event process. In addition, the proportional rates model and the accelerated failure time model are employed to describe the covariate effects on the observation process and on the dropout or follow-up process, respectively. For estimation of the regression parameters, estimating equation-based procedures are developed and the asymptotic properties of the proposed estimators are established. A resampling approach is proposed for estimating the covariance matrix of the proposed estimator, and a model checking procedure is also provided. The results of an extensive simulation study indicate that the proposed methodology works well in practical situations, and it is applied to a motivating set of real data from the Childhood Cancer Survivor Study (CCSS) given in Section 1.1.2.2.

The second part of this dissertation considers regression analysis of mixed panel count data. One major problem in statistical inference for mixed data is how to combine the two different types of data structures. Since panel count data can be viewed as interval-censored recurrent event data in which the exact occurrence times of the events of interest are unobserved or missing, they may be augmented by filling in the missing times through imputation. The mixed data can then be converted to recurrent event data, to which existing statistical inference methods can be readily applied. Motivated by this, a multiple imputation-based estimation approach is proposed. A simulation study is conducted to examine the finite-sample properties of the proposed methodology, and it shows that the proposed method is more efficient than the existing method. An illustrative example from the CCSS is also provided.

The third part of this dissertation again considers regression analysis of mixed panel count data, but in the presence of a dependent terminal event that precludes any further recurrent events of interest or observations. For this problem, we present a marginal modeling approach that acknowledges that no recurrent events can occur after the terminal event and leaves the correlation structure unspecified. To estimate the parameters of interest, an estimating equation-based procedure is developed that uses the inverse probability of survival weighting technique. Asymptotic properties of the proposed estimators are established, and their finite-sample properties are assessed in a simulation study. We again apply the proposed methodology to the CCSS. The last part of this dissertation discusses some directions for future research.
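
The imputation idea described for mixed panel count data can be illustrated with a minimal sketch. The abstract does not say how the missing event times are drawn, so the uniform placement of events within each observation interval, the function names, and the example values below are all assumptions made for illustration, not the dissertation's actual procedure.

```python
import random


def impute_event_times(obs_times, counts, rng):
    """Draw one imputed set of event times for a single panel-count record.

    obs_times: increasing observation times t_1 < ... < t_K (follow-up starts at 0).
    counts:    counts[j] is the number of events observed in (t_{j-1}, t_j].
    Assumption (not from the source): events are placed uniformly at random
    within the interval in which they were counted.
    """
    if len(obs_times) != len(counts):
        raise ValueError("obs_times and counts must have the same length")
    times = []
    left = 0.0
    for right, n in zip(obs_times, counts):
        # Fill in n hypothetical event times inside the current interval.
        times.extend(rng.uniform(left, right) for _ in range(n))
        left = right
    return sorted(times)


def multiple_imputations(obs_times, counts, m=5, seed=0):
    """Return m independently imputed event-time lists for one subject."""
    rng = random.Random(seed)
    return [impute_event_times(obs_times, counts, rng) for _ in range(m)]
```

For example, `multiple_imputations([1.0, 2.5, 4.0], [2, 0, 1], m=3)` returns three candidate event-time lists, each with two events at or before time 1.0 and one between 2.5 and 4.0. In a multiple imputation analysis, each completed data set would be analyzed with a recurrent event method and the resulting estimates combined.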

University of Missouri Libraries

Guanglei Yu

2021
