Li, Jia
(2006)
A Strategy for Stepwise Regression Procedures in Survival Analysis with Missing Covariates.
Doctoral Dissertation, University of Pittsburgh.
(Unpublished)
Abstract
The selection of variables used to predict a time to event outcome is a common and important issue when analyzing survival data. This is an essential step in accurately assessing risk factors in medical and public health studies. Ignoring an important variable in a regression model may result in biased and inefficient estimates of the outcomes. Such bias can have major implications in public health studies because it may cause potential risk factors to be falsely declared as associated with an outcome, such as mortality or conversely, be falsely declared not associated with the outcome. Stepwise regression procedures are widely used for model selection. However, they have inherent limitations, and can lead to unreasonable results when there are missing values in the potential covariates. In the first part of this dissertation, multiple imputations are used to deal with missing covariate information. We review two powerful imputation procedures, Multiple Imputation by Chain Equations (MICE) and estimation/multiple imputation for Mixed categorical and continuous data (MIX) that implement different multiple imputation methods. We compare the performance of these two procedures by assessing the bias, efficiency and robustness in several simulation studies using time to event outcomes. Practical limitations and valuable features of these two procedures are also assessed. In the second part of the dissertation, we use imputation together with a criterion called the Brier Score to formulate an overall stepwise model selection strategy. The strategy has the advantage of enabling one to perform model selection and evaluate the predictive accuracy of a selected model at the same time, all while taking into account the missing values in the covariates. This comprehensive strategy is implemented by defining the Weighted Brier Score (WBS) using weighted survival functions. We use simulations to assess this strategy and further demonstrate its use by analyzing survival data from the National Surgical Adjuvant Breast and Bowel Project (NSABP) Protocol B-06.
Share
Citation/Export: |
|
Social Networking: |
|
Details
Item Type: |
University of Pittsburgh ETD
|
Status: |
Unpublished |
Creators/Authors: |
|
ETD Committee: |
|
Date: |
25 September 2006 |
Date Type: |
Completion |
Defense Date: |
25 July 2006 |
Approval Date: |
25 September 2006 |
Submission Date: |
31 July 2006 |
Access Restriction: |
No restriction; Release the ETD for access worldwide immediately. |
Institution: |
University of Pittsburgh |
Schools and Programs: |
School of Public Health > Biostatistics |
Degree: |
PhD - Doctor of Philosophy |
Thesis Type: |
Doctoral Dissertation |
Refereed: |
Yes |
Uncontrolled Keywords: |
bootstrap; Cox mode; missing covariates; multiple imputation; predictive accuracy; the Brier Score |
Other ID: |
http://etd.library.pitt.edu/ETD/available/etd-07312006-002632/, etd-07312006-002632 |
Date Deposited: |
10 Nov 2011 19:55 |
Last Modified: |
15 Nov 2016 13:47 |
URI: |
http://d-scholarship.pitt.edu/id/eprint/8754 |
Metrics
Monthly Views for the past 3 years
Plum Analytics
Actions (login required)
 |
View Item |