Regression with incomplete multivariate surrogate responses for a latent covariate
dc.contributor.author | Shen, Hua | |
dc.contributor.author | Cook, Richard J. | |
dc.date.accessioned | 2021-01-13T20:16:59Z | |
dc.date.available | 2021-01-13T20:16:59Z | |
dc.date.issued | 2020-07-29 | |
dc.description | This is the Accepted Manuscript of this article published by Taylor & Francis in the Biostatistics & Epidemiology on July 29, 2020. The final form is available at https://doi.org/10.1080/24709360.2020.1794705. | en |
dc.description.abstract | We consider the setting in which a categorical exposure variable of interest can only be measured subject to misclassification via surrogate variables. These surrogate variables may represent the classification of an individual via imperfect diagnostic tests. In such settings, a random number of diagnostic tests may be ordered at the discretion of a treating physician with the decision to order further tests made in a sequential fashion based on the results of preliminary test results. Because the underlying latent status is not ascertainable these cheaper but imperfect surrogate test results are used in lieu of the definitive classification in a model for a long-term outcome. Naive use of a single surrogate or functions of the available surrogates can lead to biased estimators of the association and invalid inference. We propose a likelihood-based approach for modeling the effect of the latent variable in the absence of validation data with estimation based on an expectation–maximization (EM) algorithm. The method yields consistent and efficient estimates and is shown to out-perform several common alternative approaches. The performance of the proposed method is demonstrated in simulation studies and its utility is illustrated by applying the proposed method to the stimulating study on breast cancer. | en |
dc.description.sponsorship | This work was supported by grants from the Natural Sciences and Engineering Research Council of Canada [grant numbers RGPIN 04594 and RGPIN 155849], the Breast Cancer Society of Canada, and startup funds at the University of Calgary. | en |
dc.identifier.uri | https://doi.org/10.1080/24709360.2020.1794705 | |
dc.identifier.uri | http://hdl.handle.net/10012/16658 | |
dc.language.iso | en | en |
dc.publisher | Taylor & Francis | en |
dc.relation.ispartofseries | Biostatistics & Epidemiology;4(1) | |
dc.subject | expectation-maximization algorithm | en |
dc.subject | latent variable | en |
dc.subject | misspecified model | en |
dc.subject | regression | en |
dc.subject | surrogate variable | en |
dc.title | Regression with incomplete multivariate surrogate responses for a latent covariate | en |
dc.type | Article | en |
dcterms.bibliographicCitation | Hua Shen & Richard J. Cook (2020) Regression with incomplete multivariate surrogate responses for a latent covariate, Biostatistics & Epidemiology, 4:1, 247-264, DOI: 10.1080/24709360.2020.1794705 | en |
uws.contributor.affiliation1 | Faculty of Mathematics | en |
uws.contributor.affiliation2 | Statistics and Actuarial Science | en |
uws.peerReviewStatus | Reviewed | en |
uws.scholarLevel | Faculty | en |
uws.typeOfResource | Text | en |