A NEW MULTILEVEL MODELING APPROACH FOR CLUSTERED SURVIVAL DATA

Published online by Cambridge University Press: 03 March 2020

Jinfeng Xu
Affiliation:
The University of Hong Kong
Mu Yue
Affiliation:
University of Electronic Science and Technology of China
Wenyang Zhang*
Affiliation:
University of York
*Address correspondence to Wenyang Zhang, Department of Mathematics, University of York, Heslington, York YO10 5DD, UK; email: [email protected]

Abstract

In multilevel modeling of clustered survival data, differences among clusters are commonly accounted for by introducing cluster effects, either random or fixed, into the model. Modeling with random effects can make the estimation procedure for the unknown parameters of interest difficult to implement, because numerical computation of multiple integrals may become unavoidable when the cluster effects are not scalars. If fixed effects are used instead, the estimators may have large variances because the model involves too many nuisance parameters. In this article, using the idea of homogeneity pursuit, we propose a new multilevel modeling approach for clustered survival data. The proposed approach avoids the potential computational problem of modeling with random effects, and it involves far fewer unknown parameters than modeling with fixed effects. We establish asymptotic properties to show the advantages of the proposed model and conduct intensive simulation studies to demonstrate the performance of the proposed method. Finally, the proposed method is applied to a dataset on the second-birth interval in Bangladesh. The most interesting finding is that the impact of some important factors on the length of the second-birth interval varies over clusters and has a homogeneous structure.

Type
ARTICLES
Copyright
© Cambridge University Press 2020

Footnotes

We are grateful for the vast amount of editorial input by the Editor, Peter C.B. Phillips, on the final version of the manuscript. This research was supported by the National Natural Science Foundation of China (grant number 71873085).
