Hostname: page-component-cd9895bd7-jn8rn Total loading time: 0 Render date: 2024-12-26T21:06:40.062Z Has data issue: false hasContentIssue false

Actuarial Applications of Multivariate Two-Part Regression Models

Published online by Cambridge University Press:  02 April 2013

Edward W. Frees*
Affiliation:
University of Wisconsin - Madison
Xiaoli Jin
Affiliation:
University of Wisconsin - Madison
Xiao Lin
Affiliation:
University of Wisconsin - Madison
*
*Correspondence to: Edward W. (Jed) Frees. E-mail: [email protected]

Abstract

This paper synthesizes and extends the literature on multivariate two-part regression modelling, with an emphasis on actuarial applications. To illustrate the modelling, we use data from the US Medical Expenditure Panel Survey to explore expenditures that come in two parts. In the first part, zero expenditures correspond to no payments for health care services during a year. For the second part, a positive expenditure corresponds to the payment amount, a measure of utilization. Expenditures are multivariate, the five components being (i) office-based, (ii) hospital outpatient, (iii) emergency room, (iv) hospital inpatient, and (v) home health expenditures. Not surprisingly, there is a high degree of association among expenditure types and so we utilize models that account for these associations. These models include multivariate binary regressions for the payment type and generalized linear models with Gaussian copulas for payment amounts.

As anticipated, the strong associations among expenditure types allow us to establish significant model differences on an in-sample basis. Despite these strong associations, we find that commonly used statistical measures perform similarly on a held-out validation sample. In contrast, out-of-sample risk measures used by actuaries reveal differences in the association among expenditure types.

Type
Papers
Copyright
Copyright © Institute and Faculty of Actuaries 2013 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Carey, V., Zeger, S.L., Diggle, P. (1993). Modelling multivariate binary data with alternating logistic regressions. Biometrika, 80(3), 517526.CrossRefGoogle Scholar
de Jong, P., Heller, G.Z. (2008). Generalized Linear Models for Insurance Data. Cambridge University Press, Cambridge, UK.CrossRefGoogle Scholar
Diggle, P.J., Heagerty, P., Liang, K.Y., Zeger, S.L. (2002). Analysis of Longitudinal Data, Second Edition. Oxford University Press.Google Scholar
Ekholm, A., Smith, P.W.F., McDonald, J.W. (1995). Marginal regression analysis of a multivariate binary response. Biometrika, 82(4), 847854.Google Scholar
Frees, E.W. (2010). Regression Modelling with Actuarial and Financial Applications. Cambridge University Press, New York.Google Scholar
Frees, E.W., Gao, J., Rosenberg, M. (2011a). The frequency and amount of inpatient and outpatient healthcare expenditures. North American Actuarial Journal, 15, 377392.Google Scholar
Frees, E.W., Meyers, G., Cummings, A.D. (2010). Dependent multi-peril ratemaking models. Astin Bulletin, 40(2), 699726.Google Scholar
Frees, E.W., Meyers, G., Cummings, A.D. (2011b). Summarizing insurance scores using a Gini index. Journal of the American Statistical Association, 106, 10851098.Google Scholar
Frees, E.W., Meyers, G., Cummings, A.D. (2012). Predictive modelling of multi-peril homeowners insurance. To appear in Variance.Google Scholar
Frees, E.W., Meyers, G., Cummings, A.D. (2013). Insurance ratemaking and a Gini index. To appear in the Journal of Risk and Insurance.Google Scholar
Frees, E.W., Shi, P., Valdez, E.A. (2009). Actuarial applications of a hierarchical insurance claims model. Astin Bulletin, 39(1), 165197.Google Scholar
Frees, E.W., Sun, Y. (2010). Household life insurance demand – a multivariate two-part model. North American Actuarial Journal, 14(3), 338354.Google Scholar
Glonek, G.F.V., MuCullagh, P. (1995). Multivariate logistic models. Journal of the Royal Statistical Society B, 57(3), 533546.Google Scholar
Haberman, S., Renshaw, A.E. (1996). Generalized linear models and actuarial science. The Statistician, 45(4), 407436.Google Scholar
Liang, K.Y., Qaqish, B., Zeger, S.L. (1992). Multivariate regression analyses for categorical data. Journal of the Royal Statistical Society B, 54(1), 340.Google Scholar
Liu, L., Strawderman, R.L., Cowen, M.E., Shih, Y.T. (2010). A flexible two-part random effects model for correlated medical costs. Journal of Health Economics, 29, 110123.Google Scholar
Mullahy, J. (1998). Much ado about two: reconsidering retransformation and the two-part model in health econometrics. Journal of Health Economics, 17(3), 247281.Google Scholar
Robinson, J.W., Zeger, S.L., Forrest, C.B. (2006). A hierarchical multivariate two-part model for profiling providers’ effects on health care charges. Journal of the American Statistical Association, 101, 911923.Google Scholar
Sun, Y. (2011). Micro-Econometric Modelling of Personal Lines Insurance, unpublished dissertation, University of Wisconsin-Madison.Google Scholar