Published online by Cambridge University Press: 05 March 2016
The previous chapters have focussed on confidence distributions and associated inference for parameters of statistical models. Sometimes the goal of an analysis is, however, to make predictions about as yet unobserved or otherwise hidden random variables, such as the next data point in a sequence, or to infer values of missing data, and so forth. This chapter discusses and illustrates how the concept of confidence distributions may be lifted to such settings. Applications are given to predicting the next observation in a sequence, to regression models, kriging in geostatistics and time series models.
Introduction
In earlier chapters we have developed and discussed concepts and methods for confidence distributions for parameters of statistical models. Sometimes the goal of fitting and analysing a model to data is, however, to predict as yet unobserved random quantities, like the next observation in a sequence, a missing data point in a data matrix or inferring the distribution for a future Y0 in a regression model as a function of its associated covariates x0, and so on. For such a future or onobserved Y0 we may then wish to construct a predictive distribution, say Cpred(y0), with the property that Cpred(b)−Cpred(a) may be interpreted as the probability that a ≤ Y0 ≤ b. As such intervals for the unobserved Y0 with given coverage degree may be read off, via [C−1pred(α),C−1pred(1−α)], as for ordinary confidence intervals.
There is a tradition in some statistics literature to use ‘credibility intervals’ rather than ‘confidence intervals’, when the quantity in question for which one needs these intervals is a random variable rather than a parameter of a statistical model. This term is also in frequent use for Bayesian statistics, where there is no clear division in parameters and variables, as also model parameters are considered random. We shall, however, continue to use ‘confidence intervals’ and indeed ‘confidence distributions’ for these prediction settings.
We shall start our discussion for the case of predicting the next data point in a sequence of i.i.d. observations, in Section 12.2. Our frequentist predictive approach is different from the Bayesian one, where the model density is being integrated over the parameters with respect to their posterior distribution.
To save this book to your Kindle, first ensure [email protected] is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Find out more about the Kindle Personal Document Service.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.