
4 - Model selection for contingency tables with algebraic statistics

from Part I - Contingency tables

Published online by Cambridge University Press:  27 May 2010

Paolo Gibilisco, Università degli Studi di Roma 'Tor Vergata'
Eva Riccomagno, Università degli Studi di Genova
Maria Piera Rogantin, Università degli Studi di Genova
Henry P. Wynn, London School of Economics and Political Science

Abstract

Goodness-of-fit tests based on chi-square approximations are commonly used in the analysis of contingency tables. Results from algebraic statistics combined with MCMC methods provide alternatives to the chi-square approximation. However, a model selection procedure usually considers a large number of models, so extensive simulations would be necessary. We show how the simulation effort can be reduced by an appropriate analysis of the Gröbner bases involved.

Introduction

Categorical data occur in many different areas of statistical applications. The analysis usually concentrates on detecting the dependence structure between the random variables involved. Log-linear models are adopted to describe such association patterns (see Bishop et al. 1995, Agresti 2002), and model selection methods are used to find the model from this class that fits the data best in a given sense. Often, goodness-of-fit tests for log-linear models are applied, which involve chi-square approximations for the distribution of the test statistic. If the table is sparse, such an approximation might fail. By combining methods from computational commutative algebra and from statistics, Diaconis and Sturmfels (1998) provide the background for alternative tests. They use the MCMC approach to obtain a sample from the conditional distribution of a discrete exponential family given the sufficient statistic. In particular, Gröbner bases are used for the construction of the Markov chain. This approach has been applied to a number of tests for the analysis of contingency tables (Rapallo 2003, Rapallo 2005, Krampe and Kuhnt 2007). Such tests have turned out to be a valuable addition to traditional exact and asymptotic tests.
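
To make the MCMC idea concrete, the following is a minimal sketch for the simplest case only: the independence model in a two-way table, where the sufficient statistic is the pair of row and column sums and the well-known basic moves (add 1 to two diagonal cells of a 2 x 2 subtable, subtract 1 from the other two) connect all tables with the same margins. The function names, the default chain length and the burn-in are illustrative assumptions, not taken from the chapter, and the sketch does not cover the general log-linear models or the Gröbner basis computations discussed here.

```python
import random


def chi_square(table):
    """Pearson chi-square statistic for independence in a two-way table."""
    row_sums = [sum(row) for row in table]
    col_sums = [sum(col) for col in zip(*table)]
    total = sum(row_sums)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_sums[i] * col_sums[j] / total
            if expected > 0:
                stat += (observed - expected) ** 2 / expected
    return stat


def metropolis_step(table, rng):
    """One Metropolis step on the tables sharing the margins of `table`.

    Mutates `table` in place. The target is the conditional distribution
    under independence, proportional to 1 / prod(n_ij!).
    """
    # Ordered pairs of distinct rows and columns: swapping i and k gives the
    # move with opposite sign, so the proposal is symmetric.
    i, k = rng.sample(range(len(table)), 2)
    j, l = rng.sample(range(len(table[0])), 2)
    # The move adds 1 to cells (i, j), (k, l) and subtracts 1 from (i, l), (k, j).
    if table[i][l] == 0 or table[k][j] == 0:
        return  # the move would leave the set of non-negative tables: stay put
    # Ratio of target probabilities, new table over current table.
    ratio = (table[i][l] * table[k][j]) / ((table[i][j] + 1) * (table[k][l] + 1))
    if rng.random() < ratio:
        table[i][j] += 1
        table[k][l] += 1
        table[i][l] -= 1
        table[k][j] -= 1


def mcmc_p_value(table, steps=20000, burn_in=2000, seed=0):
    """Estimate the conditional p-value of the chi-square statistic by MCMC."""
    rng = random.Random(seed)
    current = [list(row) for row in table]  # work on a copy
    observed_stat = chi_square(table)
    hits = 0
    for step in range(steps + burn_in):
        metropolis_step(current, rng)
        # Small tolerance guards against floating-point ties.
        if step >= burn_in and chi_square(current) >= observed_stat - 1e-9:
            hits += 1
    return hits / steps


if __name__ == "__main__":
    sparse_table = [[3, 1, 2], [0, 4, 1], [2, 2, 5]]  # made-up example data
    print(mcmc_p_value(sparse_table))
```

The estimated p-value is the fraction of sampled tables whose statistic is at least as large as the observed one. Repeating such a simulation for every candidate model is what makes model selection expensive, which is the cost the chapter aims to reduce.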

Type: Chapter
Publisher: Cambridge University Press
Print publication year: 2009
