An Efficient MCMC Algorithm to Sample Binary Matrices with Fixed Marginals

Norman D. Verhelst

doi:10.1007/s11336-008-9062-3

Published online by Cambridge University Press: 01 January 2025

Norman D. Verhelst*
Affiliation: CITO, National Institute for Educational Measurement
*: Requests for reprints should be sent to Norman D. Verhelst, CITO, National Institute for Educational Measurement, P.O. Box 1034, 6801 MG Arnhem, The Netherlands. E-mail: [email protected]; [email protected]

Abstract

Uniform sampling of binary matrices with fixed margins is known to be a difficult problem. Two classes of algorithms to sample from a distribution not too different from the uniform are studied in the literature: importance sampling and Markov chain Monte Carlo (MCMC). Existing MCMC algorithms converge slowly, require a long burn-in period, and yield highly dependent samples. Chen et al. developed an importance sampling algorithm that is highly efficient for relatively small tables. For larger but still moderate-sized tables (300×30), Chen et al.’s algorithm is less efficient. This article develops a new MCMC algorithm that converges much faster than the existing ones and that is more efficient than Chen et al.’s algorithm for large problems. Its stationary distribution is uniform. The algorithm is extended to the case of square matrices with fixed diagonal for applications in social network theory.
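To make the problem setting concrete, the following is a minimal sketch of the simple swap-based chain that is commonly used as a baseline for sampling binary matrices with fixed row and column sums. It is not the algorithm developed in this article, and the names `margins`, `swap_step`, and `run_swap_chain` are hypothetical, chosen only for illustration. The sketch assumes a matrix with at least two rows and two columns, stored as a list of lists of 0/1 values.

```python
import random


def margins(matrix):
    """Return (row sums, column sums) of a 0/1 matrix given as a list of lists."""
    rows = [sum(row) for row in matrix]
    cols = [sum(col) for col in zip(*matrix)]
    return rows, cols


def swap_step(matrix, rng):
    """Try one 2x2 checkerboard swap in place; return True if a swap happened."""
    # Pick two distinct rows and two distinct columns at random.
    r1, r2 = rng.sample(range(len(matrix)), 2)
    c1, c2 = rng.sample(range(len(matrix[0])), 2)
    a, b = matrix[r1][c1], matrix[r1][c2]
    c, d = matrix[r2][c1], matrix[r2][c2]
    # Swappable only if the submatrix is [[1, 0], [0, 1]] or [[0, 1], [1, 0]].
    if a == d and b == c and a != b:
        # Flipping all four cells keeps every row sum and column sum unchanged.
        for r in (r1, r2):
            for col in (c1, c2):
                matrix[r][col] = 1 - matrix[r][col]
        return True
    # Otherwise the chain stays at the current matrix for this step.
    return False


def run_swap_chain(matrix, n_steps, seed=None):
    """Run n_steps swap attempts on a copy of matrix and return the result."""
    rng = random.Random(seed)
    current = [row[:] for row in matrix]
    for _ in range(n_steps):
        swap_step(current, rng)
    return current


if __name__ == "__main__":
    start = [[1, 0, 1],
             [0, 1, 0],
             [1, 1, 0]]
    sample = run_swap_chain(start, n_steps=1000, seed=42)
    print(margins(sample) == margins(start))  # True: margins are preserved
```

Each step of such a chain changes at most four cells, so successive matrices are strongly dependent, which is consistent with the slow convergence and long burn-in attributed above to existing MCMC algorithms.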

Keywords

MCMC; Rasch model; nonparametric tests; importance sampling; social networks

Type: Theory and Methods
Information: Psychometrika, Volume 73, Issue 4, December 2008, pp. 705–728

DOI: https://doi.org/10.1007/s11336-008-9062-3
Copyright: Copyright © 2008 The Psychometric Society

Footnotes

I am indebted to my colleague Gunter Maris for his suggestion to add a Metropolis–Hastings step as the finishing touch of the algorithm.

References

Besag, J., & Clifford, P. (1989). Generalized Monte Carlo significance tests. Biometrika, 76, 633–642.

Chen, Y. (2006). Simple existence conditions for zero-one matrices with at most one structural zero in each row and column. Discrete Mathematics, 306, 2870–2877.

Chen, Y., Diaconis, P., Holmes, S., & Liu, J. (2005). Sequential Monte Carlo methods for statistical analysis of tables. Journal of the American Statistical Association, 100, 109–120.

Chen, Y., & Small, D. (2005). Exact tests for the Rasch model via sequential importance sampling. Psychometrika, 70, 11–30.

Connor, E., & Simberloff, D. (1979). The assembly of species communities: chance or competition. Ecology, 60, 1132–1140.

Gale, D. (1957). A theorem on flows in networks. Pacific Journal of Mathematics, 7, 1073–1082.

Guttorp, P. (1995). Stochastic modeling of scientific data. London: Chapman and Hall.

Hastings, W.K. (1970). Monte Carlo sampling methods using Markov chains and their applications. Biometrika, 57, 97–109.

Kong, A., Liu, J., & Wong, W. (1994). Sequential imputations and Bayesian missing data problems. Journal of the American Statistical Association, 89, 278–288.

Marshall, A., & Olkin, I. (1979). Inequalities: theory of majorization and its applications. San Diego: Academic Press.

Metropolis, N., Rosenbluth, A.W., Rosenbluth, M.N., Teller, A.H., & Teller, E. (1953). Equation of state calculations by fast computing machines. Journal of Chemical Physics, 21, 1087–1091.

Musalem, A., Bradlow, E., & Raju, J. (2008, in press). Bayesian estimation of random-coefficients models using aggregate data. Journal of Applied Econometrics.

Ponocny, I. (2001). Nonparametric goodness-of-fit tests for the Rasch model. Psychometrika, 66, 437–460.

Prabhu, N. (1965). Stochastic processes. Basic theory and its applications. New York: Macmillan.

Rao, A., Jana, R., & Bandyopadhyay, S. (1996). A Markov chain Monte Carlo method for generating random (0,1)-matrices with given marginals. Sankhya, Series A, 58, 225–242.

Roberts, A., & Stone, L. (1990). Island sharing by archipelago species. Oecologia, 83, 560–567.

Ryser, H. (1957). Combinatorial properties of matrices with zeros and ones. The Canadian Journal of Mathematics, 9, 371–377.

Ryser, H. (1963). Combinatorial mathematics. Carus mathematical monographs. Washington: The Mathematical Association of America.

Snijders, T. (1991). Enumeration and simulation for 0-1 matrices with given marginals. Psychometrika, 56, 397–417.

Tanner, M.A. (1996). Tools for statistical inference (3rd ed.). New York: Springer.

Wasserman, S. (1977). Random directed graph distributions and the triad census in social networks. Journal of Mathematical Sociology, 5, 61–86.
