The rescaled Pólya urn: local reinforcement and chi-squared goodness-of-fit test

Giacomo Aletti; Irene Crimaldi

doi:10.1017/apr.2021.56

The rescaled Pólya urn: local reinforcement and chi-squared goodness-of-fit test

Part of: Parametric inference Markov processes Limit theorems

Published online by Cambridge University Press: 18 October 2022

Giacomo Aletti

and

Irene Crimaldi

Show author details

Giacomo Aletti*: Affiliation:
Università degli Studi di Milano
Irene Crimaldi*: Affiliation:
IMT School for Advanced Studies Lucca
*: *Postal address: ADAMSS Center, Università degli Studi di Milano, Milan, Italy.
**Postal address: IMT School for Advanced Studies Lucca, Lucca, Italy.

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

Motivated by recent studies of big samples, this work aims to construct a parametric model which is characterized by the following features: (i) a ‘local’ reinforcement, i.e. a reinforcement mechanism mainly based on the last observations, (ii) a random persistent fluctuation of the predictive mean, and (iii) a long-term almost sure convergence of the empirical mean to a deterministic limit, together with a chi-squared goodness-of-fit result for the limit probabilities. This triple purpose is achieved by the introduction of a new variant of the Eggenberger–Pólya urn, which we call the rescaled Pólya urn. We provide a complete asymptotic characterization of this model, pointing out that, for a certain choice of the parameters, it has properties different from the ones typically exhibited by the other urn models in the literature. Therefore, beyond the possible statistical application, this work could be interesting for those who are concerned with stochastic processes with reinforcement.

Keywords

Empirical mean central limit theorem chi-squared test compact Markov chain Pólya urn predictive mean preferential attachment reinforcement learning reinforced stochastic process urn model

MSC classification

Primary: 60F05: Central limit and other weak theorems 62F03: Hypothesis testing

Secondary: 60J05: Discrete-time Markov processes on general state spaces 62F05: Asymptotic properties of tests

Type: Original Article
Information: Advances in Applied Probability , Volume 54 , Issue 3 , September 2022 , pp. 849 - 879

DOI: https://doi.org/10.1017/apr.2021.56 [Opens in a new window]
Copyright: © The Author(s), 2022. Published by Cambridge University Press on behalf of Applied Probability Trust

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Aletti, G., Crimaldi, I. and Ghiglietti, A. (2017). Synchronization of reinforced stochastic processes with a network-based interaction. Ann. Appl. Prob. 27, 3787–3844.CrossRef Google Scholar

Aletti, G., Crimaldi, I. and Ghiglietti, A. (2019). Networks of reinforced stochastic processes: asymptotics for the empirical means. Bernoulli 25, 3339–3378.CrossRef Google Scholar

Aletti, G., Crimaldi, I. and Ghiglietti, A. (2020). Interacting reinforced stochastic processes: statistical inference based on the weighted empirical means. Bernoulli 26, 1098–1138.CrossRef Google Scholar

Aletti, G., Crimaldi, I. and Saracco, F. (2021). A model for the Twitter sentiment curve. PLOS ONE 16, 1–28.CrossRef Google Scholar

Aletti, G., Ghiglietti, A. and Paganoni, A. M. (2013). Randomly reinforced urn designs with prespecified allocations. J. Appl. Prob. 50, 486–498.CrossRef Google Scholar

Aletti, G., Ghiglietti, A. and Rosenberger, W. F. (2018). Nonparametric covariate-adjusted response-adaptive design based on a functional urn model. Ann. Statist. 46, 3838–3866.CrossRef Google Scholar

Aletti, G., Ghiglietti, A. and Vidyashankar, A. N. (2018). Dynamics of an adaptive randomly reinforced urn. Bernoulli 24, 2204–2255.CrossRef Google Scholar

Bergh, D. (2015). Sample size and chi-squared test of fit—a comparison between a random sample approach and a chi-square value adjustment method using Swedish adolescent data. In Pacific Rim Objective Measurement Symposium (PROMS) 2014 Conference Proceedings, eds Q. Zhang and H. Yang, Springer, Berlin, Heidelberg, pp. 197–211.CrossRef Google Scholar

Berti, P., Crimaldi, I., Pratelli, L. and Rigo, P. (2011). A central limit theorem and its applications to multicolor randomly reinforced urns. J. Appl. Prob. 48, 527–546.CrossRef Google Scholar

Berti, P., Crimaldi, I., Pratelli, L. and Rigo, P. (2016). Asymptotics for randomly reinforced urns with random barriers. J. Appl. Prob. 53, 1206–1220.CrossRef Google Scholar

Bertoni, D. et al. (2018). Farmland use transitions after the CAP greening: a preliminary analysis using Markov chains approach. Land Use Policy 79, 789–800.CrossRef Google Scholar

Caldarelli, G., Chessa, A., Crimaldi, I. and Pammolli, F. (2013). Weighted networks as randomly reinforced urn processes. Phys. Rev. E 87, 020106.CrossRef Google Scholar PubMed

Caron, F. et al. (2017). Generalized Pólya urn for time-varying Pitman–Yor processes. J. Machine Learning Res. 18, 1–32.Google Scholar

Chanda, K. C. (1999). Chi-squared tests of goodness-of-fit for dependent observations. In Asymptotics, Nonparametrics, and Time Series, CRC Press, Boca Raton, FL, pp. 743–756.Google Scholar

Chen, M.-R. and Kuba, M. (2013). On generalized Pólya urn models. J. Appl. Prob. 50, 1169–1186.CrossRef Google Scholar

Chessa, A., Crimaldi, I., Riccaboni, M. and Trapin, L. (2014). Cluster analysis of weighted bipartite networks: a new copula-based approach. PLOS ONE 9, 1–12.CrossRef Google Scholar PubMed

Collevecchio, A., Cotar, C. and LiCalzi, M. (2013). On a preferential attachment and generalized Pólya’s urn model. Ann. Appl. Prob. 23, 1219–1253.CrossRef Google Scholar

Crimaldi, I. (2016). Central limit theorems for a hypergeometric randomly reinforced urn. J. Appl. Prob. 53, 899–913.CrossRef Google Scholar

Crimaldi, I. (2016). Introduzione alla nozione di convergenza stabile e sue varianti. Unione Matematica Italiana, Bologna.Google Scholar

Crimaldi, I., Dai Pra, P., Louis, P.-Y. and Minelli, I. G. (2019). Synchronization and functional central limit theorems for interacting reinforced random walks. Stoch. Process. Appl. 129, 70–101.CrossRef Google Scholar

Crimaldi, I., Dai Pra, P. and Minelli, I. G. (2016). Fluctuation theorems for synchronization of interacting Pólya’s urns. Stoch. Process. Appl. 126, 930–947.CrossRef Google Scholar

Crimaldi, I., Letta, G. and Pratelli, L. (2007). A strong form of stable convergence. In Séminaire de Probabilités XL, Springer, Berlin, Heidelberg, pp. 203–225.CrossRef Google Scholar

Dai Pra, P., Louis, P.-Y. and Minelli, I. G. (2014). Synchronization via interacting reinforcement. J. Appl. Prob. 51, 556–568.CrossRef Google Scholar

Doeblin, W. and Fortet, R. (1937). Sur des chanes à liaisons complètes. Bull. Soc. Math. France 65, 132–148.CrossRef Google Scholar

Eggenberger, F. and Pólya, G. (1923). Über die Statistik verketteter Vorgänge. Z. Angew. Math. Mech. 3, 279–289.CrossRef Google Scholar

Gasser, T. (1975). Goodness-of-fit tests for correlated data. Biometrika 62, 563–570.CrossRef Google Scholar

Ghiglietti, A. and Paganoni, A. M. (2014). Statistical properties of two-color randomly reinforced urn design targeting fixed allocations. Electron. J. Statist. 8, 708–737.CrossRef Google Scholar

Ghiglietti, A., Vidyashankar, A. N. and Rosenberger, W. F. (2017). Central limit theorem for an adaptive randomly reinforced urn model. Ann. Appl. Prob. 27, 2956–3003.CrossRef Google Scholar

Gleser, L. J. and Moore, D. S. (1983). The effect of dependence on chi-squared and empiric distribution tests of fit. Ann. Statist. 11, 1100–1108.CrossRef Google Scholar

Guivarc’h, Y. and Hardy, J. (1988). Théorèmes limites pour une classe de chaînes de Markov et applications aux difféomorphismes d’Anosov. Ann. Inst. H. Poincaré Prob. Statist. 24, 73–98.Google Scholar

Hairer, M. Ergodic properties of Markov processes. Available at http://www.hairer.org/notes/Markov.pdf.Google Scholar

Hall, P. and Heyde, C. C. (1980). Martingale Limit Theory and Its Application. Academic Press, New York.Google Scholar

Holmes, M. and Sakai, A. (2007). Senile reinforced random walks. Stoch. Process. Appl. 117, 1519–1539.CrossRef Google Scholar

Ieva, F., Paganoni, A. M., Pigoli, D. and Vitelli, V. (2013). Multivariate functional clustering for the morphological analysis of electrocardiograph curves. J. R. Statist. Soc. C [Appl. Statist.] 62, 401–418.CrossRef Google Scholar

Ionescu Tulcea, C. T. and Marinescu, G. (1950). Théorie ergodique pour des classes d’opérations non complètement continues. Ann. Math. 52, 140–147.CrossRef Google Scholar

Knoke, D., Bohrnstedt, G. W. and Potter Mee, A. (2002). Statistics for Social Data Analysis. F. E. Peacock Publishers, Itasca, IL.Google Scholar

Laruelle, S. and Pagés, G. (2013). Randomized urn models revisited using stochastic approximation. Ann. Appl. Prob. 23, 1409–1436.CrossRef Google Scholar

Mahmoud, H. M. (2009). Pólya Urn Models. CRC Press, Boca Raton, FL.Google Scholar

Métivier, M. (1982). Semimartingales. Walter de Gruyter, Berlin.CrossRef Google Scholar

Meyn, S. and Tweedie, R. L. (2009). Markov Chains and Stochastic Stability, 2nd edn. Cambridge University Press.CrossRef Google Scholar

Micheletti, A. et al. (2019). A weighted

$\chi^2$ test to detect the presence of a major change point in non-stationary Markov chains. Submitted.Google Scholar

Norman, M. F. (1972). Markov Processes and Learning Models. Academic Press, New York.Google Scholar

Pan, W. (2002). Goodness-of-fit tests for GEE with correlated binary data. Scand. J. Statist. 29, 101–110.CrossRef Google Scholar

Pei, Y., Tang, M.-L. and Guo, J. (2008). Testing the equality of two proportions for combined unilateral and bilateral data. Commun. Statist. Simul. Comput. 37, 1515–1529.CrossRef Google Scholar

Pemantle, R. (2007). A survey of random processes with reinforcement. Prob. Surveys 4, 1–79.CrossRef Google Scholar

Radlow, R. and Alf, E. F., Jr. (1975). An alternate multinomial assessment of the accuracy of the

$\chi^2$ test of goodness of fit. J. Amer. Statist. Assoc. 70, 811–813.Google Scholar

Rao, J. N. K. and Scott, A. J. (1981). The analysis of categorical data from complex sample surveys: chi-squared tests for goodness of fit and independence in two-way tables. J. Amer. Statist. Assoc. 76, 221–230.CrossRef Google Scholar

Rényi, A. (1963). On stable sequences of events. Sankhyā A 25, 293 302.Google Scholar

Robbins, H. and Siegmund, D. (1971). A convergence theorem for non negative almost supermartingales and some applications. In Optimizing Methods in Statistics, Academic Press, New York, pp. 233–257.Google Scholar

Sahasrabudhe, N. (2016). Synchronization and fluctuation theorems for interacting Friedman urns. J. Appl. Prob. 53, 1221–1239.CrossRef Google Scholar

Sherman, J. and Morrison, W. J. (1950). Adjustment of an inverse matrix corresponding to a change in one element of a given matrix. Ann. Math. Statist. 21, 124–127.CrossRef Google Scholar

Tang, M.-L., Pei, Y.-B., Wong, W.-K. and Li, J.-L. (2012). Goodness-of-fit tests for correlated paired binary data. Statist. Methods Med. Res. 21, 331–345.CrossRef Google Scholar PubMed

Tharwat, A. (2018). Independent component analysis: an introduction. Appl. Comput. Informat. Google Scholar

Williams, D. (1991). Probability with Martingales. Cambridge University Press.CrossRef Google Scholar

Xu, D. and Tian, Y. (2015). A comprehensive survey of clustering algorithms. Ann. Data Sci. 2, 165–193.CrossRef Google Scholar

Article contents

The rescaled Pólya urn: local reinforcement and chi-squared goodness-of-fit test

Abstract

Keywords

MSC classification

Access options

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests