Incomplete Tests of Conditional Association for the Assessment of Model Assumptions

Rudy Ligtvoet

doi:10.1007/s11336-022-09841-1

Incomplete Tests of Conditional Association for the Assessment of Model Assumptions

Published online by Cambridge University Press: 01 January 2025

Rudy Ligtvoet

Show author details

Rudy Ligtvoet*: Affiliation:
University of Cologne, Germany
*: Correspondence should be made to Rudy Ligtvoet, Department Erziehungs- und Sozialwissenschaften, University of Cologne, Germany, Gronewaldstr. 2a, 50931Cologne, Deutschland. Email: [email protected]; URL: https://sites.google.com/site/rligtv/

Article contents

Abstract
Properties of Multivariate Dependence
Incomplete Tests of Conditional Association
Sensitivity to Model Violations
Discussion
Funding
Footnotes
References

Rights & Permissions

Abstract

Many of the models that have been proposed for response data share the assumptions that define the monotone homogeneity (MH) model. Observable properties that are implied by the MH model allow for these assumptions to be tested. For binary response data, the most restrictive of these properties is called conditional association (CA). All the other properties considered can be considered incomplete tests of CA that alleviate the practical limitations encountered when assessing the MH model assumptions using CA. It is found that the assessment of the MH model assumptions with an incomplete test of CA, rather than CA, is generally associated with a substantial loss of information. We also look at the sensitivity of the observable properties to model violation and discuss the implications of the results. It is argued that more research is required about the extent to which the assumptions and the model specifications influence the inferences made from response data.

Keywords

Conditional association manifest monotonicity model complexity monotone homogeneity model monotone likelihood ratio multivariate totally positive of order 2 nonnegative partial correlations scalability coefficient strongly positive orthant dependency

Type: Theory and Methods
Information: Psychometrika , Volume 87 , Issue 4 , December 2022 , pp. 1214 - 1237

DOI: https://doi.org/10.1007/s11336-022-09841-1 [Opens in a new window]
Creative Commons: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Copyright: Copyright © 2022 The Author(s)

In educational and psychological testing, latent variable models are used to account for the dependencies between the responses to multiple test items, where no one item by itself accurately represents the attribute that the test is supposed to measure. The purpose of the model is to provide an estimate of the latent variable, based on the observed responses to the test items. Many different latent variable models are used in practice, each with their own particular set of assumptions, and applicable to different type of inferences. For example, the unidimensional (UD) Rasch (Reference Rasch1960) allows for the calibration of all respondents on a common linear scale (Kelderman, Reference Kelderman1988; Wright, Reference Wright1977), which makes it useful for applications where different subsets of items are administered to different groups of respondents. The model also need to provide an accurate goodness of fit to the observed responses, and here too there may be an abundance of choice. For the Rasch model, tests of goodness of fit have been proposed that including Andersen’s (Reference Andersen1973) likelihood ratio test (Glas & Verhelst, Reference Glas, Verhelst, Fischer and Molenaar1995), nonparametric tests (Ponocny, Reference Ponocny2001; Verhelst et al., Reference Verhelst, Hatzinger and Mair2007), tests for specific model violations (Glas, Reference Glas1988; Van den Wollenberg, Reference Van den Wollenberg1979), and tests specifically designed to deal with sparse observations (Maydeu-Olivares & Joe, Reference Maydeu-Olivares and Joe2005, see Debelak, Reference Debelak2019; Suáres-Falcón & Glas, Reference Suáres-Falcón and Glas2003 for an overview). Each of these tests assesses different dependencies in the observed response distributions and may be sensitive to different model violations. For example, Glas (Reference Glas1988) proposed a statistics, specifically designed to target the assumption of local independence (LI) by utilizing the information contained in the conditional bivariate distributions of pairs of items, given each sum score. Although found to be powerful in detecting violations of the Rasch model assumptions, for larger numbers of items the statistic is computationally demanding and the observations to which the statistic pertains become more sparse, limiting the asymptotic properties of the test statistic.

A similar problem occurs in factor analysis, where the estimation of the expected frequencies of the discrete responses involves high-dimensional (numerical) integration which becomes cumbersome for more items. Jöreskog and Moustaki (Reference Jöreskog and Moustaki2001) and Katsikatsou et al. (Reference Katsikatsou, Moustaki, Yang-Wallentin and Jöreskog2012) proposed a test statistics based only on the second-order moment to overcome these difficulties, but this procedure is also associated with loss of power for detecting model violations. These examples illustrate some of the tradeoff involved in the goodness-of-fit assessment when analyzing response data.

In this paper, the main focus is on Mokken’s (Reference Mokken1971) model of monotone homogeneity (MH) for binary test data. In addition to the assumptions UD and LI, the model assumes latent monotonicity (M). The MH model is nonparametric in the sense that it does not require the response functions to belong to a particular parametric family. Further, the MH model is useful for applications that require ordinal inferences, as it implies a stochastic ordering on the latent variable by the sum score across the items (Ghurye & Wallace, Reference Ghurye and Wallace1959; Grayson, Reference Grayson1988; Huynh, Reference Huynh1994; Ünlü, Reference Ünlü2008). The assumptions that constitute the MH model are shared by a wider range of models for response data, including the Rasch model and the three-parameter logistic model (Lord & Novick, Reference Lord and Novick1968). These assumptions imply that all covariances between the test items are nonnegative. This testable property of the MH model for pairs of items is routinely used to assess the validity of the MH model assumption by means of inspecting the scalability coefficients (Loevinger, Reference Loevinger1948; Mokken, Reference Mokken1971; Warrens, Reference Warrens2008) in Mokken scale analysis (Mokken & Lewis, Reference Mokken and Lewis1982; Molenaar & Sijtsma, Reference Molenaar and Sijtsma2000; Sijtsma & Molenaar, Reference Sijtsma and Molenaar2002; Van der Ark, Reference Van der Ark2007). In Mokken scale analysis, any scalability coefficient that is below a predetermined lower bound (usually at 0.30) is flagged as a model violation that discredits the MH model, and any model that is a special case of the MH model (Junker & Sijtsma, Reference Junker and Sijtsma2001).

A problem with Mokken scale analysis based on the scalability coefficients is the somewhat arbitrary choice for the lower bounds of the coefficients. For example, Hemker et al. (Reference Hemker, Sijtsma and Molenaar1995) found that the default value of 0.30 does not always suffice to recover a unidimensional scale. Smits et al. (Reference Smits, Timmerman and Meijer2012) also warn to be cautious about making inferences about the dimensionality of a test based on an automated evaluation of scalability coefficients. Tighter lower bounds for the scalability coefficients can be obtained from the requirement of nonnegative partial correlations (NPC; Ellis, Reference Ellis2014, Reference Ellis, Millsap, Bolt, van der Ark and Wang2015; Brusco et al., Reference Brusco, Köhn and Steinley2015). Like the scalability coefficients, the partial correlation is implied to be nonnegative under the MH model, but the property NPC takes into consideration the higher-order moments contained in the trivariate distributions of item triplets. As a consequence, a violation flagged by the property of NPC may remain undetected when only evaluating the covariances between item pairs.

Beside the scalability coefficients and NPC, other observable properties have been proposed that allow the assumptions of the MH model to be tested. For example, the property of manifest monotonicity (MM; Junker, Reference Junker1993; Junker & Sijtsma, Reference Junker and Sijtsma2000) proposes that the regression of each of the item variables is a non-decreasing function of the sum of the remaining variables or rest score. Holland and Rosenbaum (Reference Holland and Rosenbaum1986) provide an overview of properties of multivariate positive dependence that are implied by the MH model, with conditional association (CA; Holland and Rosenbaum, Reference Holland and Rosenbaum1986; Rosenbaum, Reference Rosenbaum1984) being the most restrictive of these properties for binary response data. Below, we show that the observable property CA also implies MM and NPC (Ellis, Reference Ellis, Millsap, Bolt, van der Ark and Wang2015). Because the MH model cannot be directly evaluated, we rely on these observable properties to make inferences about the validity of the MH model assumptions (Sijtsma & Van der Ark, Reference Sijtsma and Van der Ark2017). A testable latent class version of the MH model was proposed by Croon (Reference Croon1990, Reference Croon1991); see also Hoijtink and Molenaar (Reference Hoijtink and Molenaar1997) and Vermunt (Reference Vermunt2001), which requires a prior specification of the number of discrete latent classes. Global tests for some observable properties implied by the MH model have also been proposed. These global tests include both likelihood ratio tests for CA and MM (Bartolucci & Forcina, Reference Bartolucci and Forcina2005; Tijmstra et al., Reference Tijmstra, Hessen, Van der Heijden and Sijtsma2013) and Bayes factors for MM (Tijmstra et al., Reference Tijmstra, Hoijtink and Sijtsma2015).

The next section starts with the introduction of the various observable properties that are implied by the MH model, and it will be shown how these properties are hierarchically related, with the property of CA imposing the tightest constants on the distribution of item responses. Because all the observable properties are implied by CA, each of these properties can be considered to be an incomplete test of CA (Maraun et al., Reference Maraun, Jackson, Luccock, Belfer and Chrisjohn1998). Due to the number of restrictions imposed by CA and sparse observations associated with many of these restrictions, it is argued that the practical assessment of the MH model assumptions relies on incomplete tests for CA. In Sect. 2, we investigate the loss of information associated when, instead of CA, an incomplete test of CA is used, for which the complexity of the observable properties is defined as the agreement of the properties with a wider range of patterns of data. In Sect. 3, we look at the sensitivity of the various properties to violations of the MH model assumptions. The results of these studies are summarized and discussed in Sect. 4 along with their implications.

1. Properties of Multivariate Dependence

In this section, seven distinct observable properties are defined for binary test data, all of which are implied by the MH model. Let $X = (X_{1}, \dots, X_{J})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}=(X_1,\ldots ,X_J)$$\end{document} be the random vector containing binary item response variables $X_{i}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$X_i$$\end{document} . Also, let $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{\Theta }$$\end{document} denote the random vector of latent variables, with

\begin{matrix} p (x) = P (X = x) = \int P (X = x | Θ = θ) d F (θ) . \end{matrix}

The assumption of LI states that the variables $X_{1}, \dots, X_{J}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$X_1,\ldots ,X_J$$\end{document} are locally or conditionally independent, given $Θ = θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{\Theta }=\varvec{\theta }$$\end{document} . Further, let $P (X_{i} = 1 | Θ = θ)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$P(X_i=1|\varvec{\Theta }=\varvec{\theta })$$\end{document} denote the ith response function, then the assumption M is satisfied whenever all J response functions are (element-wise) non-decreasing in $θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{\theta }$$\end{document} , and assumption UD holds if $Θ = Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{\Theta }=\Theta $$\end{document} (i.e., scalar valued). The MH model is defined by the assumptions UD, LI, and M (Mokken, Reference Mokken1971).

It will be shown how the observable properties are related to each other, with property CA being the most restrictive of these properties. Next, several practical limitations will be discussed that relate to the number of inequality restrictions the properties impose on $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} and the problem of sparseness of observation. Finally, to account for these practical limitations, the assessment of the trivariate distributions of all triplets of item is considered, adding two more distinct properties for assessing the MH model assumption.

1.1. Observable Properties

Let $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} be a vector, which has as its elements $p_{k} = p (x)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$p_k=p(\varvec{x})$$\end{document} , arranged in lexicographical order of $x$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{x}$$\end{document} (i.e., scores on the right run faster from zero to one). Then, $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} contains the multinomial probabilities parameters for the distribution of the frequencies of $X = x$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}=\varvec{x}$$\end{document} , with the restriction $1^{'} p = 1$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{1}^\prime \varvec{p}=1$$\end{document} (Holland, Reference Holland1990). Each of the observable properties that are discussed below differs with respect to the additional restrictions they impose on $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} .

1.1.1. (Conditionally) Associated Random Variables

Esary et al. (Reference Esary, Proschan and Walkup1967) defined $X$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}$$\end{document} to be associated (A), if the covariance between any pair of binary non-decreasing functions of $X$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}$$\end{document} is nonnegative. A conditional version of property A was proposed by Holland and Rosenbaum (Reference Holland and Rosenbaum1986) and Rosenbaum (Reference Rosenbaum1984), where $X$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}$$\end{document} is said to be CA, if for any partition $X = (Y, Z)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}=(\varvec{Y},\varvec{Z})$$\end{document} , the variables $Y$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{Y}$$\end{document} are associated, given any arbitrary function of $Z$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{Z}$$\end{document} .

Assume that $p > 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}>\varvec{0}$$\end{document} , then CA can be concisely expressed in terms restricted log-odds ratios, as

(1)

\begin{matrix} K ln (M p) \geq 0, \end{matrix}

with $K = I_{v} \otimes (1, - 1, - 1, 1)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {K}}={\mathbf {I}}_v\otimes (1,-1,-1,1)$$\end{document} (Kronecker product), $I_{v}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {I}}_v$$\end{document} is the identity matrix of dimensions equal to the number of restrictions v imposed by CA, and $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} is a binary design matrix (Bartolucci & Forcina, Reference Bartolucci and Forcina2005). Each of the consecutive four rows of the matrix $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} in (1) correspond to a particular restriction imposed on $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} by property CA, with $v = (2^{d} - 1) J (J - 1) / 2$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$v=(2^d-1)J(J-1)/2$$\end{document} and $d = 2^{J - 2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$d=2^{J-2}$$\end{document} . For example, in case $J = 2$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=2$$\end{document} , $M = I_{4}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}={\mathbf {I}}_4$$\end{document} and (1) yields $ln p_{1} - ln p_{2} - ln p_{3} + ln p_{4} \geq 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\ln p_1-\ln p_2-\ln p_3+\ln p_4\ge 0$$\end{document} .

Walkup (Reference Walkup1968) characterized property A in terms of a collection of pairs of binary non-decreasing functions. For $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} , there are nine such pairs of functions. The constraints these functions impose correspond to restrictions on $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} that can be expressed as (1), with the matrix $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} equal to

(2)

\begin{matrix} (\begin{matrix} (1, 1) \otimes I_{4} \\ I_{2} \otimes (1, 1) \otimes I_{2} \\ I_{4} \otimes (1, 1) \\ I_{2} \otimes ({(1, 0)}^{'} \otimes (1, 1), I_{2}) \\ I_{2} \otimes (I_{2}, {(0, 1)}^{'} \otimes (1, 1)) \\ (I_{2} \otimes {(1, 0)}^{'} \otimes (1, 1), I_{4}) \\ (I_{4}, I_{2} \otimes {(0, 1)}^{'} \otimes (1, 1)) \\ ((1, 1) \otimes {(1, 0)}^{'} \otimes I_{2}, I_{4}) \\ (I_{4}, (1, 1) \otimes {(0, 1)}^{'} \otimes I_{2}) \end{matrix}) . \end{matrix}

\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\begin{aligned} \left[ \begin{array}{c} (1,1)\otimes {\mathbf {I}}_4\\ {\mathbf {I}}_2\otimes (1,1)\otimes {\mathbf {I}}_2\\ {\mathbf {I}}_4\otimes (1,1)\\ {\mathbf {I}}_2\otimes ((1,0)^\prime \otimes (1, 1),{\mathbf {I}}_2)\\ {\mathbf {I}}_2\otimes ({\mathbf {I}}_2, (0, 1)^\prime \otimes (1, 1))\\ ({\mathbf {I}}_2\otimes (1, 0)^\prime \otimes (1, 1),{\mathbf {I}}_4)\\ ({\mathbf {I}}_4, {\mathbf {I}}_2\otimes (0, 1)^\prime \otimes (1, 1))\\ ((1, 1)\otimes (1, 0)^\prime \otimes {\mathbf {I}}_2,{\mathbf {I}}_4)\\ ({\mathbf {I}}_4,(1, 1)\otimes (0, 1)^\prime \otimes {\mathbf {I}}_2) \end{array}\right] . \end{aligned}$$\end{document}

The last row in (2), for example, corresponds to the restriction

\begin{matrix} ln p_{1} - ln (p_{3} + p_{5} + p_{7}) - ln p_{2} + ln (p_{4} + p_{6} + p_{8}) \geq 0, \end{matrix}

or equivalently, Cov $(1 - (1 - X_{1}) (1 - X_{2}), X_{3}) \geq 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$(1-(1-X_1)(1-X_2),X_3)\ge 0$$\end{document} . For $J = 4$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=4$$\end{document} , Walkup (Reference Walkup1968, pp. 1400–1401) enumerated $v = 99$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$v=99$$\end{document} pairs of binary non-decreasing functions to characterize property A.

1.1.2. Multivariate Totally Positive

Next, consider the property of multivariate totally positivity of order 2 (MTP $_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$_2$$\end{document} ; Karlin & Rinott, Reference Karlin and Rinott1980) for a random vector $U$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{U}$$\end{document} . The density $f (u)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$f(\varvec{u})$$\end{document} is said to be ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , if $f (u) f (v) \leq f (max (u, v)) f (min (u, v))$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$f(\varvec{u})f(\varvec{v})\le f(\max (\varvec{u},\varvec{v}))f(\min (\varvec{u},\varvec{v}))$$\end{document} , for all outcomes $u, v$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{u},\varvec{v}$$\end{document} , and with the minimum and maximum applied element-wise. For bivariate densities, the property is called ${TP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {TP}_2$$\end{document} and corresponds to a monotone likelihood ratio ordering (MLR) in case the joint density is strictly positive (Karlin, Reference Karlin1968; Sarkar, Reference Sarkar1969). This MLR property is relevant as it is the property used by Grayson (Reference Grayson1988) to establish the stochastic ordering on $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\Theta $$\end{document} by the sum scores $S = X_{1} + \dots + X_{J}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$S=X_1+\cdots +X_J$$\end{document} under the MH model.

For the binary random vector $X$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}$$\end{document} , assume that $p > 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}>\varvec{0}$$\end{document} . Then, (1) can also be used as an expression for ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , by omitting the matrix $W$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {W}}$$\end{document} in the algorithm by Bartolucci and Forcina (Reference Bartolucci and Forcina2005, p. 41) for constructing matrix $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} , and adjusting v accordingly. The ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} property then corresponds to the requirement that Cov $(X_{i}, X_{j} | Z = z) \geq 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$(X_i,X_j|\varvec{Z}=\varvec{z})\ge 0$$\end{document} , for any partition $X = (X_{i}, X_{j}, Z)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}=(X_i,X_j,\varvec{Z})$$\end{document} and any vector $z$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{z}$$\end{document} .

For a multidimensional vector $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{\Theta }$$\end{document} , Holland and Rosenbaum (Reference Holland and Rosenbaum1986, Theorem 7) showed that the assumptions of LI and M imply that $X$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}$$\end{document} satisfies the property of ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , if $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{\Theta }$$\end{document} is ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} . Also, $X$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}$$\end{document} is ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , whenever $(X, Θ)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$(\varvec{X},\varvec{\Theta })$$\end{document} satisfying a particular higher-order factor structure (Ellis, Reference Ellis, Millsap, Bolt, van der Ark and Wang2015). These results imply that the property of ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} is not confined to unidimensional models only.

1.1.3. Nonnegative Covariances

Equation (1) can also be used to restrict the bivariate distributions of pairs of item variables $X_{i}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$X_i$$\end{document} and $X_{j}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$X_j$$\end{document} , such that Cov $(X_{i}, X_{j}) \geq 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$(X_i,X_j)\ge 0$$\end{document} , for all $1 \leq i < j \leq J$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$1\le i<j\le J$$\end{document} . Let

\begin{matrix} T_{ij} = ⨂_{k = 1}^{J} T_{ijk}, with T_{ijk} = (\begin{matrix} I_{2} & if either i = k or j = k \\ (1, 1) & otherwise, \end{matrix}) \end{matrix}

and let the matrix $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} be obtained by stacking on top of one another all matrices $T_{ij}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {T}}_{ij}$$\end{document} . With this matrix $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} and $v = J (J - 1) / 2$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$v=J(J-1)/2$$\end{document} , expression (1) imposes the restriction of the property of nonnegative covariances (NC), which implies that all the scalability coefficients are nonnegative (Mokken, Reference Mokken1971; Sijtsma & Molenaar, Reference Sijtsma and Molenaar2002).

1.1.4. Manifest Monotonicity

The observable property MM pertains to the regression of each $X_{i}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$X_i$$\end{document} on $S - X_{i}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$S-X_i$$\end{document} , with $S = X_{1} + \dots + X_{J}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$S=X_1+\cdots +X_J$$\end{document} . Junker (Reference Junker1993) showed that MM provides a partial characterization of a general class of latent variable models that include the MH model. To show CA implies MM, let $R = S - X_{i} - X_{j}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$R=S-X_i-X_j$$\end{document} . Then, CA implies for all $R = r$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$R=r$$\end{document} , that

\begin{matrix} \begin{matrix} P (X_{i} = 0, X_{j} = 0, R = r) P (X_{i} = 1, X_{j} = 1, R = r) \\ \geq P (X_{i} = 0, X_{j} = 1, R = r) P (X_{i} = 1, X_{j} = 0, R = r), \end{matrix} \end{matrix}

or equivalently $P (X_{i} = 1 | S - X_{i} = r) \leq P (X_{i} = 1 | S - X_{i} = r + 1)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$P(X_i=1|S-X_i=r)\le P(X_i=1|S-X_i=r+1)$$\end{document} . The inequalities imposed by MM thus correspond to a selection of consecutive rows of $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} for CA. For example, for $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} , matrix $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} for MM becomes

(3)

\begin{matrix} (\begin{matrix} I_{2} \otimes (I_{2}, (1, 0) \otimes {(0, 1)}^{'}) \\ I_{2} \otimes ((0, 1) \otimes {(1, 0)}^{'}, I_{2}) \\ (I_{4}, I_{2} \otimes (1, 0) \otimes {(0, 1)}^{'}) \\ (I_{2} \otimes (0, 1) \otimes {(1, 0)}^{'}, I_{4}) \\ (I_{4}, (1, 0) \otimes {(0, 1)}^{'} \otimes I_{2}) \\ ((1, 0) \otimes {(0, 1)}^{'} \otimes I_{2}, I_{4}) \end{matrix}) . \end{matrix}

\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\begin{aligned} \left[ \begin{array}{c} \varvec{I}_2\otimes (\varvec{I}_2,(1,0)\otimes (0,1)^\prime )\\ \varvec{I}_2\otimes ((0,1)\otimes (1,0)^\prime ,\varvec{I}_2)\\ (\varvec{I}_4,\varvec{I}_2\otimes (1,0)\otimes (0,1)^\prime )\\ (\varvec{I}_2\otimes (0,1)\otimes (1,0)^\prime ,\varvec{I}_4)\\ (\varvec{I}_4,(1,0)\otimes (0,1)^\prime \otimes \varvec{I}_2)\\ ((1,0)\otimes (0,1)^\prime \otimes \varvec{I}_2,\varvec{I}_4) \end{array}\right] . \end{aligned}$$\end{document}

Unlike the other observable properties that have been discussed thus far, MM for all test item does not imply that MM also holds for any subset of item. For example, for $J \geq 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J\ge 3$$\end{document} , MM does not imply NC nor the other way around.

1.1.5. Strongly Positive Orthant Dependency

Holland (Reference Holland1981) proposed a generalization of the MH model, by relaxing the LI condition. His approach to modeling the dependencies between the item variables uses clusters of item variables with outcomes of all zeros or ones. Let $V$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{V}$$\end{document} contain a selection of variables from $X$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}$$\end{document} and consider the partition $V = (Y, Z)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{V}=(\varvec{Y},\varvec{Z})$$\end{document} . Besides UD, also assume that both

(4a)

\begin{matrix} P (V = 1 | Θ = θ) is non-decreasing in θ, and \end{matrix}

(4b)

\begin{matrix} P (V = 0 | Θ = θ) is non-increasing in θ, \end{matrix}

for any selection $V$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{V}$$\end{document} . Then, Holland (Reference Holland1981) showed that these assumptions together with the assumption of local nonnegative dependence (LND) coincide with following three inequalities:

(5a)

\begin{matrix} P (V = 1) \geq P (Y = 1) P (Z = 1), \end{matrix}

(5b)

\begin{matrix} P (V = 0) \geq P (Y = 0) P (Z = 0), and \end{matrix}

(5c)

\begin{matrix} P (Y = 1, Z = 0) \leq P (Y = 1) P (Z = 0), \end{matrix}

for any partition of the selected variables $V = (Y, Z)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{V}=(\varvec{Y},\varvec{Z})$$\end{document} , where the assumption LND is obtained from (5a–5c) by conditioning each term on $Θ = θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\Theta =\theta $$\end{document} .

The observable property defined by (5a–5c), for any $V = (Y, Z)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{V}=(\varvec{Y},\varvec{Z})$$\end{document} implies strongly positive orthant dependency (SPOD; Joag-Dev, Reference Joag-Dev1983), with the latter obtained by taking $V = X$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{V}=\varvec{X}$$\end{document} (Block & Fang, Reference Block and Fang1990). Following Holland and Rosenbaum (Reference Holland and Rosenbaum1986, p. 1531), we refer to the property defined by (5a–5c) as SPOD, but have it understood that it applies to any subset of item variables from $X$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}$$\end{document} .

The property SPOD can be expressed in terms of the log-odds ratios in (1) by appropriately adjusting matrix $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} and v. For example, for $V = (X_{i}, X_{j})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{V}=(X_i,X_j)$$\end{document} , all three inequalities coincide with Cov $(X_{i}, X_{j}) \geq 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$(X_i,X_j)\ge 0$$\end{document} . For $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} , let $Y = X_{1}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{Y}=X_1$$\end{document} and $Z = (X_{2}, X_{3})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{Z}=(X_2,X_3)$$\end{document} , so that (5a) and (5b) imply that

\begin{matrix} \begin{matrix} ln p_{8} - ln p_{4} - ln (p_{5} + p_{6} + p_{7}) + ln (p_{1} + p_{2} + p_{3}) \geq 0 and \\ ln p_{1} - ln p_{5} - ln (p_{2} + p_{3} + p_{4}) + ln (p_{6} + p_{7} + p_{8}) \geq 0, \end{matrix} \end{matrix}

respectively. These two inequalities hold, if and only if (5c) holds, for $Y = (X_{2}, X_{3})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{Y}=(X_2,X_3)$$\end{document} and $Z = X_{1}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{Z}=X_1$$\end{document} , and $Y = X_{1}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{Y}=X_1$$\end{document} and $Z = (X_{2}, X_{3})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{Z}=(X_2,X_3)$$\end{document} , respectively. Hence, for $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} , SPOD reduces to inequality (5c), for all $V = (Y, Z)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{V}=(\varvec{Y},\varvec{Z})$$\end{document} .

1.1.6. Nonnegative Partial Correlations

Unlike the observable properties discussed above, NPC does not lend itself to be expressed as restrictions on the log-odds ratios. Instead, consider the selection of variables $(X_{i}, X_{j}, X_{k})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$(X_i,X_j,X_k)$$\end{document} from $X$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}$$\end{document} . Then, for any such selection of variables, the property NPC requires that

(6)

\begin{matrix} Cov (X_{i}, X_{j}) Var (X_{k}) \geq Cov (X_{i}, X_{k}) Cov (X_{j}, X_{k}), \end{matrix}

which each selected variable taking on the role of $X_{k}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$X_k$$\end{document} once (Ellis, Reference Ellis2014). NPC holds, whenever all trivariate distributions of triplets of response variables satisfy ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} (Ellis, Reference Ellis, Millsap, Bolt, van der Ark and Wang2015).

1.2. Relationships Between the Observable Properties

All observable properties for the binary response data above are implied by CA (Holland & Rosenbaum, Reference Holland and Rosenbaum1986, p. 1536). Figure 1 (left) shows an overview of the observable properties and their relationships, for $J \geq 4$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J\ge 4$$\end{document} . The property MM is implied by CA, but MM neither implies, nor is implied by any of the other properties. In Fig. 1, NPC pertains to the trivariate distributions of all triplets of items, and NC pertains to the bivariate distributions of all pairs of items. The remaining observable properties apply to the multivariate distribution of all the J item variables. In case $J = 2$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=2$$\end{document} , all the properties coincide with Cov $(X_{1}, X_{2}) \geq 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$(X_1,X_2)\ge 0$$\end{document} . For $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} , binary random variables, Ellis (Reference Ellis, Millsap, Bolt, van der Ark and Wang2015) showed that the properties CA and ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} coincide. Also, the properties A and SPOD coincide (‘Appendix’), as shown in Fig. 1 (right).

Figure 1. Hierarchical relationships between the observable properties, for J binary variables.

1.3. Practical Considerations

Figure 2 also shows the natural logarithm of the number of restrictions v imposed on the multivariate distribution of the item variables by the observable properties in Fig. 1. The bold line is included for reference and shows that the number of restrictions imposed by CA fast exceeds $10^{J}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$10^J$$\end{document} for $J > 6$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J>6$$\end{document} . This means that an exhaustive or complete test of CA is practically infeasible for more than five items (Bartolucci & Forcina, Reference Bartolucci and Forcina2005; De Gooijer & Yuan, Reference De Gooijer and Yuan2011).

Figure 2. The number of restrictions imposed by the observable properties as a function of J.

The many inequality restrictions imposed by the various properties limit the use of likelihood ratio tests (Bartolucci & Forcina, Reference Bartolucci and Forcina2000, Reference Bartolucci and Forcina2005; Tijmstra et al., Reference Tijmstra, Hessen, Van der Heijden and Sijtsma2013) that require the estimation of $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} under all constraints imposed by the restriction. Also, obtaining the distribution of the test statistics often involves simulations, where the problem is similar to Bayesian methods for testing the properties (e.g., Tijmstra et al., Reference Tijmstra, Hoijtink and Sijtsma2015, for MM), in that the agreement to all v restrictions need to be assessed for many samples of $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} . For local (diagnostic) tests, as performed in Mokken scale analysis (Molenaar & Sijtsma, Reference Molenaar and Sijtsma2000; Van der Ark, Reference Van der Ark2007), the problem induced by the many restrictions is that of multiple testing (Ellis, Reference Ellis2014).

Beside the many restrictions, another problem for assessing the observable properties relates to sparseness of observations. Because the number of response patterns $x$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{x}$$\end{document} increases exponentially with the number of items, many of these response patterns will be expected to have sparse observations, even for large sample sizes. The sparse observations may thus not only limit the extent to which one can rely on the asymptotic results of a likelihood ratio test, but also make the results of locally performed tests sensitive to sampling error.

Not all properties are equally sensitive to sparse observation. By pertaining only to the (marginal) bivariate distributions, the assessment of property of NC will generally involve fewer number of sparse observations than ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , for example, where each restriction involves the joint distribution of four response patterns. For illustration, data on the performance of 425 pupils on four transitive reasoning tasks (Length) were analyzed (Verweij et al., Reference Verweij, Sijtsma and Koops1996, available from the mokken package, Van der Ark, Reference Van der Ark2007). Two of the vectors $x$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{x}$$\end{document} contained no observations, so that the active number of restrictions of CA was reduced by 12–78. Figure 3 shows the 78 estimated log-odds ratios in ascending order, along with their 95% confidence interval. The figure shows that there are 33 violations of CA; one significant violation. Figure 3 also shows the 15 out of 24 (active) logs-odds ratios for property ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} and the six estimated values for NC. Comparing the results of ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} to NC clearly illustrates how the property NC is more robust to sampling error, as reflected by the narrow confidence intervals compared to those for ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} . However, NC is also associated with a substantial loss of power, with the log-odds ratios generally located more to the right.

Figure 3. Log-odds ratios for the properties CA, ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , and NC (for each in ascending order), along with the 95% confidence intervals.

1.4. Properties for Trivariate Distributions of Item Triplets

The previous section showed that, on the one hand, the property NC overcomes the problem of sparse observations by pertaining to the bivariate (marginal) distributions of pairs of items, but is also associated with a substantial loss of information about the validity of the MH model assumptions. On the other hand, the property ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} does appear to be more powerful in detecting violations of the model assumptions, but is rather sensitive to sparseness of observations, rendering it sensitive to sampling error.

The property NPC utilizes the information contained in the trivariate distributions of all triplets of item variables and thereby strikes a balance between the practical limitations that affect property NC and ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} . Property NPC imposes tighter constraints on $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} than NC and might therefor provide a more powerful test for detecting violations of the MH model assumptions. Also, the trivariate distribution of item triplets will generally contain few sparse observations for sufficiently large sample sizes, $N > 200$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$N>200$$\end{document} , say.

Like property NPC, consider applying the multivariate observable properties to the trivariate distributions of all triplets of item variables, and let 3-CA denote the property CA applied to the trivariate distributions of all triplets of items (similar for the other properties). Then, the properties applied to the trivariate distributions are related as shown in Fig. 4. The top two rows in Fig. 4 coincide in case $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} .

Figure 4. Hierarchical relationships between the observable properties (excluding MM), for $J \geq 4$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J\ge 4$$\end{document} binary variables.

2. Incomplete Tests of Conditional Association

In this section, the tightness of the constraints imposed on $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} by the observable properties is investigated. With property CA implying all the other properties considered in the previous section, the other properties for assessing the MH model assumptions can be considered to be incomplete tests of CA; in the sense the properties can be obtained by relaxing some of the restrictions imposed by CA (Maraun et al., Reference Maraun, Jackson, Luccock, Belfer and Chrisjohn1998). In practice, we rely on such incomplete tests, due to the large number of restrictions CA imposes. However, the number of inequality restrictions does not provide a clear indication of the tightness of the constraints imposed by the observable properties. For example, for $J = 4$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=4$$\end{document} property A imposed 99 restrictions, which are all implied by the 24 constraints imposed by ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} . Figure 1 shows the hierarchical relationships of the observable properties, but it does not show how much information is lost when, instead of CA, an incomplete test of CA is used to make inferences about the MH model assumptions. The advantage of the use of incomplete tests is that their assessment generally involves fewer inequality restrictions, and these incomplete tests are generally less sensitive to sparse observations. As a consequence, incomplete tests of CA are practically useful, but only to the extent that they are not associated with a substantial loss of information about CA. Such a loss of information would namely result in loss of power when assessing the MH model assumptions.

In the application of their likelihood ratio procedure, Bartolucci and Forcina (Reference Bartolucci and Forcina2005) observed that only a few CA restrictions were ‘activated’ in addition the restrictions imposed by ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} . This suggests that little information may be lost when ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} is assessed, instead of property CA. Here, the tightness of the constraints imposed on $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} is investigated in terms of model complexities, which provides a general assessment of the observable properties that does not rely on the data. With the observable properties all impose inequality restrictions on the probabilities associated with the multinomial frequencies, we can think of each of these properties as a model for the multinomial response frequencies and rephrase the choice for an incomplete test for the MH model assumptions as a model selection problem.

In general, model selection involves a tradeoff between the goodness of fit of the models under consideration and the model complexities. A model is selected, if it can accurately predict future data. This requires accurate model-data fit, while also providing a description of the data that is as simple as possible (Occam’s razor), as not to overfit the data. Statistics that balance goodness of fit against model complexity include Akaike’s (Reference Akaike1974) AIC and Schwarz’s (Reference Schwarz1978) BIC, where the goodness of fit is expressed by the likelihood function, and the model is penalized by the estimated number of parameters. Complexity, however, involves more than the number of estimated parameter (Myung et al., Reference Myung, Pitt and Kim2005). For example, Bonifay and Cai (Reference Bonifay and Cai2017) found that different parametric models for response data that had the same number of parameters differed in the extent to which they fit diverse patterns of data. They thereby showed that model complexity is only partly described by the number of model parameters (Pitt et al., Reference Pitt, Myung and Zhang2002; Preacher, Reference Preacher2006). Similar to the idea of fitting propensity suggested by Preacher (Reference Preacher2006), we here define the complexities of the observable properties as the proportion of samples from the (unconstrained) multinomial model that satisfy the inequality constraints of the observable properties. By assigning a distribution to the multinomial probability parameters, this notion of complexity corresponds to the definition of model complexity for Bayes factors, with the distribution of the multinomial parameters taking up the role of the encompassing prior (Hoijtink, Reference Hoijtink2011; Klugkist & Hoijtink, Reference Klugkist and Hoijtink2007). A more complex property is then said to impose looser constraints on the outcomes, thus fitting a wider range of patters of data. In this respect, a higher complexity means that the property is generally less sensitive to model violations. Hence, property CA is the least complex of the properties considered, and NC is the most complex.

2.1. On the Complexity of the Observable Properties

A simulation study was performed as an initial assessment of the complexities of the observable properties, for $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} . A total of one million vectors $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} where samples from a flat Dirichlet distribution, with $p > 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}>\varvec{0}$$\end{document} and $1^{'} p = 1$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{1}^\prime \varvec{p}=1$$\end{document} . These samples provided a uniform coverage of the outcome space of $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} (cf. Bonifay & Cai, Reference Bonifay and Cai2017). Subsequently, for each vector $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} , all the observable properties in Fig. 1 (right) were assessed. The proportion of samples that satisfy a given observable property then provides an indication of the complexity of the property.

The results of the simulation show that a total of 163,627 samples (16.36%) satisfy either NC or MM or both, with a small percentage (0.36%) that only satisfied MM, and about 5.04% that satisfy both NC and MM. Figure 5 shows the overlap between the observable properties, with the conditional percentages, given that either NC or MM or both are satisfied. Note that the intersection of NC and MM is contained in SPOD. In ‘Appendix,’ it is proven that this is always the case. Figure 5 shows that SPOD accounts for about 75.08% of all samples that satisfy either NC or MM. Of the 10 million samples (unconditionally), CA was satisfied by about 2.09% of the samples. The constraints imposed by CA are considerably tighter than those imposed by the other observable properties, with no one property containing more than 40% (38.56% for MM) of samples that also agree with CA. If both NC and MM satisfied, then about 41.35% of these samples also satisfy CA.

Figure 5. Triangular Venn diagram of properties in Fig. 1 ( $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} ), with the overlap between NC and MM in gray, with the conditional percentages, given either NC or MM (or both).

2.2. Scalability Coefficient

Rather than using an incomplete test of CA to assess the MH model assumptions, the associations between the response variables can be expressed by a statistics, like a scalability coefficient. A desirable property of such a statistic would be that it is related to the tightness of the imposed bounds on $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} (Kimeldorf & Sampson, Reference Kimeldorf and Sampson1989), such that the value of the statistic corresponds to the hierarchical relationship in Fig. 1. To assess whether property CA can be reliably inferred from the value of scalability coefficients H, the coefficient was computed for each of the previously sampled vectors $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} (e.g., Roskam et al., Reference Roskam, Van den Wollenberg and Jansen1986, p. 266).

Figure 6 shows the estimated conditional densities of H, given each of the observable properties in Fig. 1 (right). Although the ordering of these densities roughly agrees with the hierarchical relationships between the properties, Fig. 6 shows that the densities have a considerable overlap. This means that it is practically impossible to reliably infer which property holds, given the value of H. Moreover, the value of coefficient H was below the default recommended value of 0.30 for 40.75% of the cases for which property CA was satisfied.

Figure 6. Conditional densities (vertically displayed) of the scalability H, given the properties in Fig. 1 ( $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} ), along with the percentages $H < 0.30$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$H<0.30$$\end{document} .

2.3. Manifest Monotonicity

Property MM was found to be the least complex of the incomplete tests of CA for $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} , imposing the tightest constraints on $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} after CA. Here, we further explore the discrepancy in complexity between MM and CA as J increases. To this end, a Gibbs sampler was employed to sample 10,000 vectors $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} under the constraints imposed by MM and assess the percentage of these samples that also satisfy CA. We first explain the Gibbs sampling procedure (cf. Ligtvoet & Vermunt, Reference Ligtvoet and Vermunt2012; Hoijtink & Molenaar, Reference Hoijtink and Molenaar1997).

Gibbs sampler Suppose we wish to sample a vector $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} from a flat Dirichlet distribution under the constraints imposed by v inequality restrictions. Also, suppose we already have the vector $q$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{q}$$\end{document} that satisfies these constraints. Then, we can sequentially sample the values $p_{j}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$p_j$$\end{document} by following the next three steps. First, compute from the inequalities imposed on $p_{j}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$p_j$$\end{document} the maximum lower bound $a_{j}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$a_j$$\end{document} and the minimum upper bounds $b_{j}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$b_j$$\end{document} , using the values $q_{1}, \dots, q_{j - 1}, q_{j + 1}, \dots, q_{2^{J}}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$q_1,\ldots ,q_{j-1},q_{j+1},\ldots , q_{2^J}$$\end{document} . For example, for $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} , the element $p_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$p_2$$\end{document} is bounded from above by MM by the first restriction in (3): $p_{2} \leq q_{1} (q_{7} + q_{7}) / q_{5} - q_{3}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$p_2\le q_1(q_7+q_7)/q_5-q_3$$\end{document} . Second, sample a value $q_{j}^{*}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$q^*_j$$\end{document} from a gamma distribution (unit shape) that is truncated from below by $max (0, a_{j})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\max (0,a_j)$$\end{document} and from above by $b_{j}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$b_j$$\end{document} . From this, the new vector $q = q^{*} / 1^{'} q^{*}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{q}=\varvec{q}^*/\varvec{1}^\prime \varvec{q}^*$$\end{document} is obtained, with $q^{*} = {(q_{1}, \dots, q_{j - 1}, q_{j}^{*}, q_{j + 1}, \dots, q_{2^{J}})}^{'}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{q}^*=(q_1,\ldots ,q_{j-1},q_j^*,q_{j+1},\ldots , q_{2^J})^\prime $$\end{document} . Third, we have for $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} the vector $q$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{q}$$\end{document} obtained by repeating the first two steps for all $p_{j}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$p_j$$\end{document} .

To obtain the initial vector $q$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{q}$$\end{document} for the Gibbs sampler, a single sample is taken from the flat Dirichlet distribution, for which we assess the required restrictions. Those restrictions that are satisfied are then ‘activated’ and the Gibbs sampler is run using the active restrictions only, resulting in a new vector for which at least the active restrictions are satisfied. The vector $q$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{q}$$\end{document} is then obtained by repeating the Gibbs sampler and activating those (additional) restrictions that are satisfied at each step, until all v restrictions are active.

Recall that for $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} , 38.56% of the samples that satisfied property MM also satisfied CA. Of the 10,000 samples obtained from the Gibbs sampler for $J = 4$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=4$$\end{document} , about 0.06% were found to also satisfy CA. Increasing the number of items to five further reduced this percentage to below 0.01%. The results strongly suggest that the discrepancy in complexity between the properties MM and CA increases as the number of items increases.

2.4. The Distributions of Subsets of Item Variables

The complexities of the properties are further investigated for $J = 4$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=4$$\end{document} , which extends the results in Fig. 5 (excluding MM) and includes the properties ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} and A, along with 3-CA and 3-SPOD for the trivariate distributions of all four triplets of item variables. A total of 10 million samples of the vector $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} were obtained from a flat Dirichlet distribution. Of these 10 million samples, 343,556 (3.44%) satisfied NC. For these 343,556 samples, Fig. 7 shows the percentages of overlap between the observable properties. For example, the gray areas in Fig. 7 correspond to the properties A and 3-CA, where A accounts for about 34.76% of the samples that satisfy NC and the property 3-CA accounts for about 0.45%, with the latter, thus imposing considerably tighter constraints on $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} (less complex). Of the samples that satisfy NC, both ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} and CA were satisfied by less than 0.01%. After CA and ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , the properties 3-CA and NPC imposed the tightest constraints on $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} , which were satisfied by, respectively, 0.45% and 33.83% of all samples that satisfied NC (0.02% and 1.16% of all 10 million samples). However, even for those samples that satisfied 3-CA, only about 0.77% also satisfied CA. Hence, for $J = 4$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=4$$\end{document} , the results show that there exists a considerable gap between the complexity of property CA and any of the incomplete tests for CA (except ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} ).

Figure 7. Triangular Venn diagram of properties in Fig. 4, with the conditional percentages, given NC. The properties A and 3-CA and their overlap are shown in gray.

For $J > 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J>3$$\end{document} , the property CA implies ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , but not the other way around. However, because of the small number of cases that satisfied ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , none of the samples contained cases for which ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} was satisfied and CA was not. To further investigate the distinction between the complexity of ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} and CA, the Gibbs sampler (Sect. 2.3) was employed to sample 10,000 vectors $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} under the constraints imposed by ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} . For $J = 4$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=4$$\end{document} , the percentage of samples that satisfied CA was about 98.38%. Using the same procedure for $J = 5$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=5$$\end{document} , this percentage slightly reduced to 94.48%, with the log-odds ratio of the largest observed violation of CA corresponding to a small effect size (Haddock et al., Reference Haddock, Rindskopf and Shadish1998; Hasselblad & Hedges, Reference Hasselblad and Hedges1995). This result agrees with the observation mentioned earlier by Bartolucci and Forcina (Reference Bartolucci and Forcina2005).

3. Sensitivity to Model Violations

All the observable properties considered in the previous section are implied by the MH model for binary response variables, such that the violation of any of these properties discredits the assumptions that define the MH model. The different properties may, however, not be equally sensitive to different model violations. Insights into the sensitivity of the observable properties to various model violations may aid the development of goodness-of-fit statistics for specific model assumptions.

3.1. Violations of Local Independence

The MH model consists of the assumptions of LI, UD, and M. Holland (Reference Holland1981) suggested an alternative set of assumptions, consisting of LND, UD, and the monotonicity assumption of perfect scores in (4a) and (4b), which imply M. Here, LND relaxes the LI assumption, whereby LI is obtained from the LND assumption by replacing the inequality restrictions of LND by equalities (Holland Reference Holland1981, Theorem 1). The alternative set of assumptions coincide with the observable property of SPOD, which means that SPOD corresponds to a model for which LI is not assumed to hold. Furthermore, the MH model implies CA, which in turn implies SPOD Rosenbaum (Reference Rosenbaum1984).

As was shown in the previous section, CA occupies only a very small section of the outcomes space that satisfies SPOD. Hence, CA is a priori unlikely to hold, given that the data satisfy a model that does not imply LI. Consequently, we may conclude that CA is sensitive to violations of the LI assumptions. Based on the results in Fig. 7 ( $J = 4$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=4$$\end{document} ), the same may be concluded for ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} and (tentatively) for 3-CA, as these properties show little overlap with SPOD.

Neither of the properties NC, NPC nor MM imply SPOD, which means that these properties may or may not hold, irrespective of SPOD. The properties may then be sensitive to violations of LI when modeled in a specific way, but not to violations of LI in general. Property MM, however, is shown in Fig. 5 to be almost completely encompassed by SPOD and thus may be found to be sensitive to violations of LI more generally. For Mokken scale analysis based on these properties, this means that a violation of NC or NPC discredits the MH model, but from this it cannot be concluded that the observed violation was due to a violation of the LI assumption.

3.2. Violations of Unidimensionality

Holland and Rosenbaum (Reference Holland and Rosenbaum1986) referred to a model that satisfies LI and M, but allows $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{\Theta }$$\end{document} to be multidimensional, as a monotone latent variable model. They showed that any monotone latent variable model implies property ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , if the density of $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{\Theta }$$\end{document} is ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} . A similar result was obtained by Ellis (Reference Ellis, Millsap, Bolt, van der Ark and Wang2015), in case $(X, Θ)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$(\varvec{X},\varvec{\Theta })$$\end{document} satisfies a particular higher-order factor structure. This means that one cannot make inferences about the dimensionality of (the unobserved) $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{\Theta }$$\end{document} based on the confirmation of ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} or any property it implies. Because of the minor discrepancy found between the properties ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} and CA, the assessment of the dimensionality of $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{\Theta }$$\end{document} poses a real challenge for future research.

Another difficulty, when studying the influence of violations of UD, is that the addition of more latent variables in a model generally coincides with a violation of the LI assumptions when fitting a unidimensional model.

3.3. Violations of Monotonicity

A small simulation study is performed to investigate the sensitivity of the observable properties to violations of assumption M. Given the assumptions of LI and UD, a choice needs to be made for the number of items, the distribution of the latent variable, and a way of inducing and quantifying violations of M. The results of the analysis on the sensitivity of the observable properties to violations of M highly depend on these choices. In order to make the results fairly generalizable across a wide range of choices of model specifications, a latent class approach is used (e.g., Croon, Reference Croon1990; Heinen, Reference Heinen1993; Lazarsfeld, Reference Lazarsfeld, Stouffer, Guttman, Suchman, Lazarsfeld, Star and Clausen1950). The approach consists of assuming a discrete distribution for the latent variable. By taking the number of latent classes to equal to the number of distinct response patterns, this approach is highly flexible with respect to the shape of the distribution of the latent variable and the shape of the response functions.

The choice for the number of items is motivated by the results on the complexities of the properties, which were shown to be very restrictive, especially for large numbers of items. By initially taking $J = 4$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=4$$\end{document} , we may expect the latent class model to generate sufficient samples of the vector $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} for which the properties hold, in order to compare the size of the violations of M between those cases where the property is violated to those cases where the property holds. For $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} , the results are similar to the ones presented here.

3.3.1. Procedure

For the distribution of $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\Theta $$\end{document} , a vector $c = {(c_{1}, \dots, c_{16})}^{'}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{c}=(c_1,\ldots ,c_{16})^\prime $$\end{document} was sampled from a Dirichlet distribution, which contains the latent class proportions $c_{k} = P (Θ = k)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$c_k=P(\Theta =k)$$\end{document} . The parameters of the Dirichlet distribution were chosen, such that the middle latent classes had generally more support. Further, let $b_{i} = {(b_{i 1}, \dots, b_{i 16})}^{'}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{b}_i=(b_{i1},\ldots ,b_{i16})^\prime $$\end{document} , with $b_{ik} = P (X_{i} = 1 | Θ = k)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$b_{ik}=P(X_i=1|\Theta =k)$$\end{document} sampled from a beta distribution, and with the elements in $b_{i}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{b}_i$$\end{document} arranged in increasing order in agreement with assumption of M. Figure 8 shows an example of four response functions $P (X_{i} = 1 | Θ = k)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$P(X_i=1|\Theta =k)$$\end{document} , with in light gray the 95% intervals of the response functions under the simulation conditions, along with the intervals for the latent classes. To induce a violation of M, six adjacent element of $b_{i}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{b}_i$$\end{document} were randomly selected, and reversely ordered, leading to locally decreasing response functions. Assuming LI, we then get $p = A c$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}={\mathbf {A}}\varvec{c}$$\end{document} , with $A = (a_{1}, \dots, a_{16})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {A}}=(\varvec{a}_1,\ldots ,\varvec{a}_{16})$$\end{document} and $a_{k} = {(1 - b_{1 k}, b_{1 k})}^{'} \otimes \dots \otimes {(1 - b_{4 k}, b_{4 k})}^{'}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{a}_k=(1-b_{1k},b_{1k})^\prime \otimes \cdots \otimes (1-b_{4k},b_{4k})^\prime $$\end{document} . A total of 10,000 such vectors $p$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}$$\end{document} were generated, each containing the multinomial parameters for the outcomes of the four item variables, with each response functions violating the assumption M.

Figure 8. Example of four item response functions that violate M, with the density of $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\Theta $$\end{document} given below. The light-gray areas show the 95% intervals under which the functions were generated before inducing a violation of M. The dark-gray areas (above the local decreases) show the size of the violations of M, with $V_{i}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_i$$\end{document} expressing the size of the area weighted by the density of the latent variable.

To quantify the size of the violation of M, let $d_{i} = {(b_{i 1}, d_{i 2}, \dots, d_{i 16})}^{'}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{d}_i=(b_{i1},d_{i2},\ldots ,d_{i16})^\prime $$\end{document} , with the values of $d_{i 2}, \dots, d_{i 16}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$d_{i2},\ldots ,d_{i16}$$\end{document} obtained sequentially as $d_{ik} = max (d_{i, k - 1}, b_{ik})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$d_{ik}=\max (d_{i,k-1},b_{ik})$$\end{document} . Then,

(7)

\begin{matrix} V_{i} = c^{'} (d_{i} - b_{i}) \times 100 %, \end{matrix}

which expresses the average probability (as percentage) required to compensate for the local decreases of the initial response function. Figure 8 shows for each item the value $V_{i}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_i$$\end{document} , corresponding to the dark-gray area above the local decrease, weighted by the probability mass function of $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\Theta $$\end{document} . For example, for the first two items in Fig. 8, $V_{1} = 4.69$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_1=4.69$$\end{document} and $V_{2} = 16.99$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_2=16.99$$\end{document} , where the second response function shows a decrease at a denser region of $Θ$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\Theta $$\end{document} .

3.3.2. Results

Let $V_{M}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ M }}$$\end{document} denote the average value of $V_{i}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_i$$\end{document} , across the four items. The results of the simulation show that ${\bar{V}}_{M} = 8.169$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\overline{V}}_{\text{ M }}=8.169$$\end{document} across the 10,000 generated cases (with the 1st and 3th quartile at 6.389 and 9.700, respectively), which is about equal to the value of $V_{M}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ M }}$$\end{document} obtained for Fig. 8.

Assessing the validity of the observable properties and evaluating the distributions of $V_{M}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ M }}$$\end{document} for those cases for which the properties held true showed that the distributions of $V_{M}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ M }}$$\end{document} were about the same for the properties 3-SPOD, A, and SPOD, and about the same for both ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} and CA. The results of the simulation are therefore discussed further only for the properties NC, 3-CA, NPC, SPOD, MM, and CA.

For each property, Fig. 9 shows the estimated densities of $V_{M}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ M }}$$\end{document} (vertically displayed) in case the property was satisfied (True; false discovery) and in case it was violated (False). Figure 9 also shows the percentage of times each property was satisfied, with property CA satisfied about 24.67% of the time, 3-CA satisfied about half the time, and the remaining properties satisfied most of the time. The percentages listed in Fig. 9 roughly agree with the hierarchical ordering of the property in Fig. 1.

Figure 9. Conditions distributions of the size of the violations of M ( $V_{M}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ M }}$$\end{document} ), given that the properties NC, 3-CA, NPC, SPOD, MM, and CA hold (True; with percentage of cases) or are violated (False). Results for the properties 3-SPOD and A are similar as for 3-CA, and the results for ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} are similar as for CA.

The differences of the violations $V_{M}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ M }}$$\end{document} between the True cases and the False cases were found to be of a small to medium size for the properties 3-CA, MM, and CA, in in accordance to Cohen’s (Reference Cohen1988) d. Figure 9 shows that the properties NPC, NC, and SPOD are most sensitive to the size of the violations of M, each corresponding to a large effect size, with the larges value $d = 1.127$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$d=1.127$$\end{document} . Hence, SPOD is not sensitive to violations of M in a strict sense (only rejected about 6.21% of the time), but the property is more likely to be rejected when the violations of M are larger. This in contrast to property CA, which is generally more likely to be rejected, irrespective of the size of the violations. For practical purposes, however, it may be argued that a relative small violation of M should not matter. This would mean that CA may impose constraints on the observable data distribution that just are too restrictive. For example, one might only be interested in testing the MH model assumptions, because this model implies a MLR ordering on the latent variable by the sum score. Then, for the practical use of the sum score, the size of the violation of assumption M matters only to the extent to which it jeopardizes the MLR property.

3.3.3. The Monotone Likelihood Ratio Property

To assess the influence of the M assumption on the MLR property, the response functions that violate M are combined for each of the 10,000 cases to give an expression for the violation of property MLR, similar to $V_{M}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ M }}$$\end{document} . To this end, let $E = HA$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {E}}=\mathbf {HA}$$\end{document} , with element $e_{sk} = P (S = s - 1 | Θ = k)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$e_{sk}=P(S=s-1|\Theta =k)$$\end{document} . Here, $A$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {A}}$$\end{document} is obtained from the simulation and $H$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {H}}$$\end{document} is a matrix to relate the vectors $x$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{x}$$\end{document} to their sum scores. Specifically, let $H_{1} = I_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {H}}_1={\mathbf {I}}_2$$\end{document} and $H_{i + 1} = [{(H_{i}, 0^{'})}^{'}, {(0^{'}, H_{i})}^{'}]$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {H}}_{i+1}=[({\mathbf {H}}_i,\varvec{0}^\prime )^\prime ,(\varvec{0}^\prime ,{\mathbf {H}}_i)^\prime ]$$\end{document} , from which $H = H_{J}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {H}}={\mathbf {H}}_J$$\end{document} is obtained sequentially. Then, vector $b_{s} = {(b_{s 1}, \dots, b_{s 16})}^{'}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{b}_s=(b_{s1},\ldots ,b_{s16})^\prime $$\end{document} , with

\begin{matrix} b_{sk} = e_{s + 1, k} / (e_{sk} + e_{s + 1, k}) = P (S = s | S = s - 1 \lor S = s, Θ = k) . \end{matrix}

The MLR property requires this last expression is non-decreasing in k. Hence, defining $d_{s}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{d}_s$$\end{document} analogous to $d_{i}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{d}_i$$\end{document} , we define $V_{MLR}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ MLR }}$$\end{document} as the average of $V_{s}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_s$$\end{document} obtained from (7) after substituting the item index by the sum score s.

Figure 10 contains the density plot with the estimated 50%, 95%, and 99% confidence regions of $ln V_{M}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\ln V_{\text{ M }}$$\end{document} and $ln V_{MLR}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\ln V_{\text{ MLR }}$$\end{document} , which shows a weak but positive relationship between the size of the violations of M and the size of the violations of MLR. As the size of the violation of M increases, so does the strength of the relationship. However, the size of the violations of MLR is generally small, with ${\bar{V}}_{MLR} = 2.011$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\overline{V}}_{\text{ MLR }}=2.011$$\end{document} (the 1st and 3th quartile at 1.018 and 2.565, respectively). This means that none of the violations of M substantially invalidate the MLR property. The values $V_{MLR}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ MLR }}$$\end{document} were further compared between the True and False cases, for each property. These results showed no difference beyond a small effect size for any of the observable properties. Hence, the results suggest that the MLR property is robust against violations of assumption M.

Figure 10. Empirical confidence regions of the size of the violation of M against the size of the violation of property MLR (on a logarithmic scale).

Molenaar (Reference Molenaar1997) generalized the MH model to polytomously scored items, where assumption M can be defined for different definitions of the response function (Mellenbergh, Reference Mellenbergh1995). Unlike the MH model for binary response data, these polytomous models do not imply the MLR property (Hemker et al., Reference Hemker, Sijtsma, Molenaar and Junker1996, Reference Hemker, Sijtsma, Molenaar and Junker1997) without imposing additional restrictions on the shape of the response function (Ligtvoet, Reference Ligtvoet2012). Although these polytomous models (assuming UD and LI) do not imply the MLR property, Van der Ark (Reference Van der Ark2005) found that generally only few violations of MLR actually occurred, and that these violations had little effect on the ordering of respondents by their sum score. Our results for violations of M for binary response data are in line with these findings.

3.4. Violations of Local Independence (Continued)

For the practical use of the sum score, it was found that the violations of M have little impact on the validity of the MLR property. Here, we consider again the MH model assumption of LI and investigate the impact a violation of LI has on the MLR property, using a latent class approach. As a model for generating the probabilities $P (X = x | Θ = k)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$P(\varvec{X}=\varvec{x}|\Theta =k)$$\end{document} , the assumptions proposed by Holland (Reference Holland1981) are considered (Sect. 1.1.5), for $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} items. This small number of items clearly limits the extent to which the results can be generalized (as with the previous studies), so the results of this study should only be interpreted tentatively.

3.4.1. Procedure

For eight latent classes, let the matrix $P = (p_{1}, \dots, p_{8})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {P}}=(\varvec{p}_1,\ldots ,\varvec{p}_8)$$\end{document} contain the elements $p_{jk} = P (X = x | Θ = k)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$p_{jk}=P(\varvec{X}=\varvec{x}|\Theta =k)$$\end{document} , for which assumption LND dictates that each $p_{k}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}_k$$\end{document} is SPOD. For three items, SPOD coincides with property A, so LND implies that $K log (M p_{k}) \geq 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {K}}\log ({\mathbf {M}}\varvec{p}_k)\ge \varvec{0}$$\end{document} , for $k = 1, \dots, 8$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$k=1,\ldots ,8$$\end{document} , and with the design matrix $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} given in (2). That is, LND imposed constraints on the entries within each column of $P$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {P}}$$\end{document} . The monotonicity requirements in (4a) and (4a) impose additional constraints across the columns of $P$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {P}}$$\end{document} . Let $N_{0} = ({(0, 1)}^{'}, {(1, 1)}^{'})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {N}}_0=((0,1)^\prime ,(1,1)^\prime )$$\end{document} , $N_{1} = N_{0} \otimes N_{0} \otimes N_{0}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {N}}_1={\mathbf {N}}_0\otimes {\mathbf {N}}_0\otimes {\mathbf {N}}_0$$\end{document} , and $N_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {N}}_2$$\end{document} is like $N_{1}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {N}}_1$$\end{document} but with its columns reversed. Then, the monotonicity assumption implies that the elements within each rows of $N_{1} P$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {N}}_1{\mathbf {P}}$$\end{document} are non-decreasing in k, and for $N_{2} P$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {N}}_2{\mathbf {P}}$$\end{document} non-increasing in k. Hence, the assumptions proposed by Holland (Reference Holland1981) correspond to the restrictions impose on $P$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {P}}$$\end{document} by the matrices $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} , $N_{1}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {N}}_1$$\end{document} , and $N_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {N}}_2$$\end{document} . Using the Gibbs sampler (Sect. 2.3), a total of 2000 such matrices $P$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {P}}$$\end{document} were simulated. Next, let $E = FP$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {E}}=\mathbf {FP}$$\end{document} , with element $e_{sk} = P (S = s - 1 | Θ = k)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$e_{sk}=P(S=s-1|\Theta =k)$$\end{document} . Then, for each matrix $P$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {P}}$$\end{document} the statistic $V_{MLR}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ MLR }}$$\end{document} can be computed (as above), with $V_{MLR}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ MLR }}$$\end{document} expressing the size of the violation of the MLR property, as a result of relaxing the LI assumption.

3.4.2. Results

The results of the simulation yield the average ${\bar{V}}_{MLR} = 6.404$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\overline{V}}_{\text{ MLR }}=6.404$$\end{document} (with the 1st and 3th quartile at 3.065 and 8.361, respectively). These violations of the MLR property are substantially higher than those found above due to the violations of the M assumption. Hence, the property of MLR is sensitive to violations of LI. Unfortunately, the assumption LI in our setup does not lend itself for an expression that can serve as a measure for quantifying the size of the violation of the LI assumption.

Evaluating the properties MM and CA (based on the previous analysis in Sect. 3.1), the results showed that property MM was satisfied for half or the cases, whereas CA was satisfied 37.5% of the time. Neither of the properties was found to be sensitive to the size of the violations $V_{MLR}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ MLR }}$$\end{document} .

4. Discussion

Observable properties were evaluated that are all implied by the MH model for binary response data. Any violation of a property discredits the MH model assumptions. The most restrictive of these properties is CA, whereby each of the other properties can be interpreted as an incomplete test of CA. The incomplete tests of CA are hierarchically related and differ with respect to the inequality restrictions that they impose on the observable response distribution. The least restrictive of the properties is NC, and it implies that all covariances between pairs of item variables are nonnegative. The NC property forms the basis of the scalability coefficients used in Mokken scale analysis. The other incomplete tests of CA take into consideration the higher-order moments contained in the trivariate and multivariate distributions of the item scores.

The practical assessment of property CA is limited by the large number of inequality restrictions it imposes. These large number of inequality restrictions not only limit the feasibility of a global test of CA (as for property A), but for local (diagnostic) tests also induce problems associated with multiple testing. In addition to the large number of inequality restrictions, the assessment of the MH model assumptions will inevitably need to deal with sparse observations. Particularly the property of ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , which pertains to the joint distribution of individual response patterns, is sensitive to such sparse observations, and as a result, to sampling error. Due to the number of restrictions imposed by CA and the problem of sparseness of observations, the practical assessment of the MH model assumptions always relies on an incomplete test of CA.

4.1. Complexities of the Observable Properties

The computational burden associated with the large number of inequalities means that the observable properties could be studied only for small numbers of items. In a first series of small studies, we investigated the loss of information, when instead of CA an incomplete test of CA is used. For this purpose, the complexities of the incomplete tests were defined as their tendency to agree with a wide range of patters of data, with CA being the least and NC the most complex of the properties considered. For more than three items, the distinction between the complexities of CA and the incomplete tests of CA was found to be very large, and increased with increasing number of items. The exception to this rule was ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} , which agrees largely with CA. It may therefore be suggested that ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} provides a practical alternative to CA for testing the MH model assumptions, which is associated with little loss of power.

Two remarks about the complexities of the properties are in order. First, the definition of complexity allowed for the loss of information to be studies, without relying on sample size, but this also means that we cannot infer from these results the exact extent to this loss of information translates to a loss of power when assessing the properties on real data. Second, psychological and educational tests contain items that are expected to relate to a common attribute, by design. Real response data will therefore generally agree more with the observable properties than random response patterns from a flat distribution. The complexities of the properties as presented here thus only provide a benchmark against which the relative agreement of different properties can be compared, when applied to real data. This is similar to the way the BIC penalizes the likelihood by the number of parameters. Here, the complexity, in terms of the number, also does not relate to real data.

4.2. On the Sensitivity to Model Violations

A second series of studies was performed to investigate the sensitivity of the observable properties to different violations of the MH model assumptions M, LI, and UD. Only the properties CA and ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} were found to be sensitive to violations of assumption M. However, these violations of M seem to have little impact on the MLR property for ordering respondents by means of their sum scores. The assumption of LI appears to be more relevant to the MLR property. Property CA was found to be sensitive to violations of LI (here, CA coincides with ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} ), and to a lesser degree also MM. Finally, a violation of UD does not imply that ${MTP}_{2}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} is violated.

Besides the incomplete tests of CA considered, other observable properties have been proposed that were not considered. When assessing property MM in Mokken scale analysis, sparse observations are accounted for by joining adjacent rest scores into rest-score groups (Van der Ark, Reference Van der Ark2007). Assessing MM across these rest-score groups thus constitutes an incomplete test for MM. An incomplete test of CA can be similarly obtained by conditioning on the rest scores (Straat et al., Reference Straat, Van der Ark and Sijtsma2016), or some other ‘carefully selected’ sub-test score as suggested by Stout (Reference Stout2002). Further, Ellis and Junker (Reference Ellis and Junker1997) and Junker and Ellis (Reference Junker and Ellis1997) provide a characterization of the MH model, whereby the vector of item variables is taken to be embedded within an infinite sequence of item variables (cf. Junker, Reference Junker1991, Reference Junker1993; Stout, Reference Stout1987, Reference Stout1990). Within this framework, other the observable properties have been proposed, like vanishing conditional dependence and negative conditional covariance (De Gooijer & Yuan, Reference De Gooijer and Yuan2011; Junker, Reference Junker1993; Yuan & Clarke, Reference Yuan and Clarke2001).

4.3. Implications

The results of the studies presented show that CA is a difficult property to assess. Most of the incomplete tests of CA are associated with a substantial loss of information and seem not to be sensitive to specific violations of the MH model assumptions. However, it is also good to keep in mind that any violation of any of the properties considered is sufficient for discrediting the MH model. The challenge herein lies in combining the multitude of information obtained from the data to derive at a single conclusion about the significance of observed violations. This problem can be illustrated in Fig. 3, which shows the results of the log-odds ratios related to CA. Here, only 78 restrictions were considered, but it is not obvious from the results how to combine these into a single conclusion about the validity of the MH model assumptions. A global test may produce a single p-value for this example, but becomes infeasible for more items. Also, different tests might balance the odds on the left and right differently or overemphasize the extreme values. These issues, however, mostly relate to goodness of fit. This is the other aspect of model selection that we didn’t focus on.

The primary focus of this paper is complexity, which mostly concerned the inferences that we can make about CA, based on an incomplete test. It is about the extent to which the confirmation of an incomplete test of CA warrants the validity of CA or (by extension) the MH model assumptions. The results of our analysis have specific implications for the interpretation of results of automated item selections procedures in Mokken scale analysis (Brusco et al., Reference Brusco, Köhn and Steinley2015; Mokken, Reference Mokken1971; Molenaar & Sijtsma, Reference Molenaar and Sijtsma2000; Sijtsma & Molenaar, Reference Sijtsma and Molenaar2002; Straat et al., Reference Straat, Van der Ark and Sijtsma2013). As explained in Mokken et al. (Reference Mokken, Lewis and Sijtsma1986, p. 280), the selection of items based on requirement imposed on the scalability coefficients provides an operational definition of a scale that need not necessarily agree with the MH model. Beside the issue of sampling error, our results show that rules of thumb used for construction such scales are rather arbitrary (cf. Hemker et al., Reference Hemker, Sijtsma and Molenaar1995; Smits et al., Reference Smits, Timmerman and Meijer2012). In addition, in constructing these scales, the higher-order moments contained in the multivariate distributions of the item scores are ignored, which was shown to be associated with a substantial loss of information about the validity of the MH model assumptions. Hence, the scales produced by the automatic item selection procedure may not be very informative about the model underlying the scale and as such provide only an initial selection of items that require further analysis using more powerful tests for detecting violations of the model assumptions.

4.4. Conclusion

The MH model is a very general model, which assumptions are shared by many of the response models used in practice. The assessment of these assumptions thus has implications that stretch beyond just the use of the MH model. As mentioned by Molenaar (Reference Molenaar2004), the inferences from a model are contingent on the validity of the model assumptions. A global test of goodness of fit may reject a model, but this would tell us little about why this is the case or what the problem might be. More research is required about the extent to which the assumptions and the specifications of response models influence the type of inferences one wishes to make (Sinharay & Haberman, Reference Sinharay and Haberman2014; Crişan et al., Reference Crişan, Tendeiro and Meijer2017). For example, our results suggest that the MLR property is less dependent on the specification of the item response functions (cf. Van der Ark, Reference Van der Ark2005) than on the LI assumption. This is important for the applied researcher who may want to test the MH model, not because she cases so much about the model, but because it allows respondents to be ordered on a common scales and it implies testable properties that reassure her that the decisions and inferences she makes based on the sum scores are theoretically justified and empirically supported.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Appendix

Assuming $p > 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}>\varvec{0}$$\end{document} , SPOD coincides with property A, in case of $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} binary variables.

For any subset of two variables from $X = (X_{1}, X_{2}, X_{3})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{X}=(X_1,X_2,X_3)$$\end{document} , SPOD implies that the covariance between the two variables is positive. This corresponds to the first three rows of the matrix in (2) for the three distinct subsets $V = (X_{2}, X_{3})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{V}=(X_2,X_3)$$\end{document} , $V = (X_{1}, X_{3})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{V}=(X_1,X_3)$$\end{document} , and $V = (X_{1}, X_{3})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{V}=(X_1,X_3)$$\end{document} , respectively. The remainder of the proof consists of going through the process of exhaustively listing all restrictions imposed by SPOD, and expressing these in terms of the log-odds ratios. It can then be shown that the last six rows of the matrix in (2) match one to one with those obtained for property SPOD. As an example, consider the inequality in (5c), which reduces for $Y = (X_{1}, X_{2})$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{Y}=(X_1,X_2)$$\end{document} and $Z = X_{3}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{Z}=X_3$$\end{document} to $(p_{7} + p_{8}) (p_{1} + p_{3} + p_{5} + p_{7}) \geq p_{7}$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$(p_7+p_8)(p_1+p_3+p_5+p_7)\ge p_7$$\end{document} and yields $ln p_{8} - ln (p_{2} + p_{4} + p_{6}) - ln p_{7} + ln (p_{1} + p_{3} + p_{5}) \geq 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\ln p_8-\ln (p_2+p_4+p_6)-\ln p_7+\ln (p_1+p_3+p_5)\ge 0$$\end{document} . The last inequality is obtained from (1) using the eighth row of the matrix in (2) for $M$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$${\mathbf {M}}$$\end{document} . The remaining five inequalities can be obtained similarly.

Assuming $p > 0$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\varvec{p}>\varvec{0}$$\end{document} , MM and NC jointly imply the A, in case of $J = 3$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document} binary variables.

For property A, matrix (2) contains in its first three rows the constraints imposed by NC. Further, the first two rows of the matrix in (3) correspond to the MM property for $i = 1$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$i=1$$\end{document} , which implies both $P (X_{1} = 0, S = 0) P (X_{1} = 1, S > 0) \geq P (X_{1} = 1, S = 0) P (X_{1} = 0, S > 0)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$P(X_1=0,S=0)P(X_1=1,S>0)\ge P(X_1=1,S=0)P(X_1=0,S>0)$$\end{document} and $P (X_{1} = 1, S = 2) P (X_{1} = 0, S < 2) \geq P (X_{1} = 0, S = 2) P (X_{1} = 1, S < 2)$ \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$P(X_1=1,S=2)P(X_1=0,S<2)\ge P(X_1=0,S=2)P(X_1=1,S<2)$$\end{document} . These last two inequalities correspond to the restrictions imposed by the fourth and fifth row of (2). Likewise, the remaining four restrictions in (3) imply the last four restrictions in (2).

Footnotes

Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Akaike, H., (1974). A new look at the statistical model identification IEEE Transactions on Automatic Control 19 716–723 10.1109/tac.1974.1100705CrossRef Google Scholar

Andersen, E. B., (1973). A goodness of fit test for the Rasch model Psychometrika 38 123–140 10.1007/bf02291180CrossRef Google Scholar

Bartolucci, F., Forcina, A., (2000). A likelihood ratio test for MTP2 within binary variables Annals of Statistics 28 1206–1218 10.1214/aos/1015956713CrossRef Google Scholar

Bartolucci, F., Forcina, A., (2005). Likelihood inference on the underlying structure of IRT models Psychometrika 70 31–43 10.1007/s11336-001-0934-zCrossRef Google Scholar

Block, H. W., Fang, Z., (1990). Setwise independence for some dependence structures Journal of Multivariate Analysis 32 103–119 10.1016/0047-259X(90)90075-SCrossRef Google Scholar

Bonifay, W., Cai, L., (2017). On the complexity of item response theory models Multivariate Behavioral Research 52 465–484 10.1080/00273171.2017.1309262 28426237CrossRef Google Scholar PubMed

Brusco, M. J., Köhn, H. F., Steinley, D., (2015). An exact method for partitioning dichotomous items within the framework of the monotone homogeneity model Psychometrika 80 949–967 10.1007/s11336-015-9459-8 25850618CrossRef Google Scholar PubMed

Cohen, J. (1988). Statistical power analysis for the behavioral sciences. Lawrence Erlbaum Associates.Google Scholar

Crişan, D. R., Tendeiro, J. N., Meijer, R. R., (2017). Investigating the practical consequences of model misfit in unidimensional IRT models Applied Psychological Measurement 41 439–455 10.1177/0146621617695522 28804181 5533251CrossRef Google Scholar PubMed

Croon, M., (1990). Latent class analysis with ordered latent classes British Journal of Mathematical and Statistical Psychology 43 171–192 10.1111/j.2044-8317.1990.tb00934.xCrossRef Google Scholar

Croon, M., (1991). Investigating Mokken scalability of dichotomous items by means of ordinal latent class analysis British Journal of Mathematical and Statistical Psychology 44 315–331 10.1111/j.2044-8317.1991.tb00964.xCrossRef Google Scholar

De Gooijer, J. G., Yuan, A., (2011). Some exact tests for manifest properties of latent trait models Computational Statistics & Data Analysis 55 34–44 10.1016/j.csda.2010.04.022CrossRef Google Scholar PubMed

Debelak, R., (2019). An evaluation of overall goodness-of-fit tests for the Rasch model Frontiers in Psychology 9 2710 10.3389/fpsyg.2018.02710 30687170 6335387CrossRef Google Scholar PubMed

Ellis, J. L., (2014). An inequality for correlations in unidimensional monotone latent variable models for binary variables Psychometrika 79 303–316 10.1007/s11336-013-9341-5 24659373CrossRef Google Scholar PubMed

Ellis, J. L. (2015). MTP2 and partial correlations in monotone higher-order factor models. In Millsap, R. E. Bolt, D. M. van der Ark, L. A. & Wang, W. C. (Eds.), Quantitative psychology research (pp. 261–272). Springer. https://doi.org/10.1007/978-3-319-07503-7_16 CrossRef Google Scholar

Ellis, J. L., Junker, B. W., (1997). Tail-measurability in monotone latent variable models Psychometrika 62 495–523 10.1007/BF02294640CrossRef Google Scholar

Esary, J. D., Proschan, F., Walkup, D. W., (1967). Association of random variables, with applications The Annals of Mathematical Statistics 38 1466–1474 10.1214/aoms/1177698701CrossRef Google Scholar

Ghurye, S. G., Wallace, D. L., (1959). A convolutive class of monotone likelihood ratio families The Annals of Mathematical Statistics 30 1158–1164 10.1214/aoms/1177706101CrossRef Google Scholar

Glas, C. A. W., (1988). The derivation of some tests for the Rasch model from the multinomial distribution Psychometrika 53 525–546 10.1007/BF02294405CrossRef Google Scholar

Glas, C. A. W., & Verhelst, N. D. (1995). Testing the Rasch model. In Fischer, G. H. & Molenaar, I. W. (Eds.), Rasch models: Foundations, recent developments, and applications (pp. 69–95). Springer. https://doi.org/10.1007/978-1-4612-4230-7_5 CrossRef Google Scholar

Grayson, D. A., (1988). Two-group classification in latent trait theory: Scores with monotone likelihood ratio Psychometrika 53 383–392 10.1007/BF02294219CrossRef Google Scholar

Haddock, C. K., Rindskopf, D., Shadish, W. R., (1998). Using odds ratios as effect sizes for meta-analysis of dichotomous data: A primer on methods and issues Psychological Methods 3 339–353 10.1037/1082-989X.3.3.339CrossRef Google Scholar

Hasselblad, V., Hedges, L. V., (1995). Meta-analysis of screening and diagnostic tests Psychological Bulletin 117 167–178 10.1037/0033-2909.117.1.167 7870860CrossRef Google Scholar PubMed

Heinen, T. (1993). Discrete latent variable models. Tilburg University Press.Google Scholar

Hemker, B. T., Sijtsma, K., Molenaar, I. W., (1995). Selection of unidimensional scales from a multidimensional item bank in the polytomous Mokken IRT model Applied Psychological Measurement 19 337–352 10.1177/014662169501900404CrossRef Google Scholar

Hemker, B. T., Sijtsma, K., Molenaar, I. W., Junker, B. W., (1996). Polytomous IRT models and monotone likelihood ratio of the total score Psychometrika 61 679–693 10.1007/BF02294042CrossRef Google Scholar

Hemker, B. T., Sijtsma, K., Molenaar, I. W., Junker, B. W., (1997). Stochastic ordering using the latent trait and the sum score in polytomous IRT models Psychometrika 62 331–347 10.1007/BF02294555CrossRef Google Scholar

Hoijtink, H. (2011). Informative hypotheses: Theory and practice for behavioral and social scientists. CRC Press. https://doi.org/10.1201/b11158 CrossRef Google Scholar

Hoijtink, H., Molenaar, I. W., (1997). A multidimensional item response model: Constrained latent class analysis using the Gibbs sampler and posterior predictive checks Psychometrika 62 171–189 10.1007/BF02295273CrossRef Google Scholar

Holland, P. W., (1981). When are item response models consistent with observed data? Psychometrika 46 79–92 10.1007/BF02293920CrossRef Google Scholar

Holland, P. W., (1990). On the sampling theory foundations of item response theory models Psychometrika 55 577–601 10.1007/BF02294609CrossRef Google Scholar

Holland, P. W., Rosenbaum, P. R., (1986). Conditional association and unidimensionality in monotone latent variable models The Annals of Statistics 14 1523–1543 10.1214/aos/1176350174CrossRef Google Scholar

Huynh, H., (1994). A new proof for monotone likelihood ratio for the sum of independent Bernoulli random variables Psychometrika 59 77–79 10.1007/BF02294266CrossRef Google Scholar

Joag-Dev, K., (1983). Independence via uncorrelatedness under certain dependence structures The Annals of Probability 11 1037–1041 10.1214/aop/1176993452CrossRef Google Scholar

Jöreskog, K. G., Moustaki, I., (2001). Factor analysis of ordinal variables: A comparison of three approaches Multivariate Behavioral Research 36 347–387 10.1207/S15327906347-387 26751181CrossRef Google Scholar PubMed

Junker, B. W., (1991). Essential independence and likelihood-based ability estimation for polytomous items Psychometrika 56 255–278 10.1007/BF02294462CrossRef Google Scholar

Junker, B. W., (1993). Conditional association, essential independence and monotone unidimensional item response models The Annals of Statistics 21 1359–1378 10.1214/aos/1176349262CrossRef Google Scholar

Junker, B. W., Ellis, J. L., (1997). A characterization of monotone unidimensional latent variable models The Annals of Statistics 25 1327–1343 10.1214/aos/1069362751CrossRef Google Scholar

Junker, B. W., Sijtsma, K., (2000). Latent and manifest monotonicity in item response models Applied Psychological Measurement 24 65–81 10.1177/01466216000241004CrossRef Google Scholar

Junker, B. W., Sijtsma, K., (2001). Nonparametric item response theory in action: An overview of the special issue Applied Psychological Measurement 25 211–220 10.1177/01466210122032028CrossRef Google Scholar

Karlin, S. (1968). Total positivity. Stanford University Press.Google Scholar

Karlin, S., Rinott, Y., (1980). Classes of orderings of measures and related correlation inequalities. I. Multivariate totally positive distributions Journal of Multivariate Analysis 10 467–498 10.1016/0047-259X(80)90065-2CrossRef Google Scholar

Katsikatsou, M., Moustaki, I., Yang-Wallentin, F., Jöreskog, K. G., (2012). Pairwise likelihood estimation for factor analysis models with ordinal data Computational Statistics & Data Analysis 56 4243–4258 10.1016/j.csda.2012.04.010CrossRef Google Scholar

Kelderman, H., (1988). Common item equating using the loglinear Rasch model Journal of Educational Statistics 13 319–336 10.3102/10769986013004319CrossRef Google Scholar

Kimeldorf, G., Sampson, A. R., (1989). A framework for positive dependence Annals of the Institute of Statistical Mathematics 41 31–45 10.1007/BF00049108CrossRef Google Scholar

Klugkist, I., Hoijtink, H., (2007). The Bayes factor for inequality and about equality constrained models Computational Statistics & Data Analysis 51 6367–6379 10.1016/j.csda.2007.01.024CrossRef Google Scholar

Lazarsfeld, P. F. (1950). The logical and mathematical foundation of latent structure analysis & The interpretation and mathematical foundation of latent structure analysis. In Stouffer, S. A. Guttman, L. Suchman, E. A. Lazarsfeld, P. F. Star, S. A. & Clausen, J. A. (Eds.), Measurement and Prediction (pp. 362–472). Princeton: Princeton University Press.Google Scholar

Ligtvoet, R., (2012). An isotonic partial credit model for ordering subjects on the basis of their sum scores Psychometrika 77 479–494 10.1007/s11336-012-9272-6 27519777CrossRef Google Scholar PubMed

Ligtvoet, R., Vermunt, J. K., (2012). Latent class models for testing monotonicity and invariant item ordering for polytomous items British Journal of Mathematical and Statistical Psychology 65 237–250 10.1111/j.2044-8317.2011.02019.x 21651508CrossRef Google Scholar PubMed

Loevinger, J., (1948). The technic of homogeneous tests compared with some aspects of “scale analysis” and factor analysis Psychological Bulletin 45 507–530 10.1037/h0055827 18893224CrossRef Google Scholar PubMed

Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Addison-Wesley.Google Scholar

Maraun, M. D., Jackson, J. S. H., Luccock, C. R., Belfer, S. E., Chrisjohn, R. D., (1998). CA and SPOD for the analysis of tests comprised of binary items Educational and Psychological Measurement 58 916–928 10.1177/0013164498058006004CrossRef Google Scholar

Maydeu-Olivares, A., Joe, H., (2005). Limited-and full-information estimation and goodness-of-fit testing in

2^{n}

contingency tables: A unified framework Journal of the American Statistical Association 100 1009–1020 10.1198/016214504000002069CrossRef Google Scholar

Mellenbergh, G. J., (1995). Conceptual notes on models for discrete polytomous item responses Applied Psychological Measurement 19 91–100 10.1177/014662169501900110CrossRef Google Scholar

Mokken, R. J. (1971). A theory and procedure of scale analysis. Walter de Gruyter. https://doi.org/10.1515/9783110813203 CrossRef Google Scholar

Mokken, R. J., Lewis, C., (1982). A nonparametric approach to the analysis of dichotomous item responses Applied Psychological Measurement 6 417–430 10.1177/014662168200600404CrossRef Google Scholar

Mokken, R. J., Lewis, C., Sijtsma, K., (1986). Rejoinder to “The Mokken scale: A critical discussion” Applied Psychological Measurement 10 279–285 10.1177/014662168601000306CrossRef Google Scholar

Molenaar, I. W. (1997). Nonparametric models for polytomous responses. In Handbook of modern item response theory (pp. 369–380). https://doi.org/10.1007/978-1-4757-2691-6_21 CrossRef Google Scholar

Molenaar, I. W., (2004). About handy, handmade and handsome models Statistica Neerlandica 58 1–20 10.1046/j.0039-0402.2003.00110.xCrossRef Google Scholar

Molenaar, I. W., & Sijtsma, K. (2000). User’s manual MSP5 for Windows. iecProGAMMA.Google Scholar

Myung, I. J., Pitt, M. A., & Kim, W. (2005). Model evaluation, testing and selection. In Handbook of cognition (pp. 422–436). https://doi.org/10.4135/9781848608177.n19 CrossRef Google Scholar

Pitt, M. A., Myung, I. J., Zhang, S., (2002). Toward a method of selecting among computational models of cognition Psychological Review 109 472–491 10.1037/0033-295X.109.3.472 12088241CrossRef Google Scholar

Ponocny, I., (2001). Nonparametric goodness-of-fit tests for the Rasch model Psychometrika 66 437–459 10.1007/BF02294444CrossRef Google Scholar

Preacher, K. J., (2006). Quantifying parsimony in structural equation modeling Multivariate Behavioral Research 41 227–259 10.1207/s15327906mbr4103_1 26750336CrossRef Google Scholar PubMed

Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Nielsen & Lydiche.Google Scholar

Rosenbaum, P. R., (1984). Testing the conditional independence and monotonicity assumptions of item response theory Psychometrika 49 425–435 10.1007/BF02306030CrossRef Google Scholar

Roskam, E. E., Van den Wollenberg, A. L., Jansen, P. G. W., (1986). The mokken scale: A critical discussion Applied Psychological Measurement 10 265–277 10.1177/014662168601000305CrossRef Google Scholar

Sarkar, T. K. (1969). Some lower bounds of reliability. Technical Report 124 Department of Operations Research and Statistics, Stanford University.Google Scholar

Schwarz, G., (1978). Estimating the dimension of a model The Annals of Statistics 6 461–464 10.1214/aos/1176344136CrossRef Google Scholar

Sijtsma, K., Molenaar, I. W., (2002). Introduction to nonparametric item response theory Sage 10.4135/9781412984676Google Scholar

Sijtsma, K., Van der Ark, L. A., (2017). A tutorial on how to do a mokken scale analysis on your test and questionnaire data British Journal of Mathematical and Statistical Psychology 70 137–158 10.1111/bmsp.12078 27958642CrossRef Google Scholar

Sinharay, S., Haberman, S. J., (2014). How often is the misfit of item response theory models practically significant? Educational Measurement: Issues and Practice 33 23–35 10.1111/emip.12024CrossRef Google Scholar

Smits, I. A. M., Timmerman, M. E., Meijer, R. R., (2012). Exploratory Mokken scale analysis as a dimensionality assessment tool: Why scalability does not imply unidimensionality Applied Psychological Measurement 36 516–539 10.1177/0146621612451050CrossRef Google Scholar

Stout, W., (1987). A nonparametric approach for assessing latent trait unidimensionality Psychometrika 52 589–617 10.1007/BF02294821CrossRef Google Scholar

Stout, W., (1990). A new item response theory modeling approach with applications to unidimensionality assessment and ability estimation Psychometrika 55 293–325 10.1007/BF02295289CrossRef Google Scholar

Stout, W., (2002). Psychometrics: From practice to theory and back Psychometrika 67 485–518 10.1007/BF02295128CrossRef Google Scholar

Straat, J. H., Van der Ark, L. A., Sijtsma, K., (2013). Comparing optimization algorithms for item selection in Mokken scale analysis Journal of Classification 30 75–99 10.1007/s00357-013-9122-yCrossRef Google Scholar

Straat, J. H., Van der Ark, L. A., Sijtsma, K., (2016). Using conditional association to identify locally independent item sets Methodology 12 117–123 10.1027/1614-2241/a000115CrossRef Google Scholar

Suáres-Falcón, J. C., Glas, C. A. W., (2003). Evaluation of global testing procedures for item fit to the Rasch model British Journal of Mathematical and Statistical Psychology 56 127–143 10.1348/000711003321645395CrossRef Google Scholar

Tijmstra, J., Hessen, D. J., Van der Heijden, P. G. M., Sijtsma, K., (2013). Testing manifest monotonicity using order-constrained statistical inference Psychometrika 78 83–97 10.1007/s11336-012-9297-x 25107519CrossRef Google Scholar PubMed

Tijmstra, J., Hoijtink, H., Sijtsma, K., (2015). Evaluating manifest monotonicity using Bayes factors Psychometrika 80 880–896 10.1007/s11336-015-9475-8 26377889 4644216CrossRef Google Scholar PubMed

Ünlü, A., (2008). A note on monotone likelihood ratio of the total score variable in unidimensional item response theory British Journal of Mathematical and Statistical Psychology 61 179–187 10.1348/000711007X173391 17535477CrossRef Google Scholar PubMed

Van den Wollenberg, A. L. (1979). The Rasch model and time-limit tests: An application and some theoretical contributions. Ph.D. thesis, Katholieke Universiteit Nijmegen.Google Scholar

Van der Ark, L. A., (2005). Stochastic ordering of the latent trait by the sum score under various polytomous irt models Psychometrika 70 283–304 10.1007/s11336-000-0862-3CrossRef Google Scholar

Van der Ark, L. A., (2007). Mokken scale analysis in R Journal of Statistical Software 20 (11) 1–19 10.18637/jss.v020.i11Google Scholar

Verhelst, N. D., Hatzinger, R., Mair, P., (2007). The Rasch sampler Journal of Statistical Software 20 (4) 1–14 10.18637/jss.v020.i04CrossRef Google Scholar

Vermunt, J. K., (2001). The use of restricted latent class models for defining and testing nonparametric and parametric item response theory models Applied Psychological Measurement 25 283–294 10.1177/01466210122032082CrossRef Google Scholar

Verweij, A. C., Sijtsma, K., Koops, W., (1996). A Mokken scale for transitive reasoning suited for longitudinal research International Journal of Behavioral Development 19 219–238 10.1177/016502549601900115CrossRef Google Scholar

Walkup, D. W., (1968). Minimal conditions for association of binary variables SIAM Journal on Applied Mathematics 16 1394–1403 10.1137/0116115CrossRef Google Scholar

Warrens, M. J., (2008). On association coefficients for 2

\times

2 tables and properties that do not depend on the marginal distributions Psychometrika 73 777–789 10.1007/s11336-008-9070-3 20046834 2798022CrossRef Google Scholar

Wright, B. D., (1977). Solving measurement problems with the Rasch model Journal of Educational Measurement 14 97–116 10.1111/j.1745-3984.1977.tb00031.xCrossRef Google Scholar

Yuan, A., Clarke, B., (2001). Manifest characterization and testing for certain latent properties Annals of Statistics 29 876–898 10.1214/aos/1009210693Google Scholar

Figure 1. Hierarchical relationships between the observable properties, for J binary variables.

Figure 2. The number of restrictions imposed by the observable properties as a function of J.

Figure 3. Log-odds ratios for the properties CA, MTP2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document}, and NC (for each in ascending order), along with the 95% confidence intervals.

Figure 4. Hierarchical relationships between the observable properties (excluding MM), for J≥4\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J\ge 4$$\end{document} binary variables.

Figure 5. Triangular Venn diagram of properties in Fig. 1 (J=3\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document}), with the overlap between NC and MM in gray, with the conditional percentages, given either NC or MM (or both).

Figure 6. Conditional densities (vertically displayed) of the scalability H, given the properties in Fig. 1 (J=3\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$J=3$$\end{document}), along with the percentages H<0.30\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$H<0.30$$\end{document}.

Figure 7. Triangular Venn diagram of properties in Fig. 4, with the conditional percentages, given NC. The properties A and 3-CA and their overlap are shown in gray.

Figure 8. Example of four item response functions that violate M, with the density of Θ\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\Theta $$\end{document} given below. The light-gray areas show the 95% intervals under which the functions were generated before inducing a violation of M. The dark-gray areas (above the local decreases) show the size of the violations of M, with Vi\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_i$$\end{document} expressing the size of the area weighted by the density of the latent variable.

Figure 9. Conditions distributions of the size of the violations of M (VM\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$V_{\text{ M }}$$\end{document}), given that the properties NC, 3-CA, NPC, SPOD, MM, and CA hold (True; with percentage of cases) or are violated (False). Results for the properties 3-SPOD and A are similar as for 3-CA, and the results for MTP2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\hbox {MTP}_2$$\end{document} are similar as for CA.

Figure 10. Empirical confidence regions of the size of the violation of M against the size of the violation of property MLR (on a logarithmic scale).

Article contents

Incomplete Tests of Conditional Association for the Assessment of Model Assumptions

Abstract

Keywords

1. Properties of Multivariate Dependence

1.1. Observable Properties

1.1.1. (Conditionally) Associated Random Variables

1.1.2. Multivariate Totally Positive

1.1.3. Nonnegative Covariances

1.1.4. Manifest Monotonicity

1.1.5. Strongly Positive Orthant Dependency

1.1.6. Nonnegative Partial Correlations

1.2. Relationships Between the Observable Properties

1.3. Practical Considerations

1.4. Properties for Trivariate Distributions of Item Triplets

2. Incomplete Tests of Conditional Association

2.1. On the Complexity of the Observable Properties

2.2. Scalability Coefficient

2.3. Manifest Monotonicity

2.4. The Distributions of Subsets of Item Variables

3. Sensitivity to Model Violations

3.1. Violations of Local Independence

3.2. Violations of Unidimensionality

3.3. Violations of Monotonicity

3.3.1. Procedure

3.3.2. Results

3.3.3. The Monotone Likelihood Ratio Property

3.4. Violations of Local Independence (Continued)

3.4.1. Procedure

3.4.2. Results

4. Discussion

4.1. Complexities of the Observable Properties

4.2. On the Sensitivity to Model Violations

4.3. Implications

4.4. Conclusion

Funding

Appendix

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests