Hostname: page-component-cd9895bd7-dzt6s Total loading time: 0 Render date: 2025-01-06T04:17:48.117Z Has data issue: false hasContentIssue false

Adventitious Error and Its Implications for Testing Relations Between Variables and for Composite Measurement Outcomes

Published online by Cambridge University Press:  01 January 2025

Paul De Boeck*
Affiliation:
The Ohio State University
Michael L. DeKay
Affiliation:
The Ohio State University
Jolynn Pek
Affiliation:
The Ohio State University
*
Correspondence should be made to Paul De Boeck, Department of Psychology, The Ohio State University, 1827 Neil Avenue, Columbus, OH43210, USA. Email: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

Wu and Browne (Psychometrika 80(3):571–600, 2015. https://doi.org/10.1007/s11336-015-9451-3; henceforth W & B) introduced the notion of adventitious error to explicitly take into account approximate goodness of fit of covariance structure models (CSMs). Adventitious error supposes that observed covariance matrices are not directly sampled from a theoretical population covariance matrix but from an operational population covariance matrix. This operational matrix is randomly distorted from the theoretical matrix due to differences in study implementations. W & B showed how adventitious error is linked to the root mean square error of approximation (RMSEA) and how the standard errors (SEs) of parameter estimates are augmented. Our contribution is to consider adventitious error as a general phenomenon and to illustrate its consequences. Using simulations, we illustrate that its impact on SEs can be generalized to pairwise relations between variables beyond the CSM context. Using derivations, we conjecture that heterogeneity of effect sizes across studies and overestimation of statistical power can both be interpreted as stemming from adventitious error. We also show that adventitious error, if it occurs, has an impact on the uncertainty of composite measurement outcomes such as factor scores and summed scores. The results of a simulation study show that the impact on measurement uncertainty is rather small although larger for factor scores than for summed scores. Adventitious error is an assumption about the data generating mechanism; the notion offers a statistical framework for understanding a broad range of phenomena, including approximate fit, varying research findings, heterogeneity of effects, and overestimates of power.

Type
Theory & Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Copyright
Copyright © 2024 The Author(s)

A common problem of estimating a covariance structure model (CSM; e.g., a factor model or latent growth curve model) is that the estimated model is usually somewhat misspecified (e.g., see MacCallum, Reference MacCallum2003). The specified model is at best only an approximate model of the data, which is an exemplification of Box’s (Reference Box1976) statement that all models are wrong (but some are useful). Goodness-of-fit (GOF) indices have been developed to evaluate how well the estimated CSM fits the data. For example, the root mean squared error of approximation (RMSEA) is a popular GOF index, and it is common practice to accept an estimated model as sufficiently approximate if certain cutoff values of GOF indices are met (e.g., RMSEA 0.05 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\textrm{RMSEA}\le 0.05$$\end{document} ; Browne & Cudeck, Reference Browne, Cudeck, Bollen and Long1993). One reason for a model being approximate is that small sources of covariance, which could be treated as minor factors or error covariances, are omitted from the specified model and treated as nuisance instead (e.g., Tucker et al., Reference Tucker, Koopman and Linn1969). This practice can lead the model to be rejected based on statistical criteria such as GOF χ 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\chi ^{2}$$\end{document} test.

There are two different approaches to deal with imperfect model fit. One approach is model respecification, for example, following suggestions based on modification indices (Sörbom, Reference Sörbom1989), which may result in model extensions that no longer lead to a statistical rejection of perfect model fit. However, potential issues are the power of the GOF test and the possible ad hoc nature and non-replication of model modifications (e.g., MacCallum, Reference MacCallum1986). An alternative approach, which we follow here, is to accept the approximate nature of models (Browne and Cudeck, Reference Browne, Cudeck, Bollen and Long1993) while allowing that models can sometimes have a perfect fit (e.g., Bollen, Reference Bollen1989; Jöreskog, Reference Jöreskog1969). Although the approximate nature of models and the consequent notion of approximate fit are broadly accepted (e.g., using the RMSEA), the discrepancy between the theoretical and approximate models has received limited statistical formalization (but see Cudeck & Henly, Reference Cudeck and Henly1991).

Wu and Browne (Reference Wu and Browne2015; henceforth W &B) proposed the concept of adventitious error within a statistical framework for formalizing the approximate nature of CSMs instead of either simply ignoring or explicitly accounting for minor sources of covariance beyond sampling variation (i.e., specifying a more complex model). W &B formalized adventitious error as a random distortion of the “theoretical population” covariance matrix due to the specificities of each implemented study but unrelated to sampling variation, yielding a different “operational population” covariance matrix for each study. Sampling variation for each study is centered at the operational population covariance matrix and not at the theoretical population covariance matrix. In short, adventitious error and sampling variation are separate, additive sources of uncertainty. If approximate fit stems from adventitious error—a random source—rather than a systematic source, it does not follow that the model parameters are biased: “Under the assumption of large sample size and small adventitious error, the estimators for covariance structure parameters are consistent and asymptotically normally distributed” (W &B, p. 595).

Embracing adventitious error requires the analyst to abandon the ideas of a fixed reality and fixed population models. Fixed population models are not sufficiently flexible to take into account the undeniable complexity of realities that interest psychologists and other social scientists. To model the complexity explicitly and fully, the models would need to be as complex as the phenomena being modeled. A common but necessarily incomplete approach to explaining deviations from a fixed population model (i.e., beyond sampling variation) is to include moderators as explanatory variables, so that variations in one or more effects might be attributed to different levels of one or more moderators. Such patterns can make interpretation difficult and raise concerns about replicability.

The random nature of adventitious error allows for a different approach. Adventitious error does not provide specific conceptual explanations for deviations from a fixed population model because randomness does not qualify as an explanation. Instead, adventitious error provides a statistical framework that quantifies the approximate nature of models, resulting in downstream adjustments to inferential devices, including more conservative significance tests and wider confidence intervals. We argue that, in contrast, fixed population models overestimate the certainty in such inferences (i.e., that fixed population models are misspecified).

There are precedents for a randomness approach. For example, the scientific community has been willing to accept that higher-order interactions are absorbed in a random term in ANOVA (e.g., McGraw & Wong, Reference McGraw and Wong1996), to accept random-effect models in meta-analysis (e.g., Borenstein et al., Reference Borenstein, Hedges, Higgins and Rothstein2009), and to approach large-scale replication projects with random-effect modeling (e.g., McShane et al., Reference McShane, Böckenholt and Hansen2022). Perhaps the most important precedent is the concept of measurement error, which is so well established that one would not even think of trying to describe every specific source of error. Instead, we treat random deviations due to measurement error as if the term “measurement error” is self-explanatory.

Adventitious error is distinct from measurement error because adventitious error refers to the covariance among variables in a study. Although conceptually distinct, adventitious error and measurement error share the same feature of randomness, and, as will be explained, adventitious error has consequences for composite measurement outcomes as well. Just as measurement error refers to a specific combination of influences and co-occurrences associated with an individual measurement outcome, adventitious error refers to a specific combination of influences and co-occurrences associated with the covariance of variables in an individual study. Although different from measurement error, the effects of adventitious error can trickle down to composite measurement outcomes, as will be shown.

Overview

Our contribution here is to discuss and illustrate possible implications of adventitious error in a context broader than the CSM framework. For example, descriptive correlations between variables in the absence of an explicit model for the correlations should also be subject to adventitious error. There is no reason why adventitious error should occur only when variables are expressed within in a CSM model or why adventitious error should not also be relevant for the uncertainty of relations between variables in a more general sense. We may expect that distortions to variance-covariance values among manifest variables (MVs), due to the specificities of individual studies, are independent of the type of analysis that follows the data collection. Thus, the occurrence of adventitious error does not depend on the use of a CSM and is not necessarily reflected in the GOF of the model. For example, adventitious error may lead to varying relations between a pair of variables (e.g., varying correlations) based on the specificities of studies, but it would not lead to GOF issues in a simple regression model because the model is saturated (i.e., GOF will be perfect). Stated differently, the estimated regression effect may deviate from the true effect due to adventitious error even when the model perfectly fits the data. We accept that random distortions of covariances among MVs are independent of the data-analytic framework applied to the data. The CSM context is a special case in which adventitious error is reflected in GOF if the operational population covariance matrix deviates from the theoretical population covariance matrix in a way that is described in Sect. 1.1. Importantly, W &B also showed that the RMSEA GOF index gives us an indication of the extent of adventitious error, as we explain in the same section.

Our further contribution consists of three parts. In Part 1, we introduce a formalization of adventitious error and its impact on uncertainty, which is based on the work of W &B on CSMs. In Part 2, we illustrate the consequences of accepting adventitious error for the relations between variables, regardless of whether these relations are modeled within the CSM framework. Three different sections of Part 2 are devoted to illustrations of the consequences, with each section designed to answer a different question:

Question 1: Are the inferential consequences of adventitious error in a CSM context generalizable beyond the CSM context, such as to simple relations between MVs?

Question 2: How might one interpret the heterogeneity of effects such as correlations in meta-analysis from the perspective of adventitious error?

Question 3: What are the consequences of adventitious error for statistical power for testing the significance of correlations and regression slopes in simple regression?

These three sections provide only simple illustrations to give the reader an initial idea of the possible consequences of adventitious error.

Finally, in Part 3, we discuss a way to estimate the impact of adventitious error on the uncertainty of composite measurement outcomes (e.g., summed scores and factor scores). The uncertainty of measurement outcomes depends on their variability. A well-known source of variability is measurement error, formulated as deviations from the expected value of measurement outcomes in a standard condition. We believe that adventitious error also contributes to measurement uncertainty, although in an additional way compared with the common measurement error notion.

As a precautionary note, we want to clarify our claims. First, although it would follow from adventitious error that CSMs are approximate and that minor nuisance sources of covariance occur, these latter phenomena do not prove the occurrence of adventitious error. However, we do claim that phenomena other than those that led to the notion of adventitious error, such as the heterogeneity of effects and the overestimation of statistical power can also be explained as stemming from adventitious error. Second, the adventitious error rate cannot be easily estimated. Although W &B derived a way to connect the RMSEA GOF index to the adventitious error rate in the context of CSMs, we do not know of any other model or analysis for which the adventitious error rate is mathematically related to a statistic derived from the analysis. Furthermore, we recognize that most empirical studies do not make use of CSMs. For all these reasons, we utilize a benchmark rate of adventitious error to illustrate its effects, as explained in Sect. 1.2.

1. Formalization of Adventitious Error

1.1. Sampling Error and Adventitious Error for CSMs

The common statistical approach for CSMs is to assume multivariate normality for data associated with a p × p \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p\times p$$\end{document} population covariance matrix Σ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }$$\end{document} , with p as the number of MVs. A p × p \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p\times p$$\end{document} observed covariance matrix S \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{S}}$$\end{document} is sampled from a Wishart distribution with Σ / n \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }/n$$\end{document} as its scale matrix and n (sample size N minus 1) as its degrees of freedom: S W ( Σ / n , n ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{S}}\,\sim W(\varvec{\varSigma }/n,n)$$\end{document} . Stated differently, the sampling distribution for S \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{S}}$$\end{document} is centered at Σ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }$$\end{document} with dispersion 1/n. Increasing n decreases sampling error.

Figure 1 gives a conceptual representation of how the observed covariance matrix is generated in the presence of adventitious error. A theoretical population model covariance matrix Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} is first randomly distorted in each study, indexed by s, so that the operational population covariance matrix Σ s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}$$\end{document} deviates from the theoretical population covariance matrix Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} due to the specificities of each study (i.e., how each study is uniquely implemented). Next, the observed covariance matrix S s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{S}}_{{\varvec{s}}}$$\end{document} is sampled from Σ s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}$$\end{document} . The random distortion of the theoretical p × p \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p\times p$$\end{document} covariance matrix Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} is based on an inverse Wishart distribution: Σ s I W ( m Ω , m ) , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s} {\sim } IW(m\varvec{\varOmega },m),$$\end{document} with m Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m\varvec{\varOmega }$$\end{document} as the scale matrix and m as its degrees of freedom. The adventitious error distribution is centered at Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} , with dispersion 1/m. Thus, the operational covariance matrix Σ s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}$$\end{document} is randomly perturbed from Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} , and the observed covariance matrix S s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{S}}_{s}$$\end{document} is then sampled from Σ s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}$$\end{document} and not from Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} .

Notes. (1) The expected value of the IW distribution centered at Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} is not Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} , but m Ω / ( m - p - 1 ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m\varvec{\varOmega }/(m-p-1)$$\end{document} instead. (2) Although the IW distribution is used within a Bayesian framework as a conjugate prior for covariance matrices, the distribution as such also makes sense in a frequentist perspective. (3) Adventitious error makes the uncertainty about parameter estimates aleatory. Aleatory uncertainty about a parameter estimate cannot be reduced with more information (see O’Hagan, Reference O’Hagan2004, for more details). Specifically, the variability of relations between variables cannot be reduced by more information, such as when multiple studies are conducted.

Figure 1 Sampling scheme for an operational population covariance matrix and an observed covariance matrix. n = sample size N - 1 and m is the degrees of freedom of an inverse Wishart distribution.

1.2. Standard Errors in the Presence of Adventitious Error

W &B derived a crucial link between GOF and uncertainty due to adventitious error (if it occurs) within the context of CSM. The RMSEA is related to the dispersion of the IW distribution with scale matrix Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} :

(1) RMSEA 2 = 1 / m . \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \text {RMSEA}^{2}=1/m. \end{aligned}$$\end{document}

The RMSEA has been developed to quantify imperfect model fit. An RMSEA of 0.00 indicates perfect fit, values of 0.05 or lower indicate good fit, and values higher than 0.05 up to 0.08 indicate satisfactory fit. Values beyond 0.08 suggest poor GOF (see Browne & Cudeck, Reference Browne, Cudeck, Bollen and Long1993; Hu & Bentler, 1999). These benchmark values are rather arbitrary and may differ depending on the authors. For example, Hu & Bentler used 0.06 instead of 0.05 as a cutoff value. Although a CSM may be satisfactory in terms of GOF in the presence of adventitious error, there are implications for the uncertainty of inferences regarding CSM parameters (e.g., structural parameters, correlations among factors). It follows from W &B (p. 590) that in the presence of adventitious error, the standard errors (SEs) of parameter estimates increase by a factor

(2) q = ( 1 / m + 1 / n ) / ( 1 / n ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} q=\sqrt{(1/m+1 /n)/(1/n)} \end{aligned}$$\end{document}

so that the new SE is the traditional SE × q \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}\times q$$\end{document} . The first term within parentheses under the square root in Eq. (2), 1/m, is the extra source of variation (and thus uncertainty) in parameter estimates. Hereafter, SEs based on Eq. (2) are called augmented SEs. Recall that for CSMs, the dispersion of the IW distribution corresponds with RMSE A 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\textrm{A}^{\textrm{2}}$$\end{document} ; see Eq. (1).

Whether or not a measurement model is estimated, it is idealistic to assume that the MVs used in a study have an observed covariance structure that is directly sampled from the theoretical population model Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} . A more realistic assumption is that in each study, an operational population model applies, so that an operational population covariance matrix Σ s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}$$\end{document} underlies the covariances among the observed MVs.

To the extent that an RMSEA 0.08 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\textrm{RMSEA}\le 0.08$$\end{document} is considered acceptable, one can use the 0.08 value to determine a benchmark value of the adventitious error rate when no RMSEA value is available or when one moves out of the CSM framework, such as for a simple regression analysis. Although somewhat arbitrary, in what follows, the benchmark value of 0.08 is used to illustrate the consequences of adventitious error. Based on Eq. (1), the dispersion of the IW distribution that would lead to an RMSEA of 0.08 is 1 / m = 0.0064 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1 / m=0.0064$$\end{document} . The corresponding degrees of freedom are approximately 1 / 0.0064 = 156 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1/0.0064=156$$\end{document} , which is to say that the uncertainty associated with a dispersion value of 0.0064 is comparable to the sampling error resulting from a sample size of approximately 157.

To illustrate the size of distortions that can result from this benchmark dispersion value, we conducted a small simulation study for a 4 × 4 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$4\times 4$$\end{document} theoretical covariance matrix ( Ω ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega })$$\end{document} with 1.00 on the diagonal and 0.49 in the off-diagonal cells (as for the MVs in Sect. 2.1) and with an IW dispersion of 1 / m = 0.0064 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1/m = 0.0064$$\end{document} . Based on 1000 replications, the average absolute deviation from the theoretical covariance matrix across all cells (including the diagonal cells) was 0.079. For nearly all illustrations in Parts 2 and 3, we use a dispersion of 0.0064. Occasionally, we also use an IW dispersion of 0.0025, corresponding to an RMSEA of 0.05 for CSMs. We do not intend to imply that in the absence of an RMSEA, a benchmark adventitious error rate should be adopted for inferences. Instead, we are interested in benchmark adventitious error rates only for illustrative purposes.

2. Adventitious Error and Relations Between Variables

2.1. Standard Errors for CSM and Linear Regression Parameter Estimates

In this section, we use small simulation studies to investigate whether the inflation factor q from Eq. (2) for the SEs of parameter estimates in the presence of adventitious error applies to CSMs (as already verified by W &B) and to other types of analysis such as simple linear regression. W &B derived the factor q for CSMs based on statistical theory to quantify the augmented uncertainty due to adventitious error (i.e., SE × q ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}\times q)$$\end{document} . Therefore, we call factor q from Eq. (2) the theoretical augmented uncertainty factor (TAUF). In addition, we generate data subject to adventitious error with the benchmark IW dispersion ( 1 / m = 0.0064 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1/m = 0.0064$$\end{document} ). Doing so allows us to obtain a simulation-based augmented uncertainty factor (SAUF), not derived from Eq. (2) but estimated from simulated data dividing the simulation-based SE with adventitious error by the simulation-based SE without adventitious error. SAUF, q ^ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{q}$$\end{document} , is a simulation-based estimate of the TAUF, q. The purpose of the simulations is to answer Question 1 of Part 2: “Are the inferential consequences of adventitious error in a CSM context generalizable beyond the CSM context?”

We compare SAUF and TAUF for a CSM and for two simple linear regression models. For the CSM, we expect SAUF to correspond closely to TAUF, in line with the findings by W &B. For the simple linear regressions, the comparison of SAUF and TAUF gives us an indication of how well the TAUF for the benchmark adventitious error rate applies beyond the CSM context. These small simulations serve as a proof of concept, providing illustrations that might help to familiarize the reader with the augmented uncertainty due to adventitious error.

More specifically, we consider three cases: 1, 2a, and 2b, as depicted in Fig. 2. Case 1 concerns two MVs, an independent variable (IV), X, and a dependent variable (DV), Y, using simple linear regression to estimate the effect of the IV. Case 2a concerns a CSM with a manifest IV, X,  and a latent DV, θ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\theta $$\end{document} , with four indicator variables, Y 1 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Y_{1}$$\end{document} to Y 4 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Y_{4}$$\end{document} . Case 2b is inspired by case 2a in that the summed score of Y 1 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Y_{1}$$\end{document} to Y 4 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Y_{4}$$\end{document} is used instead of θ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\theta $$\end{document} as a DV, again using simple linear regression to estimate the effect of the IV on the DV.

For each of the three cases, a covariance matrix Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} is specified based on the following assumptions (which should not and—based on some additional simulations—do not appear to affect the main lines of the results):

  • A variance of 1.00 is assigned to all variables in Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} , making Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} a correlation matrix.

  • The covariance of X with Y (case 1) is 0.25 and the covariances of X with Y 1 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Y_{1}$$\end{document} to Y 4 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Y_{4}$$\end{document} (cases 2a and 2b) are 0.35.

  • The loadings for all four indicator variables (cases 2a and 2b) are 0.70, yielding covariances of 0.49.

The above choices lead to a 2 × 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$2 \times 2$$\end{document} covariance matrix for case 1 and a 5 × 5 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$5\times 5$$\end{document} covariance matrix for cases 2a and 2b. The simulations consist of four steps, with two levels of sample size ( N = 200 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N=200$$\end{document} and 1000):

Step 1. Randomly distort Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} 2000 times based on an IW dispersion parameter of 1 / m = 0.0064 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1/m =0.0064$$\end{document} , using the riwish() function from the MCMCpack version 1.4-5 package in R (Martin et al., 2011): riwish(m, omega m ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$*\,m)$$\end{document} , where m is the inverse of the dispersion parameter 1/m and omega is Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} . This yields 2000 Σ s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}$$\end{document} matrices for case 1 and 2000 Σ s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}$$\end{document} matrices for case 2a and for case 2b. We report that using 1 / ( m - p + 1 ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1 /{(m-p+1)}$$\end{document} instead of 1/m as the dispersion parameter derived from the RMSEA (as suggested by W &B) leads to nearly identical simulation results. Step 1 refers to the first arrow of adding adventitious error to Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{\Omega }}$$\end{document} in Fig. 1 to obtain Σ s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}$$\end{document} for s = 1 , , 2000 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s =1,\ldots ,2000$$\end{document} .

Step 2. Generate data for two variables for case 1 and five variables for cases 2a and 2b, for each sample size (200 or 1000). Here, we use the mvrnorm( ) function (with empirical = FALSE) from the MASS package in R (Venables & Ripley, Reference Venables and Ripley2002), using the 2000 Σ s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}$$\end{document} matrices and with means for the two variables generated based on a normal distribution with zero mean vector and SD = 0.5 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SD} = 0.5$$\end{document} . The means and SDs are arbitrary values because the mean does not affect the covariances (under multivariate normality) and should therefore not affect the outcome. Step 2 refers to data generation based on Σ s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}$$\end{document} as the operational covariance matrix. In Fig. 1, S s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{S}}_{s}$$\end{document} for s = 1 , , 2000 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s = 1,\ldots , 2000$$\end{document} is the covariance matrix of the generated data and is subject to sampling error.

Step 3. Given the generated variables for case 1 and each sample size, conduct a simple linear regression analysis for each of the 2000 generated data sets. Given the generated variables for cases 2a and each sample size, fit a structural equation model to each data set using the cfa( ) function from the lavaan version 0.6-5 package in R (Rosseel, Reference Rosseel2012) using ML estimation. For case 2b, conduct a simple linear regression with the sum of Y 1 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Y_{1}$$\end{document} to Y 4 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Y_{4}$$\end{document} as the DV for each of the 2000 data sets. For all three analyses (cases 1, 2a, and 2b), the SD of the 2000 estimated effects is a simulation-based estimate of the augmented SE (i.e., SE × q ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}\times q)$$\end{document} that serves as the numerator in the expression for SAUF. In sum, Step 3 provides us with a simulation-based estimate of the augmented SE with and without the use of a CSM (case 2a versus cases 1 and 2b, respectively).

Step 4. Repeat Step 2 for cases 1, 2a, and 2b, based on the corresponding Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} matrix instead of the Σ s \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}$$\end{document} matrices; in other words, generate sample covariance matrices without distorting Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} (i.e., omitting adventitious error). Given these sample covariance matrices, conduct the same analyses as in Step 3. For each of the three cases, determine the SD of the effect across the 2000 analyses. These SDs are simulation-based estimates of the traditional SE, serving as the denominator of the SAUF (i.e., of q ^ ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{q})$$\end{document} . We also used two alternative methods to generate the traditional SE estimates: (a) the median SE across the 2000 analyses from Step 3 and (b) the median SE across the corresponding 2000 analyses from Step 4. Because these two alternative approaches yield very similar SE estimates that are also very similar to the SD-based estimates from Step 4, we only report the latter.

Figure 2 Cases 1, 2a and 2b of relations between variables.

Based on Steps 3 and 4, we have the necessary information to calculate the SAUF in each case, to be compared with the TAUF for a dispersion parameter of 0.0064. The results appear in Table 1. The simulation-based factors (SAUFs) corroborate the theoretical factors (TAUFs) based on Eq. (2), which are 1.51 and 2.72 for N = 200 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N= 200$$\end{document} and N = 1000 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N = 1000$$\end{document} , respectively. The larger ratios for N = 1000 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N = 1000$$\end{document} may seem counterintuitive, but the traditional SE shrinks with sample size, whereas the 1/m term in the q factor in Eq. (2) does not shrink with sample size.

The results are slightly different depending on the case (1, 2a, or 2b) and the sample size ( N = 200 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N = 200$$\end{document} or 1000). The SAUF is slightly larger for case 2a compared to the other two cases, at least for the larger sample size, and it is also slightly larger than the TAUF. More extensive simulations would be required to find out whether this observation replicates across other conditions.

The finding that the SAUF is a good approximation of the TAUF means that one can use TAUF to indicate how much the SE needs to be augmented, conditional on the sample size and the chosen adventitious error rate, not only for CSM parameter estimates but also for linear regression. We would expect this result to hold for adventitious error rates other than the benchmark rate used in the simulations.

Table 1 Theoretical and simulation-based augmented uncertainty factors for SEs.

TAUF = theoretical augmented uncertainty factor, which equals q in Eq. (2). SAUF = simulation-based augmented uncertainty factor, q ^ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{q}$$\end{document} . For both TAUF and SAUF, dispersion 1 / m = 0.0064 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1 /m= 0.0064$$\end{document} .

2.2. Adventitious Error and Heterogeneity in Meta-Analysis

When an effect is estimated across several studies in a meta-analysis, either a fixed-effect model or a random-effect model is used. The former model specifies a single effect size for all studies. The latter model allows for effects to vary across studies. In the random-effect model, the standard error of the estimated summary effect (the mean or weighted mean) across studies consists of two components: one that is attributed to sampling variation and another that stems from true variation of the effect across studies (i.e., heterogeneity of the effects). The purpose of this section is to answer Question 2 of Part 2: “How might one interpret the heterogeneity of effects such as correlations in meta-analysis from the perspective of adventitious error?”

To discuss a possible link between adventitious error and the heterogeneity of effects, we use the Cohen’s d scale. Let us denote the first component of the standard error variance as SE samp 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}_\textrm{samp}^{2}$$\end{document} , with a subscript “samp” to indicate that its source is sampling variation. SE samp 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}_\textrm{samp}^{2}$$\end{document} tends to decrease with increasing total number of observations, which is N × S \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N\times S$$\end{document} for S studies with equal sample size N; SE samp 2 = var / NS \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}_\textrm{samp}^{2}={var} / {NS}$$\end{document} , where var is the within-study variance about the study-specific effect. The second component is SE het 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}_\textrm{het}^{2}$$\end{document} , with subscript “het” to indicate that it is based on between-study heterogeneity. SE het 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}_\textrm{het}^{2}$$\end{document} tends to decrease as the number of studies, S, increases: SE het 2 = τ 2 / S \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}_\textrm{het}^{2}={\tau ^{2}} / S$$\end{document} , where τ 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau ^{2}$$\end{document} is the variance of the study-specific effects. Using Cohen’s d scale for meta-analysis, let τ 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau ^{2}$$\end{document} be in the standard scale. The SE of a meta-analyzed estimate of d, SE ( d met ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}(d_\textrm{met})$$\end{document} , is the square root of the sum of the two components: SE d met = SE samp 2 + S E het 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}\left( d_\textrm{met} \right) =\sqrt{\text {SE}_\textrm{samp}^{2}+SE_\textrm{het}^{2}} $$\end{document} . Unless the number of studies is small, a random-effect approach provides a useful starting point (e.g., Dettori et al., 2022; Tufanaru et al., Reference Tufanaru, Munn, Stephenson and Aromataris2015).

The specific question is to what extent does adventitious error, if it occurs, contribute to the heterogeneity of effects, in terms of standardized τ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau $$\end{document} (i.e., the SD of the effects absent of sampling variation)? We focus our query on correlationsFootnote 1 as effects, using Cohen’s d as an effect-size measure, in line with common practice. The equation that expresses a correlation in the scale of Cohen’s d is:

(3) d = 2 r / ( 1 - r 2 ) , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} d={2r} / {(1-r^{2})}, \end{aligned}$$\end{document}

with r to denote the correlation. In the same metric, the corresponding SE for a correlation observed in one study is:

(4) SE d = 2 / N - 1 ( 1 - r 2 ) , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \text {SE}\left( d \right) =2 / \sqrt{\left( N-1\right) (1-r^{2})}, \end{aligned}$$\end{document}

based on Borenstein et al. (Reference Borenstein, Hedges, Higgins and Rothstein2009). The question is how much larger would this SE \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ SE$$\end{document} be in the presence of adventitious error at the benchmark rate ( 1 / m = 0.0064 ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1 / m=0.0064)$$\end{document} . In what follows, we compare the augmented SE with the SE from Eq. (4) to estimate standardized τ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau $$\end{document} as a measure of heterogeneity to quantify the effect of adventitious error in the context of meta-analysis.

For a correlation of .49, for example between the MVs in Sect. 2.1, d = 1.290 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d=1.290$$\end{document} , which is a very large effect. Following Eq. (4), the corresponding SE associated with the correlation for N = 200 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N = 200$$\end{document} is 0.163 on the Cohen’s d scale. Note that this value is for just one study. The TAUF for an adventitious error with the benchmark rate and N = 200 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N=200$$\end{document} is 1.508, so that the SE for one study is augmented to SE × q = 0.163 × 1.508 = 0.246 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}\times q= 0.163 \times 1.508 = 0.246$$\end{document} . In terms of variance, the corresponding values are 0 . 163 2 = 0.027 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.163^{\textrm{2}} =0.027$$\end{document} (non-augmented) and 0.246 2 = 0.061 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^{\textrm{2}} = 0.061$$\end{document} (augmented). For one study, S = 1 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S=1$$\end{document} , standardized τ 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau ^{2}$$\end{document} would be 0.061 - 0.027 = 0.034 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$- 0.027 = 0.034$$\end{document} , and standardized τ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau $$\end{document} would be 0.184. According to overviews in the literature (De Boeck and Jeon, Reference De Boeck and Jeon2018; Stanley et al., Reference Stanley, Carter and Doucouliagos2018; van Erp et al., Reference van Erp, Verhagen, Grassman and Wagenmakers2017), this estimate of τ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau $$\end{document} reflects a moderate amount of heterogeneity. In comparison, the corresponding standardized τ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau $$\end{document} for a correlation of .25 is 0.165, which is not much different from the 0.184 for a correlation of .49. Based on overviews of meta-analysis heterogeneity values, the above values are well within the expected range.

It seems that at least for rather typical correlations and sample sizes, one can expect a non-ignorable level of between-study heterogeneity in the presence of adventitious error. However, we do not claim that heterogeneity of effects is necessarily caused by adventitious error. For example, fixed effects of moderator variables can also lead to heterogeneity. We only wanted to illustrate how a benchmark level of adventitious error is in line with moderate levels of effect-size heterogeneity.

On the condition that a multilevel model for covariance matrices with a dispersion parameter for the IW distribution could be estimated, increasing the number of studies may lead to (a) a more precise estimate of the variation (but not to its reduction) and (b) smaller SEs of the parameter estimates of the theoretical population covariance matrix. Unfortunately, as far as we know, the condition of an estimable model with adventitious error is not met. In contrast, the two effects of increasing the number of studies described in (a) and (b) are available in practice for meta-analysis: more precise estimates of heterogeneity and the overall effect.

2.3. Adventitious Error, Power, and Sample Size

In this section, we outline the implications of adventitious error for power and sample size, using simple regression (like case 1 of Sect. 2.2) as an example. The purpose of this section is to answer Question 3 of Part 2: “What are the consequences of adventitious error for statistical power when testing the significance of correlations and regression slopes in simple regression?”

In particular, we will discuss the case of correlations between two MVs.Footnote 2 If the variables have a variance of one or have equal variance (as in Sect. 2.1), the regression slope b equals the correlation. For calculations, we use the Fisher z transformation, z(r). When adventitious error is ignored, the traditional SE = 1 / ( N - 3 ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}=\sqrt{1 / {(N-3)}}$$\end{document} . When it is included, Eq. (2) implies that the augmented SE = N - 1 N - 3 1 m + 1 N - 3 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}=\sqrt{\left( \frac{N-1}{N-3} \right) \frac{1}{m}+\frac{1}{N-3}} $$\end{document} , where 1/m is the dispersion of adventitious error. As N becomes very large, the traditional SE approaches zero, whereas the augmented SE approaches 1 / m \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sqrt{1 / m}$$\end{document} .

Adventitious error does not merely increase the SE; it does so in a way that is impervious to increases in sample size. Note that for our benchmark dispersion values of 0.0064 and 0.0025, 1 / m \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sqrt{1 / m}$$\end{document} equals 0.08 and 0.05, respectively. The asymptotes for the SE (when sample size is infinitely increased) imply wide confidence intervals. For an observed correlation of .25, for example, the 95% confidence interval cannot be narrower than [.098, .390] when dispersion = 0.0064 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$= 0.0064$$\end{document} or narrower than [.156, .339] when dispersion = 0.0025 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$= 0.0025$$\end{document} .

It is easy to show the consequences for statistical power based on the augmented SE and the SE asymptotes as just discussed. Consider power when α = . 05 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =.05$$\end{document} and the alternative hypothesis is that r has some positive value, such as .25. For simplicity, we ignore the chance of obtaining a significant negative result; in effect, we assess one-tailed power with α = . 025 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =.025$$\end{document} . For r = . 01 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r =.01$$\end{document} to .50, we computed the sample size required for power = . 80 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=.80$$\end{document} (“a convention used for general use”; Cohen, Reference Cohen1992, p. 156) in the usual way, ignoring adventitious error. For example, the required N is 124 when r = . 25 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r =.25$$\end{document} and 47 when r = . 40 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r =.40$$\end{document} . For each of these (r, N) pairs, we then computed power using the augmented SE corresponding to the inclusion of adventitious error (we used unrounded values for N). For dispersion = 0.0064 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$= 0.0064$$\end{document} , power = . 55 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=.55$$\end{document} when r = . 25 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r =.25$$\end{document} and N = 124 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N = 124$$\end{document} , and power = . 69 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=.69$$\end{document} when r = . 40 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r =.40$$\end{document} and N = 47 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N = 47$$\end{document} . The full curves for dispersion = 0.0025 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$= 0.0025$$\end{document} and 0.0064 are shown in the left panel of Fig. 3. In the figure, power is always .80 when adventitious error is ignored (when dispersion = 0 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$= 0$$\end{document} ) because the Ns were selected to yield that level of power for every correlation. Using those same Ns, power is substantially lower when adventitious error is considered, especially when correlations are small.

Figure 3 The effects of adventitious error on power and required sample sizes for detecting significant correlations. Left panel: For each correlation, the sample size required for power = . 80 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=.80$$\end{document} with no adventitious error was also used for the two nonzero levels of adventitious error, yielding lower power. Right panel: Sample sizes required for power = . 80 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=.80$$\end{document} at three levels of adventitious error. For the two nonzero levels of adventitious error, the vertical asymptotes in gray indicate correlations below which it is impossible to achieve power = . 80 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=.80$$\end{document} .

How much larger must N be, relative to the usual value, to attain power = . 80 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=.80$$\end{document} for a given value of r? Let N 1 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{\textrm{1}}$$\end{document} and N 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{\textrm{2}}$$\end{document} be the required sample sizes for a planned study when adventitious error is ignored and considered, respectively. The goal is to choose N 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{\textrm{2}}$$\end{document} such that the augmented SE (considering adventitious error) equals the traditional SE (ignoring adventitious error). From the expressions above, we have 1 N 1 - 3 = N 2 - 1 N 2 - 3 1 m + 1 N 2 - 3 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sqrt{\frac{1}{N_{1}-3}} =\sqrt{\left( \frac{N_{2}-1}{N_{2}-3} \right) \frac{1}{m}+\frac{1}{N_{2}-3}} $$\end{document} . Solving for the larger sample size yields

(5) N 2 = N 1 - N 1 - 3 1 / m 1 - N 1 - 3 1 / m . \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} N_{2}=\frac{N_{1}-\left( N_{1}-3 \right) \left( 1 / m \right) }{1-\left( N_{1}-3 \right) \left( 1 / m \right) }. \end{aligned}$$\end{document}

One can calculate the required N 1 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{\textrm{1}}$$\end{document} in the usual manner and then scale it up to get the required N 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{\textrm{2}}$$\end{document} . For dispersion 1 / m = 0.0064 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1 / m = 0.0064$$\end{document} , the required N increases from 124 to 533 for r = . 25 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r =.25$$\end{document} and from 47 to 65 for r = . 40 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r =.40$$\end{document} ; the increase is much more dramatic for the smaller correlation. The curves for dispersion = 0.0025 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$= 0.0025$$\end{document} and 0.0064 are shown in the right panel of Fig. 3, along with the curve for the required sample size when adventitious error is ignored (dispersion = 0).

In the figure, the three curves for the required sample size have different vertical asymptotes. In the absence of adventitious error, the asymptote is r = 0 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r = 0$$\end{document} . One can obtain power = . 80 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$=.80$$\end{document} for arbitrarily small hypothesized nonzero correlations if one collects a large enough sample (e.g., an N of nearly 20,000, which is off the chart, for r = . 02 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r =.02$$\end{document} ). However, in the presence of adventitious error, the asymptote can be noticeably larger than r = 0 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r = 0$$\end{document} . Recall that in this condition, the augmented SE approaches 1 / m \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sqrt{1/m}$$\end{document} as the sample size increases without bound. To achieve power of .80, (the Fisher z of) the alternative hypothesis value of r must be about 0.84 × SE \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.84\times \text {SE}$$\end{document} greater than the critical value, which is 1.96 × SE \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.96\times \text {SE}$$\end{document} . In other words, the alternative hypothesis value of z(r) must be at least 0.84 + 1.96 × S E \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\left( 0.84+1.96 \right) \times SE$$\end{document} , where the SE in question is now the augmented version ( SE × q ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {SE}\times q)$$\end{document} . For dispersion = 0.0064 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$= 0.0064$$\end{document} , the threshold value for z(r) is 2.80 × 0.0064 = 0.224 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$2.80\times \sqrt{0.0064} =0.224$$\end{document} , or r = . 220 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r =.220$$\end{document} . For correlations at or below this threshold, it is impossible to achieve 80% power in a single study, regardless of its sample size. For dispersion = 0.0025 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$= 0.0025$$\end{document} , the threshold for r is .139. (Below these thresholds, the above equation for N 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{\textrm{2}}$$\end{document} does not hold because the traditional and augmented SEs cannot be equal for positive values of N 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{\textrm{2}}$$\end{document} when 1 ( N 1 - 3 ) < 1 m \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\frac{1}{{(N}_{1}-3)}<\frac{1}{m}$$\end{document} .)

Not only is power reduced and bounded in the presence of adventitious error because of the augmented SE, there also are consequences for whether observed correlations can be significant. The null hypothesis cannot be rejected in a single study of any size if (the Fisher z of) the observed correlation is below 1.96 times the smallest possible augmented SE. For example, correlations below .157 (for dispersion = 0.0064 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$= 0.0064$$\end{document} ) or below .098 (for dispersion = 0.0025 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$= 0.0025$$\end{document} ) cannot lead to a rejection of the null hypothesis. If adventitious error behaves like heterogeneity, as suggested in the previous section, it would require multiple studies to reduce the uncertainty about the summary effect. Compared with the traditional assertion that all null hypotheses are false, adventitious error implies a more nuanced view. Due to adventitious error, theoretical correlations of zero may be distorted to nonzero correlations. Choosing a dispersion value for adventitious error could be a way to set boundary values to prevent false rejections of the null hypothesis.

The implications of adventitious error for power, sample size, and the values below which parameter estimates cannot lead to a rejection of the null hypothesis parallel those of effect-size heterogeneity in meta-analysis (Kenny and Judd, Reference Kenny and Judd2019; McShane and Böckenholt, Reference McShane and Böckenholt2014; Pawel and Held, Reference Pawel and Held2020; Pek et al., Reference Pek, Pitt and Wegener2024), which is not surprising given the discussion in Sect. 2.2. Here, we highlight one implication that has thus far received insufficient attention: Although increasing the sample size beyond that recommended by traditional power analysis is essential for maintaining power when the hypothesized effect is modest (e.g., r = . 30 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r =.30$$\end{document} ), the same strategy may be futile when the hypothesized effect is small (e.g., r = . 10 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r =.10$$\end{document} ) because no sample size will suffice. For small observed values, it may be impossible to reject the null hypothesis in a single study if heterogeneity or adventitious error is considered.

3. Adventitious Error and Measurement Uncertainty

3.1. Measurement Error and Adventitious Error

Measurement error is the deviation of a measurement outcome from the expected value for hypothetical independent repeated measurement outcomes for the same individual under the same standard condition. The expected value is taken over a propensity distribution of independent repeated outcomes using measurement tool g for an individual in the standard condition. The expected value is also called the true value, denoted here as t gi \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$t_{gi}$$\end{document} , with i as an index for individuals. The deviation of a measurement outcome X gi \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_{gi}$$\end{document} from the expected value t gi \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$t_{gi}$$\end{document} is the measurement error for an individual, ε gi \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varepsilon _{gi}$$\end{document} , so that X gi = t gi + ε gi \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_{gi}=t_{gi}+\varepsilon _{gi}$$\end{document} . The variance of ε gi \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varepsilon _{gi}$$\end{document} per individual (across the propensity distribution for individual i), σ ε i 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{\varepsilon i}^{2}$$\end{document} , cannot be directly estimated, but the average error variance across individuals (the average across individuals of the within-individual σ ε i 2 ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{\varepsilon i}^{2})$$\end{document} can be estimated through the variance of measurement error as a random variable across individuals (Lord & Novick, Reference Lord and Novick1968, p. 35), denoted here as σ e 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{e}^{2}$$\end{document} . An estimate of σ e 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{e}^{2}$$\end{document} is often taken as an estimate of σ ε i 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{\varepsilon i}^{2}$$\end{document} , and σ e 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{e}^{2}$$\end{document} is used as the standard error of measurement (S.E.M.), an inverse index of measurement precision. Thus, a between-individual parameter ( σ e 2 ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{e}^{2})$$\end{document} is used as a statistical proxy for a within-individual parameter ( σ ε i 2 ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma _{\varepsilon i}^{2})$$\end{document} .

Multiple items (manifest variables, MVs) are commonly used for the measurement of constructs. For a summed score based on a set of items, the S.E.M. can be estimated, for example, with coefficient alpha or McDonald’s omega or some other reliability coefficient depending on assumptions about the items (e.g., tau equivalence for the alpha coefficient). Alternatively, factor scores can be used as surrogates for the construct, in which case the variance of regression factor scores is taken as a reliability coefficient of the scores, which can in turn be used to estimate the S.E.M.

Lord and Novick (Reference Lord and Novick1968) interpreted measurement error as “a disturbance that is due to a composite of a multitude of factors not controlled in the measurement procedure” (p. 38). That measurement error is formulated as random can be understood as a way to deal with unspecified complexities about factors with an effect on the observed measurement outcome. The question we want to address in this third part is how to consider adventitious error in relation to measurement error. Like measurement error, σ ε i 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma ^2_{\varepsilon _i}$$\end{document} , can we obtain an estimate of the within-individual variance (and standard deviation, SD) due to adventitious error? If so, the SD due to adventitious error would give us an indication of the impact of adventitious error on the uncertainty of measurement outcomes for an individual using a given measurement tool, in a similar way as the S.E.M. does for measurement error.

However, unlike measurement error, adventitious error is associated with the variances and covariances of MVs in ways that reflect the implementation of a specific study and is thus irrelevant to the theoretical structure of the MVs. For example, when a set of MVs is used to measure a construct, adventitious error generates deviations from the theoretical relations, and those deviations are conceptually irrelevant to the construct. An alternative view is that the nature of the construct itself is not fixed, as discussed by De Boeck et al. (Reference De Boeck, Pek, Walton, Wegener, Turner, Andersen, Beauchaine, Lecavalier, Myung and Petty2023), but here we stay with the common view that the nature of a construct is fixed and reflected in fixed relations of the factors (latent variables) with the MVs, except that this common view ignores the distinction between the theoretical population and the operational population. It is self-evident that measurement error has implications for the variation and uncertainty of measurement outcomes. The consequences for measurement outcomes due to adventitious error are less obvious, which is why a simulation study may help.

3.2. Purpose of the Simulation Study

The purpose of the simulation study is to assess the variation in measurement outcomes (e.g., summed scores and factor scores) that is specifically induced by imposing adventitious error on data from a one-dimensional CSM. Our simulation allows us to isolate adventitious error from variation based on measurement error (i.e., S.E.M.) and from the sampling variation that normally arises in the generation of multivariate data. This separation can be realized by reformulating adventitious error as a linear transformation of a set of variables and by taking advantage of the internal algorithmic process when using the mvrnorm( ) function (see Song, 2021, for an explication of the algorithm). In this process, the linear transformation that adds adventitious error is at the last stage, as explained in Sect. 3.3. For a more accurate estimate of the impact of adventitious error on the variation of measurement outcomes for the same individuals within a single sample, the simulation is repeated for different simulated samples of individuals. As mentioned, we focus on composite scores (summed scores and factor scores) because most measurement outcomes are composite scores. Given that composite scores change when the set of variables involved in the composites are linearly transformed due to a change of the covariance matrix of the variables, we expect that the composite scores will be affected by adventitious error.

3.3. Design and Algorithm

The design of the simulation study is that 1000 data sets are generated for each of 10 different independent simulation samples, each with 1000 individuals and four MVs. The 1000 data sets within a simulation sample differ only with respect to adventitious error, and not in any other respect such as measurement error of sampling variation. Between simulation samples, the variability is a reflection of sampling variability. The total number of data sets to be generated is 10 × \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} 1000.

As mentioned, the measurement outcomes to be considered for these data sets (1000 data sets for each of the 10 samples) are simple summed scores and one-factor model factor scores per individual based on the four MVs. Per individual within a sample, there are 1000 summed scores and 1000 factor scores. These composite measurement outcomes per individual within a simulation sample vary depending on the data set, and this variation is due solely to adventitious error.

For the simulation algorithm, the following steps are repeated for each of the 10 simulation samples, with k = 1 , . . , K = 10 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=1, .., K=10$$\end{document} , as an index for the samples.

Step 1. Generate 1000 independent operational covariance matrices Σ s k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s_{k}}$$\end{document} with size p × p \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p\times p$$\end{document} , where p = 4 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=4$$\end{document} as the number of MVs, and s k = 1 , . . , S k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k}=1, .., S_{k}$$\end{document} , as an index for the data sets to be generated for sample k, based on the following inverse Wishart distribution:

(6) Σ s k | Ω , m I W ( m Ω , m ) , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \varvec{\varSigma }_{s_{k}}\vert \varvec{\varOmega }, m \sim IW(m\varvec{\varOmega },m), \end{aligned}$$\end{document}

where Ω \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega }$$\end{document} is the theoretical covariance matrix, and m = 1 / R M S E A 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m=1/RMSEA^{2}$$\end{document} .

For our simulation study, m = 1 / 0.0 8 2 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m=\textrm{1} / {\mathrm {0.0}\textrm{8}^{\textrm{2}}}$$\end{document} (0.08 is the reference RMSEA value associated with a benchmark level of adventitious error); and

Ω = 1 0.49 0.49 0.49 0.49 1 0.49 0.49 0.49 0.49 1 0.49 0.49 0.49 0.49 1 , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \varvec{\varOmega } =\left[ {\begin{array}{cccc} 1 &{} 0.49 &{} 0.49 &{} 0.49\\ 0.49 &{} 1 &{} 0.49 &{} 0.49\\ 0.49 &{} 0.49 &{} 1 &{} 0.49\\ 0.49 &{} 0.49 &{} 0.49 &{} 1\\ \end{array} } \right] , \end{aligned}$$\end{document}

corresponding to a one-factor model with loadings of 0.70 and residual variances of 0.51 for all four MVs. There are 10 x 1000 Σ s k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s_{k}}$$\end{document} matrices. The variation between these covariance matrices reflects adventitious error. Note that the Σ s k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s_{k}}$$\end{document} matrices do not depend on k; they are just being used for sample k.

Step 2. Generate a random seed h k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$h_{k}$$\end{document} for sample k, to be used in step 3.

Step 3. Generate 1000 N × p \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N\times p$$\end{document} data sets X s k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{X}}_{s_{k}}$$\end{document} for sample k, where N = 1000 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N=1000$$\end{document} , based on the following multivariate normal distribution:

(7) X s k | h k , Σ s k , μ M V R ( Σ s k , μ ) , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} {\varvec{X}}_{s_{k}}\vert h_{k},\varvec{\varSigma }_{s_{k}},\varvec{\mu }\sim MVR(\varvec{\varSigma }_{s_{k}}{,{\varvec{\mu }} }), \end{aligned}$$\end{document}

where μ = 0 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\mu } ={\textbf {0}}$$\end{document} , a vector with length p.

For each value of s k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_{k}$$\end{document} , the generation of X s k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{X}}_{s_{k}}$$\end{document} consists of two substeps:

Step 3a. Generate an orthonormal preliminary data set Z k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{Z}}_{k}$$\end{document} with size N × p \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N\times p$$\end{document} using random seed h k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$h_{k}$$\end{document} and based on the following multivariate normal distribution:

(8) Z k | h k , I , μ M V R ( μ , I ) , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} {\varvec{Z}}_{k}\vert h_{k}{, {\varvec{I}}, {\varvec{\mu }} }\sim MVR(\varvec{\mu },{\varvec{I}}), \end{aligned}$$\end{document}

where I \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{I}}$$\end{document} is a p × p \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p\times p$$\end{document} identity matrix. The random seed h k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$h_{k}$$\end{document} ensures that the individuals from the 1000 data sets within each kth sample are the same.

Step 3b. Linearly transform Z k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{Z}}_{k}$$\end{document} into X s k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{X}}_{s_{k}}$$\end{document} using Σ s k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s_{k}}$$\end{document} (based on an eigen-decomposition of Σ s k ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s_{k}})$$\end{document} , resulting in 1000 data matrices X s k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{X}}_{s_{k}}$$\end{document} for sample k. Thus, the variation of X s k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{X}}_{s_{k}}$$\end{document} between the 1000 data sets within each kth sample reflects adventitious error.

Through executing the three steps for each of the 10 samples, we obtain 10 × \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} 1000 data sets. The 1000 data sets within each sample are dependent as linear transformations of the same preliminary data set. They also are linear transformations of each other, induced by the variability of adventitious error.

3.4. Measurement Outcomes

For each sample’s 1000 data matrices X s k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{X}}_{s_{k}}$$\end{document} , a one-factor model is fit to the data using lavaan with ML estimation, resulting in estimated regression factor scores (Thurstone, Reference Thurstone1935). In addition, the summed scores (sums of the four MVs) are determined per individual. This yields 1000 factor scores and 1000 summed scores per individual in each of the 10 samples, one factor score and one summed score for each of the 1000 data sets in sample k. These factor scores and summed scores are standardized within each data set, so that all measurement outcomes have the same scale and can be compared. The downsides of this standardization are that (a) the variation across the data sets within a simulation sample is reduced because the variation of SDs due to adventitious error is eliminated and (b) standardization has scale effects.

Per individual (which is the same across the S = 1000 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S = 1000$$\end{document} data sets per sample because the random seed is the same for each k), the SD across 1000 data sets is calculated for the factor scores and for the summed scores. These SDs are a quantification of the measurement outcome variation due to adventitious error.

Finally, the SDs per individual are averaged across all individuals from the same kth simulation sample, for factor scores and for sum scores. In this way, each sample provides an independent estimate of the average within-individual SD for factor scores and for summed scores.

3.5. Results and Discussion

The results are reported in Table 2 and can be summarized as follows:

The findings can be generalized across the 10 samples. The within-individual SD for factor scores of the 10 samples is on average 0.084, roughly 8.4% of the between-individual SD of factor scores (which is 1 because of the standardization). The estimated within-individual SD would lead to a 95% confidence range with a width of roughly 0.33 on the standard scale (the difference between 0.084 × 1.96 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times 1.96$$\end{document} and - \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document} 0.084 × \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} 1.96). The within-individual SD for summed scores is approximately 0.035, roughly 3.5% of the between-row SD (which is 1). This would lead to a 95% confidence range with a width of 0.14 on the standard scale (the difference between 0.035 × 1.96 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.035 \times 1.96$$\end{document} and - 0.035 × \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-0.035 \times $$\end{document} 1.96). These CIs represent variation due to adventitious error.

The within-individual variation across data sets within samples (i.e., solely due to adventitious error) is smaller for sum scores than for factor scores, although the variation is measured on the same scale. The difference can be explained by the fact that the MV weights for factor scores (but not for summed scores) vary with the operational covariance matrix. The factor score coefficients (i.e., the weights) to estimate the factor scores depend on Σ sk \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{sk}$$\end{document} , but the weights for summed scores do not. As explained earlier, the differences between the data matrices for the same sample k are not affected by sampling variation because all data sets of the same sample are linear transformations of each other, for factor scores and summed scores alike, so that sampling variation cannot explain the differences in SDs for the two types of composite scores.

For factor scores and summed scores, the within-individual variation due to adventitious error may seem small compared with the variation due to measurement error. For purposes of comparison, we use a reliability coefficient of .90, a value that may be considered high for research purposes, given Cronbach’s (Reference Cronbach1951) appraisal of 0.70 as fairly large and given that Nunnally (Reference Nunnally1978) and the APA Dictionary of Psychology (Vandenbos, 2015) recommend 0.80. For a standardized scale, the corresponding S.E.M. is 0.316, and the width of the 95% confidence range is 1.24 (the difference between 0.316 × 1.96 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.316 \times 1.96$$\end{document} and - 0.316 × 1.96 ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-\,0.316 \times 1.96)$$\end{document} , which is more than 100% of the SD of standardized between-individual differences. These high values may be surprising, but what looks good in terms of reliability coefficients is not so good in terms of S.E.M. This explains why Nunnally (Reference Nunnally1978) recommends a higher reliability coefficient for decisions based on individual test scores than for research.

There are three possible reasons for the small variation induced by adventitious error:

A. First, the adventitious error rate corresponds to an acceptable GOF for CSM-based measurement models. Therefore, the small within-individual SDs are not surprising. Our result can be interpreted as corroborating evidence for considering an RMSEA value of 0.08 as acceptable.

B. Second, the factor scores and summed scores in our simulation rely on four MVs with an average inter-MV correlation of .49, so the composite scores should be relatively stable. Recall that in our simulation study in the introductory section, the mean absolute difference between operational correlation matrices due to adventitious error was only 0.079. When the loadings of the four MVs of the one factor are reduced to 0.50 so that the theoretical correlations among the MSs are .25, and a simulation study is conducted with these new values, we find that the average SDs of the standardized factor scores and the summed scores increase by a factor of 2.5.

C. Finally, the within-individual SDs of factor scores and summed scores resulting solely from adventitious error are rather small because they stem from linear transformations of the same sample-specific preliminary data set Z k \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varvec{Z}}_{k}$$\end{document} .

Table 2 Simulation results for within-individual variation of composite measurement outcomes based only on adventitious error.

Means of within-individual SDs were computed across 1000 individuals per sample. The within-individual SDs were based on 1000 randomly generated covariance matrices designed to represent adventitious error, using the benchmark adventitious error rate (0.0064) and in the absence of measurement error and sampling error.

General Discussion

In Part 2, we have illustrated the consequences of adventitious error for the SE of a relationship between variables. In Part 3, we have focused on composite measurement outcomes that are (weighted or unweighted) sums of four MVs. The consequences of adventitious error seem larger for the uncertainty of relations than for the uncertainty of composite measurement outcomes. A possible reason is that the results of Part 3 concern aggregate level statistics (weighted and unweighted summed scores), whereas the results of Part 2 concern relation statistics (i.e., relations between pairs of variables).

Two features of adventitious error should be emphasized for a more nuanced interpretation of the results. First, adventitious error implies random disturbances of the operational covariance matrix ( Σ s ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varSigma }_{s}) $$\end{document} from the theoretical covariance matrix ( Ω ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varvec{\varOmega })$$\end{document} . Therefore, adventitious error cannot lead to systematic bias (i.e., an expected nonzero difference), but it could lead to a random discrepancy with true parameter values in a single study or a small set of studies due to the random (non-systematic) distortion it introduces. Second, adventitious error is unrelated to random sampling of MVs that reflect a construct. Random sampling of MVs (i.e., items rather than participants) can have other and possibly more drastic effects on the within-individual SD of measurement outcomes compared with adventitious error because adventitious error can be reduced to linear transformations of the same MVs, whereas sampling MVs implies that different sets of MVs are used.

To summarize, the benchmark adventitious error rate (based on RMSEA = 0.08 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$= 0.08$$\end{document} for CSMs) has serious consequences for the uncertainty regarding relations between variables and seemingly less serious consequences for the uncertainty of measurement outcomes. However, this conclusion depends on the metric one wants to use to appraise the consequences. If adventitious error is present, one may also expect heterogeneity of effects in meta-analytic studies and (if adventitious error is ignored) an overestimation of power, especially for smaller effect sizes.

A key limitation of our contribution is that there is no empirical proof of adventitious error. Instead, we consider adventitious error as a framework, based on the assumption that all models are wrong (i.e., have approximate GOF). The framework can be useful for interpreting well-established phenomena such as heterogeneity of effects, overestimated power, and variability in research findings generally. That these phenomena follow from the same hypothetical principle of adventitious error is not a proof of that principle, but rather an illustration that the principle provides a parsimonious description of seemingly unrelated regularities.

Another limitation is that our simulations are far from comprehensive. We have mainly used one adventitious error rate (i.e., 0.0064, corresponding to an RMSEA of 0.08 for CSMs), a covariance matrix associated with a one-dimensional factor model (with rather high MV correlations), and only a couple levels of sample size. More extensive simulation studies would be desirable.

A final limitation is that, unfortunately, a multilevel model with a dispersion parameter for covariance matrices is at present not within reach. If it were, one could use data from a rather large number of studies to estimate the adventitious error dispersion as a parameter of that multilevel model. Available multilevel factor models are based on the assumption of homogeneous covariance matrices, except for possible differences in the variances of the factors (Muthén, Reference Muthén1994; Stapleton et al., Reference Stapleton, Yang and Hancock2016). Other multilevel approaches focus either on variation per cell of the covariance matrix (Cheung and Chan, Reference Cheung and Chan2005) or on a specific effect in a structural equation model (Preacher et al., Reference Preacher, Zyphur and Zhang2010), without considering the random variation of the whole covariance matrix. Even so, a multilevel model with the dispersion of adventitious error as a parameter would still not solve the problems of adventitious error associated with (a) single studies or (b) studies with single variables or data analyses without covariance structure modeling. Even with such a multilevel model available, one cannot rely on estimated adventitious error rates in those situations. As a final consideration, although we have used benchmark values for the adventitious error rate to illustrate the consequences for inferences regarding relations between variables and for the uncertainty of composite measurement outcomes, we refrain from recommending benchmark values for actual inferences in practice.

We conclude that a possible cause of approximate fit, in the form of adventitious error, has consequences for the uncertainty on relations between variables and on composite measurement outcomes. The custom of accepting models with approximate fit may have further consequences beyond having a useful model. Whereas approximate models can certainly be useful, their approximate fit also suggests that there is more uncertainty in research findings and in measurement than we commonly suppose there is.

Funding

The authors did not receive support from any organization for the submitted work.

Declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose. There are no conflicts of interest to declare.

Human and animal rights

See declaration on interests here above. The research did not involve human participants or animals.

Footnotes

The first half of the manuscript covers an important part of the 2021 lifetime achievement award lecture presented by the first author at IMPS 2021, entitled “All models are wrong, but some model violations are useful.” The second half, on the implications of adventitious error for effect-size heterogeneity, power, and measurement uncertainty, consists of further developments in line with the award lecture. The manuscript is a collaboration with Michael DeKay and Jolynn Pek, in alphabetical order. Shortcomings should be blamed on the presenter of the award lecture.

Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

1 Because correlations are bounded, the conclusions are only approximate.

2 Because correlations are bounded, unlike covariances, the q factor might not apply strictly to correlations. Applying the Fisher z transformation to correlations does not solve this potential problem. Although our results are only approximate, they remain consistent with our main conclusions.

.

References

Bollen, K. A. (1989). Structural equations with latent variables, Hoboken: Wiley.CrossRefGoogle Scholar
Borenstein, M., Hedges, L. V., Higgins, J. P. T., Rothstein, H. R. (2009). Introduction to meta-analysis, Hoboken: Wiley.CrossRefGoogle Scholar
Box, G. E. P. (1976). Science and statistics. Journal of the American Statistical Association, 71(356), 791799.CrossRefGoogle Scholar
Browne, M. W., Cudeck, R. (1993). Alternative ways of assessing model fit. In Bollen, K. A., Long, J. S. (Eds), Testing structural equation models, Sage 136162.Google Scholar
Cheung, MW-L, Chan, W. (2005). Meta-analytic structural equation modeling: A two-stage approach. Psychological Methods, 10(1), 4064.CrossRefGoogle ScholarPubMed
Cohen, J. (1992). A power primer. Psychological Bulletin, 112(1), 155159.CrossRefGoogle ScholarPubMed
Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika, 16, 297334.CrossRefGoogle Scholar
Cudeck, R., Henly, S. J. (1991). Model selection in covariance structures analysis and the “problem” of sample size: A clarification. Psychological Bulletin, 109(3), 512519.CrossRefGoogle ScholarPubMed
De Boeck, P., Jeon, M. (2018). Perceived crisis and reforms: Issues, explanations, and remedies. Psychological Bulletin, 144(7), 757777.CrossRefGoogle ScholarPubMed
De Boeck, P., Pek, J., Walton, K., Wegener, D., Turner, B., Andersen, B., Beauchaine, T., Lecavalier, L., Myung, J., Petty, R. (2023). Questioning psychological constructs: Current issues and proposed changes. Psychological Inquiry, 34(4), 239257.CrossRefGoogle Scholar
Dettori, J. R., Norvell, D. C., & Chapman, J. R. (2022). Fixed-effect vs random-effects models for meta-analysis: Points to consider. Global Spine Journal, 12(7), 1624–1626. https://doi.org/10.1177/21925682221110527.CrossRefGoogle Scholar
Hu, L. T., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling, 6(1), 1–55. https://doi.org/10.1080/10705519909540118.CrossRefGoogle Scholar
Jöreskog, K. G. (1969). A general approach to confirmatory maximum likelihood factor analysis. Psychometrika, 34 2, Pt.1183202.CrossRefGoogle Scholar
Kenny, D. A., Judd, C. M. (2019). The unappreciated heterogeneity of effect sizes: Implications for power, precision, planning of research, and replication. Psychological Methods, 24(5), 578589.CrossRefGoogle ScholarPubMed
Lord, F. M., Novick, M. R. (1968). Statistical theories of mental test scores, Boston: Addison-Wesley.Google Scholar
Martin, A. D., Quinn, K. M., & Park, J. H. (2011). MCMCpack: Markov Chain Monte Carlo in R. Journal of Statistical Software, 42(9), 1–21. https://doi.org/10.18637/jss.v042.i09.CrossRefGoogle Scholar
MacCallum, R. C. (1986). Specification searches in covariance structure modeling. Psychological Bulletin, 100(1), 107120.CrossRefGoogle Scholar
MacCallum, R. C. (2003). 2001 presidential address: Working with imperfect models. Multivariate Behavioral Research, 38(1), 113139.CrossRefGoogle ScholarPubMed
McGraw, K. O., Wong, S. P. (1996). Forming inferences about some intraclass correlation coefficients. Psychological Methods, 1(1), 3046.CrossRefGoogle Scholar
McShane, B. B., Böckenholt, U. (2014). You cannot step into the same river twice: When power analyses are optimistic. Perspectives on Psychological Science, 9(6), 612625.CrossRefGoogle ScholarPubMed
McShane, B. B., Böckenholt, U., Hansen, K. T. (2022). Variation and covariation in large-scale replication projects: An evaluation of replicability. Journal of the American Statistical Association, 117(540), 16051621.CrossRefGoogle Scholar
Muthén, B. O. (1994). Multilevel covariance structure analysis. Sociological Methods & Research, 22(3), 376398.CrossRefGoogle Scholar
Nunnally, J. C. (1978). Psychometric theory, 2New York: McGraw-Hill.Google Scholar
O’Hagan, A. (2004). Dicing with the unknown. Significance, 1(3), 132133.CrossRefGoogle Scholar
Pawel, S., Held, L. (2020). Probabilistic forecasting of replication studies. PLoS ONE, 15(4), .CrossRefGoogle ScholarPubMed
Pek, J., Pitt, M. A., Wegener, D. T. (2024). Uncertainty limits the use of power analysis. Journal of Experimental Psychology: General, 153(4), 11391151.CrossRefGoogle ScholarPubMed
Preacher, K. J., Zyphur, M. J., Zhang, Z. (2010). A general multilevel SEM framework for assessing multilevel mediation. Psychological Methods, 15(3), 209233.CrossRefGoogle ScholarPubMed
Rosseel, Y. (2012). An R Package for structural equation modeling. Journal of Statistical Software, 48(2), 136.CrossRefGoogle Scholar
Sörbom, D. (1989). Model modification. Psychometrika, 54, 371384.CrossRefGoogle Scholar
Stanley, T. D., Carter, E. C., Doucouliagos, H. (2018). What meta-analyses reveal about the replicability of psychological research. Psychological Bulletin, 144(12), 13251346.CrossRefGoogle ScholarPubMed
Stapleton, L. M., Yang, J. S., Hancock, G. R. (2016). Construct meaning in multilevel settings. Journal of Educational and Behavioral Statistics, 41(5), 481520.CrossRefGoogle Scholar
Thurstone, L. L. (1935). The vectors of mind, Chicago: University of Chicago Press.Google Scholar
Tucker, L. R., Koopman, R. F., Linn, R. L. (1969). Evaluation of factor analytic research procedures by means of simulated correlation matrices. Psychometrika, 34(4), 421459.CrossRefGoogle Scholar
Tufanaru, C., Munn, Z., Stephenson, M. M., Aromataris, E. (2015). Fixed or random effects meta-analysis? Common methodological issues in systematic reviews of effectiveness. International Journal of Evidence-Based Healthcare, 13(3), 196207.CrossRefGoogle ScholarPubMed
VandenBos, G. R. (Ed.). (2015). APA dictionary of psychology (2nd ed.). American Psychological Association. https://doi.org/10.1037/14646-000.CrossRefGoogle Scholar
van Erp, S., Verhagen, J., Grassman, R. P., Wagenmakers, E. J. (2017). Estimates of between-study heterogeneity for 705 meta-analyses reported in Psychological Bulletin from 1990–2013. Journal of Open Psychology Data, 5(1), 4.CrossRefGoogle Scholar
Venables, W. N., Ripley, B. D. (2002). Modern applied statistics with S, Berlin: Springer.CrossRefGoogle Scholar
Wu, H., Browne, M. W. (2015). Quantifying adventitious error in a covariance structure as a random effect. Psychometrika, 80(3), 571600.CrossRefGoogle Scholar
Figure 0

Figure 1 Sampling scheme for an operational population covariance matrix and an observed covariance matrix. n = sample size N - 1 and m is the degrees of freedom of an inverse Wishart distribution.

Figure 1

Figure 2 Cases 1, 2a and 2b of relations between variables.

Figure 2

Table 1 Theoretical and simulation-based augmented uncertainty factors for SEs.

Figure 3

Figure 3 The effects of adventitious error on power and required sample sizes for detecting significant correlations. Left panel: For each correlation, the sample size required for power =.80\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$=.80$$\end{document} with no adventitious error was also used for the two nonzero levels of adventitious error, yielding lower power. Right panel: Sample sizes required for power =.80\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$=.80$$\end{document} at three levels of adventitious error. For the two nonzero levels of adventitious error, the vertical asymptotes in gray indicate correlations below which it is impossible to achieve power =.80\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$=.80$$\end{document}.

Figure 4

Table 2 Simulation results for within-individual variation of composite measurement outcomes based only on adventitious error.