
A Unified Neural Network Framework for Extended Redundancy Analysis

Published online by Cambridge University Press:  01 January 2025

Ranjith Vijayakumar
Affiliation: National University of Singapore

Ji Yeh Choi (corresponding author)
Affiliation: York University

Eun Hwa Jung
Affiliation: Kookmin University

Correspondence should be made to Ji Yeh Choi, Department of Psychology, York University, 4700 Keele St., Toronto, ON, Canada. Email: [email protected]

Abstract

Component-based approaches have been regarded as a tool for dimension reduction when predicting outcomes from observed variables in regression applications. Extended redundancy analysis (ERA) is one such component-based approach, which reduces predictors to components that explain maximum variance in the outcome variables. ERA can be extended to capture nonlinearity and interactions between observed variables and components, but only by specifying the functional form a priori. Machine learning methods such as neural networks, by contrast, are typically used in a data-driven manner to capture nonlinearity without specifying the exact functional form. In this paper, we introduce a new method, called NN-ERA, that integrates neural network algorithms into the ERA framework to capture unspecified nonlinear relationships among multiple sets of observed variables when constructing components. Simulations and empirical datasets are used to demonstrate the usefulness of NN-ERA. We conclude that in social science datasets with unstructured data, where nonlinear relationships are expected but cannot be specified a priori, NN-ERA and its neural network algorithmic structure can serve as a useful tool to specify and test models not captured by conventional component-based models.
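To make the idea concrete, the sketch below illustrates one way the component-construction step described in the abstract could be realized: each set of predictors is mapped to a single component by its own small neural network, and the components are then linearly related to the outcomes, mirroring the ERA structure. This is a minimal conceptual sketch, not the authors' implementation; the class name NNERASketch, the network sizes, the optimizer settings, and the toy data are all assumptions made purely for illustration.

```python
# Minimal conceptual sketch (assumed architecture, not the paper's code):
# each predictor set X_k is passed through its own small neural network to
# produce a one-dimensional component, and the components are combined
# linearly to predict the outcomes, echoing the ERA structure.
import torch
import torch.nn as nn

class NNERASketch(nn.Module):
    def __init__(self, set_sizes, n_outcomes, hidden=8):
        super().__init__()
        # One subnetwork per predictor set: X_k -> scalar component f_k(X_k)
        self.subnets = nn.ModuleList(
            nn.Sequential(nn.Linear(p, hidden), nn.Tanh(), nn.Linear(hidden, 1))
            for p in set_sizes
        )
        # Component loadings: components -> outcomes (linear, as in ERA)
        self.loadings = nn.Linear(len(set_sizes), n_outcomes)

    def forward(self, x_sets):
        # Concatenate the scalar components from each predictor set
        comps = torch.cat([net(x) for net, x in zip(self.subnets, x_sets)], dim=1)
        return self.loadings(comps)

# Toy usage: two predictor sets (5 and 3 variables), two outcome variables.
torch.manual_seed(0)
x1, x2 = torch.randn(100, 5), torch.randn(100, 3)
y = torch.randn(100, 2)
model = NNERASketch([5, 3], n_outcomes=2)
optimizer = torch.optim.Adam(model.parameters(), lr=0.01)
for _ in range(200):
    optimizer.zero_grad()
    loss = nn.functional.mse_loss(model([x1, x2]), y)
    loss.backward()
    optimizer.step()
```

In practice one would add regularization and train/validation splits; the sketch only conveys the architectural idea of set-wise nonlinear components feeding a linear outcome layer.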

Type
Application Reviews and Case Studies
Copyright
Copyright © 2022 The Author(s) under exclusive licence to The Psychometric Society


Footnotes

Supplementary Information: The online version contains supplementary material available at https://doi.org/10.1007/s11336-022-09853-x.

Supplementary material

Vijayakumar et al. supplementary material: Appendix A and B (File, 78 KB)