From Factor analysis to SEM!

Course Outline

Overview of the next 5 weeks of MV stats

  1. Factor analysis
  2. FA as EFA, moving to CFA & the concept of fit
  3. Path Analysis: Regression and lm as a subset of SEM
  4. Structural Equation Modelling
  5. Latent variable modeling
  6. Causality?

Week

  1. Introduces you to the idea of latent variables, the concept of a path diagram, and also how to do a factor analysis in R.
    • Show how the EFA needs to fix latent variances at 1 and covariances at 0, and fix the upper right triangle of whatever matrix contains the latent loadings. Show what rotation does (R is great for that - just pull the loadings matrix and throw in at promax() or varimax(). Introduce indeterminacy by talking about how the rotations do not differ in fit. Then the next week convert that same model to CFA. Use a real data set so it has a purpose.
    • Tutorial: Factor analysis of built-in personality data
  2. Introduces SEM
    • Implement our factor analysis as an exploratory factor analysis.
    • Introduce the idea of confirmatory factor analysis.
    • Discuss the important concept which this brings with it: fit.
    • Curb your enthusiasm: By week 3 you should understand what boxes, circles, triangles, diamonds, and straight and curved arrows are.
    • Tutorial: Building a CFA of built-in five-factor model data.
  3. Path models: “Regression on steroids”
    • Building the equivalent of lm(y ~ x + z) in umx.
    • This leverages two ideas you already understand: Factor analysis and linear models/ANOVA.
    • Measurement invariance?
    • Curb your enthusiasm: By week 3 you should understand.
    • Tutorial: Implementing regression model in umx.
  4. Model comparison a ratchet taking science toward truth and valid causal inference.
    • Testing (and, critically, mxCompare()-ing) ideas.
    • Tutorial: Modifying and comparing models.
    • Are there aspects of personality”?
  5. Data complexities, further opportunities, and summary discussion
    • Multi-group models& twins
    • Handling ordinal outcomes as thresholded normals.
    • Least-squares: WLS
    • Multi-level
    • IRT?
    • Tutorial: Multiple group model (measurement invariance?)

The goal of this block is to get you in a position to solve practical problems with SEM, and to understand its applicability and caveates in interpretation and more complex data types and models.

We can’t in this space allocated, cover the mathematical basis of SEM, competing implementations (RAM, Lisrel, general matrix models). Nor can we cover the full range of analytic methods (LS, FIML etc). A longer term goal is to present this material in accompanying video lectures deriving modelling concepts.

Bibliography

Book: I like John Loehlin’s (2004). Latent Variable Models: An Introduction to Factor, Path, and Structural Equation Analysis (4 ed.): Psychology Press.

There is a paper on the package we will use: umx: A library for Structural Equation and Twin Modelling in R