Estimation alternatives using the LMS approach
Accelerated EM and Adaptive Quadrature
By default (as of v1.0.9), the LMS approach uses an accelerated EM procedure ("EMA"), which applies Quasi-Newton and Fisher Scoring optimization steps when needed. If desirable, this can be switched to the standard Expectation-Maximization (EM) algorithm by setting algorithm = "EM".
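As a minimal sketch of switching between the two algorithms (here `m` and `df` are placeholders for a modsem model syntax string and a suitable data frame; only arguments described in this article are used):

```r
library(modsem)

# `m` is a modsem model syntax string, `df` a data.frame (placeholders)
fit_ema <- modsem(m, data = df, method = "lms")  # default: algorithm = "EMA"
fit_em  <- modsem(m, data = df, method = "lms",
                  algorithm = "EM")              # standard EM instead
```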
By default, the LMS approach also uses a fixed Gauss-Hermite quadrature to compute a numerical approximation of the log-likelihood. Instead of a fixed quadrature, it is possible to use a quasi-adaptive quadrature. For performance reasons, the adaptive quadrature does not fit an individual quadrature to each participant, but instead one for the entire sample (at each EM iteration), based on the whole-sample densities of the likelihood function. It essentially works by removing irrelevant nodes that do not contribute to the integral, and increasing the number of nodes that actually do contribute. This usually means that more nodes are placed towards the center of the distribution than in a standard fixed Gauss-Hermite quadrature. Using EMA together with the adaptive quadrature may yield estimates that are closer to results from Mplus.
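For instance, the quasi-adaptive quadrature can be enabled like this (again treating `m` and `df` as placeholders, and using only arguments documented in this article):

```r
library(modsem)

# `m` is a modsem model syntax string, `df` a data.frame (placeholders)
fit <- modsem(m, data = df, method = "lms",
              adaptive.quad = TRUE, # quasi-adaptive quadrature
              nodes = 32)           # more nodes than the default of 24
summary(fit)
```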
If the model struggles to converge, you can try changing the EM procedure by setting algorithm = "EMA" or algorithm = "EM", and setting adaptive.quad = TRUE in the modsem() function. Additionally, it is possible to tweak these parameters:
- max.iter: Maximum number of iterations for the EM algorithm (default is 500).
- max.step: Maximum number of steps used in the Maximization step of the EM algorithm (default is 1).
- convergence.rel: Relative convergence criterion for the EM algorithm.
- convergence.abs: Absolute convergence criterion for the EM algorithm.
- nodes: Number of nodes for numerical integration (default is 24). Increasing this number can improve the accuracy of the estimates, especially for complex models.
- quad.range: Integration range for the quadrature. Smaller ranges mean that the integral is more focused. Applies only when using a quasi-adaptive quadrature.
- adaptive.frequency: How often the quasi-adaptive quadrature should be recalculated. Defaults to every third EM iteration.
Here is an example using the TPB_UK dataset, which is more troublesome to estimate than the simulated TPB dataset.
tpb_uk <- "
# Outer Model (Based on Hagger et al., 2007)
ATT =~ att3 + att2 + att1 + att4
SN =~ sn4 + sn2 + sn3 + sn1
PBC =~ pbc2 + pbc1 + pbc3 + pbc4
INT =~ int2 + int1 + int3 + int4
BEH =~ beh3 + beh2 + beh1 + beh4
# Inner Model (Based on Steinmetz et al., 2011)
INT ~ ATT + SN + PBC
BEH ~ INT + PBC
BEH ~ INT:PBC
"
fit <- modsem(tpb_uk,
              data = TPB_UK,
              method = "lms",
              nodes = 32,              # Number of nodes for numerical integration
              adaptive.quad = TRUE,    # Use quasi-adaptive quadrature
              adaptive.frequency = 3,  # Update the quasi-adaptive quadrature every third EM iteration
              algorithm = "EMA",       # Use accelerated EM algorithm (default)
              convergence.abs = 1e-4,  # Absolute convergence criterion
              convergence.rel = 1e-10, # Relative convergence criterion
              max.iter = 500,          # Maximum number of iterations
              max.step = 1)            # Maximum number of steps in the maximization step
summary(fit)
#> Estimating baseline model (H0)
#>
#> modsem (version 1.0.11):
#>
#> Estimator LMS
#> Optimization method EMA-NLMINB
#> Number of observations 1169
#> Number of iterations 118
#> Loglikelihood -33404.36
#> Akaike (AIC) 66946.73
#> Bayesian (BIC) 67296.14
#>
#> Numerical Integration:
#> Points of integration (per dim) 32
#> Dimensions 1
#> Total points of integration 32
#>
#> Fit Measures for Baseline Model (H0):
#> Loglikelihood -35523
#> Akaike (AIC) 71181.74
#> Bayesian (BIC) 71526.09
#> Chi-square 5519.01
#> Degrees of Freedom (Chi-square) 162
#> P-value (Chi-square) 0.000
#> RMSEA 0.168
#>
#> Comparative Fit to H0 (LRT test):
#> Loglikelihood change 2118.51
#> Difference test (D) 4237.01
#> Degrees of freedom (D) 1
#> P-value (D) 0.000
#>
#> R-Squared Interaction Model (H1):
#> INT 0.898
#> BEH 0.922
#> R-Squared Baseline Model (H0):
#> INT 0.896
#> BEH 0.867
#> R-Squared Change (H1 - H0):
#> INT 0.002
#> BEH 0.055
#>
#> Parameter Estimates:
#> Coefficients unstandardized
#> Information observed
#> Standard errors standard
#>
#> Latent Variables:
#> Estimate Std.Error z.value P(>|z|)
#> PBC =~
#> pbc2 1.000
#> pbc1 0.859 0.021 41.34 0.000
#> pbc3 0.935 0.017 55.09 0.000
#> pbc4 0.818 0.021 39.86 0.000
#> ATT =~
#> att3 1.000
#> att2 0.965 0.011 86.35 0.000
#> att1 0.812 0.017 47.18 0.000
#> att4 0.870 0.019 45.46 0.000
#> SN =~
#> sn4 1.000
#> sn2 1.313 0.041 32.30 0.000
#> sn3 1.350 0.041 32.72 0.000
#> sn1 1.000 0.038 26.61 0.000
#> INT =~
#> int2 1.000
#> int1 0.970 0.011 92.13 0.000
#> int3 0.984 0.010 98.42 0.000
#> int4 0.992 0.009 104.58 0.000
#> BEH =~
#> beh3 1.000
#> beh2 0.986 0.013 77.71 0.000
#> beh1 0.814 0.019 42.71 0.000
#> beh4 0.803 0.019 41.50 0.000
#>
#> Regressions:
#> Estimate Std.Error z.value P(>|z|)
#> INT ~
#> PBC 1.037 0.036 28.45 0.000
#> ATT -0.060 0.030 -2.04 0.041
#> SN 0.051 0.033 1.55 0.121
#> BEH ~
#> PBC 0.398 0.052 7.62 0.000
#> INT 0.595 0.048 12.28 0.000
#> PBC:INT 0.140 0.008 17.65 0.000
#>
#> Intercepts:
#> Estimate Std.Error z.value P(>|z|)
#> .pbc2 4.028 0.066 61.19 0.000
#> .pbc1 3.994 0.063 63.16 0.000
#> .pbc3 3.764 0.063 59.90 0.000
#> .pbc4 3.798 0.061 62.04 0.000
#> .att3 3.731 0.064 58.52 0.000
#> .att2 3.846 0.062 62.30 0.000
#> .att1 4.217 0.060 70.15 0.000
#> .att4 3.697 0.065 56.78 0.000
#> .sn4 4.505 0.051 87.75 0.000
#> .sn2 4.354 0.054 80.09 0.000
#> .sn3 4.393 0.055 80.20 0.000
#> .sn1 4.474 0.052 85.99 0.000
#> .int2 3.731 0.067 56.01 0.000
#> .int1 3.876 0.066 58.90 0.000
#> .int3 3.748 0.066 56.58 0.000
#> .int4 3.792 0.066 57.16 0.000
#> .beh3 2.667 0.076 34.98 0.000
#> .beh2 2.585 0.076 34.17 0.000
#> .beh1 2.546 0.073 35.07 0.000
#> .beh4 2.688 0.072 37.12 0.000
#>
#> Covariances:
#> Estimate Std.Error z.value P(>|z|)
#> PBC ~~
#> ATT 3.679 0.177 20.80 0.000
#> SN 1.937 0.117 16.60 0.000
#> ATT ~~
#> SN 1.681 0.110 15.30 0.000
#>
#> Variances:
#> Estimate Std.Error z.value P(>|z|)
#> .pbc2 0.701 0.042 16.68 0.000
#> .pbc1 1.455 0.069 21.00 0.000
#> .pbc3 0.802 0.043 18.62 0.000
#> .pbc4 1.458 0.068 21.28 0.000
#> .att3 0.296 0.023 12.81 0.000
#> .att2 0.306 0.022 13.81 0.000
#> .att1 1.286 0.057 22.58 0.000
#> .att4 1.584 0.071 22.42 0.000
#> .sn4 1.362 0.065 21.03 0.000
#> .sn2 0.491 0.032 15.39 0.000
#> .sn3 0.377 0.032 11.89 0.000
#> .sn1 1.445 0.068 21.13 0.000
#> .int2 0.237 0.014 16.97 0.000
#> .int1 0.404 0.020 20.25 0.000
#> .int3 0.335 0.017 19.35 0.000
#> .int4 0.271 0.015 17.90 0.000
#> .beh3 0.456 0.030 15.26 0.000
#> .beh2 0.513 0.031 16.55 0.000
#> .beh1 1.836 0.082 22.31 0.000
#> .beh4 1.916 0.085 22.42 0.000
#> PBC 4.362 0.209 20.83 0.000
#> ATT 4.454 0.197 22.60 0.000
#> SN 1.718 0.119 14.49 0.000
#> .INT 0.507 0.038 13.28 0.000
#> .BEH 0.449 0.034 13.24 0.000