06. SEMinR. Review of Model Specification, Composites, Modes (A/B), and Weighting Schemes

TL;DR

PLS-SEM estimation starts by specifying a measurement model that defines each construct as a composite of its indicators.

Briefing Cornell Notes

Briefing

A PLS-SEM workflow hinges on two linked pieces: how constructs are turned into measurable “composites” (the measurement model) and how those constructs are connected by hypothesized paths (the structural model). In this seminar-style walkthrough, the model being tested links Vision development and rewards to collaborative culture, with each construct measured by multiple survey items. The practical takeaway is that once the measurement and structural models are specified, the PLS algorithm generates construct scores that then feed regression-based estimation for both indicator relationships (weights/loadings) and path relationships (path coefficients). That chain matters because it determines what gets estimated, how it gets estimated, and which assumptions (reflective vs. formative) control the math.

The process starts in R by loading the dataset from a CSV file (including a header row and comma separators) into a data object. Next comes the measurement model specification, where constructs such as Vision development, rewards, and collaborative culture are defined as composites built from item indicators (e.g., V1–V4 for Vision development, RW1–RW? for rewards, and CC1–CC? for collaborative culture). The transcript emphasizes the conceptual role of composites in PLS-SEM: indicators are linearly combined to form composite variables that act as valid proxies for the underlying latent constructs.

With the measurement model set, the structural model is specified as directional paths from independent variables to dependent variables. In the example, Vision development and rewards are treated as independent variables predicting collaborative culture as the dependent variable. The structural model is encoded by listing the IVs as a vector and the DV as a separate target; additional DVs can be handled by adding more paths.

Estimation then follows the PLS-SEM algorithm logic used in SmartPLS: after running the PLS algorithm, construct scores are produced and used as inputs to regression models within the path model. The results include measurement-model estimates (indicator weights and loadings) and structural-model estimates (path coefficients). A key fork in the road is whether constructs are reflective or formative. Reflective constructs assume an arrow from the construct to the indicators, treat items as interchangeable, and typically use Mode A estimation (often described as based on by-variate regression/correlation logic). Formative constructs assume the arrow runs from indicators to the construct, require Mode B estimation, and are validated differently; the transcript notes that reflective is the default assumption in PLS-SEM, while formative requires explicit Mode B settings.

Finally, the weighting scheme affects estimation details. While alternative weighting schemes can yield slightly different results, the transcript highlights that the path weighting scheme is the most popular and recommended because it tends to produce the highest R² for the endogenous latent variable and works broadly across PLS path model specifications. The session closes by pointing ahead to how to implement the PLS estimate function in R after this review of model specification, composites, modes (A/B), and weighting schemes.

Cornell Notes

The workflow for PLS-SEM centers on turning survey items into latent-variable proxies (composites) and then linking those latent variables with hypothesized paths. Constructs like Vision development, rewards, and collaborative culture are modeled as composites formed by linearly combining their indicators. After specifying the measurement model and structural paths (e.g., Vision development and rewards → collaborative culture), the PLS algorithm computes construct scores, which are then used in regression-based estimation to obtain indicator weights/loadings and path coefficients. A major assumption choice is reflective vs. formative measurement: reflective uses Mode A by default (construct → indicators), while formative uses Mode B (indicators → construct). Weighting schemes also matter; path weighting is recommended for typically higher R² on endogenous constructs.

What does it mean in PLS-SEM to treat constructs as “composites,” and why does that matter for estimation?

In PLS-SEM, constructs are represented as composite variables built by linearly combining their indicators. Those composite scores become proxies for the latent constructs, so the model’s estimation depends on how indicators are combined into these composites. Once the PLS algorithm computes construct scores from the indicators, those scores are used as inputs to regression models in the path model, producing both measurement-model outputs (indicator weights/loadings) and structural-model outputs (path coefficients).

How is the structural model specified when there are multiple independent variables and one dependent variable?

The structural model is encoded as paths from IVs to a DV. In the example, Vision development and rewards are the independent variables, and collaborative culture is the dependent variable. The IVs are grouped into a vector (or array) using the same construct names as in the measurement model, and the DV is specified separately. If there were multiple dependent variables, additional paths would be added (e.g., by extending the path specification with commas).

What is the practical difference between reflective and formative constructs in Mode A vs. Mode B?

Reflective measurement assumes the arrow flows from the construct to the indicators, treating items as interchangeable; PLS-SEM defaults to reflective constructs and uses Mode A estimation (described as correlation/weight logic for loadings). Formative measurement assumes the arrow flows from indicators to the construct; it requires Mode B estimation and is validated differently. The transcript stresses that reflective models typically do not require explicitly mentioning Mode A, while formative models require specifying Mode B.

Why does the PLS algorithm come before regression in the estimation workflow?

The PLS algorithm determines construct scores by combining indicators according to the measurement model. Those construct scores then serve as independent variables in regression models used within the path model. This sequencing is why the algorithm’s output directly affects the later estimation of path coefficients and the measurement-model parameters like indicator weights/loadings.

How do weighting schemes influence PLS-SEM results, and which one is recommended here?

Different weighting schemes can produce slightly different results after running the PLS algorithm. The transcript highlights the path weighting scheme as the most popular and generally recommended approach because it tends to provide the highest R² for the endogenous latent variable and is applicable across many PLS path model specifications.

Review Questions

In the example model, which constructs are treated as independent variables and which as the dependent variable, and how are those relationships represented as paths?
What assumptions distinguish reflective from formative measurement, and how do those assumptions determine whether Mode A or Mode B is used?
After running the PLS algorithm, what outputs are produced first, and how do those outputs feed into the regression estimation step?

Key Points

1
PLS-SEM estimation starts by specifying a measurement model that defines each construct as a composite of its indicators.
2
Construct composites are formed by linearly combining indicators, and the resulting construct scores act as proxies for latent variables.
3
The structural model is specified as directional paths from independent constructs to dependent constructs using consistent construct names.
4
Running the PLS algorithm first produces construct scores, which then feed regression models to estimate both measurement parameters (weights/loadings) and structural parameters (path coefficients).
5
Reflective constructs (construct → indicators) are the default in PLS-SEM and typically use Mode A estimation; formative constructs (indicators → construct) require Mode B.
6
Weighting schemes can change results slightly; the path weighting scheme is recommended because it often yields higher R² for endogenous constructs.
7
Implementations in R (including PLS estimate) depend on matching these model-specification choices—especially reflective vs. formative mode and weighting scheme settings.

Highlights

Composites are not just a modeling convenience: PLS-SEM uses them as the proxies that drive construct-score computation and downstream regression estimation.

Reflective vs. formative measurement determines the direction of the arrow and the required estimation mode—Mode A for reflective (default) and Mode B for formative.

Path weighting is singled out as the most popular and recommended scheme because it tends to maximize R² for endogenous latent variables.

The workflow is sequential: specify measurement and structural models, run the PLS algorithm to get construct scores, then estimate regression-based path coefficients and indicator parameters.

Topics

PLS-SEM Workflow
Measurement Model
Structural Paths
Reflective vs Formative
Weighting Schemes

Mentioned

PLS-SEM
PLS
SCM
IV
DV
R²
PLS algorithm
SmartPLS