Robustness Checks #SmartPLS4 - Endogeneity using Gaussian Copula in #SmartPLS4?
Based on Research With Fawad's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
SmartPLS4 Gaussian copula endogeneity testing should be run first on individual paths, then on all possible combinations of paths (pairs, triples, etc.).
Briefing
Endogeneity checks in SmartPLS4 can be made more reliable by testing not just single paths, but every combination of paths using the Gaussian copula approach. The workflow starts by selecting “Gaussian copula,” then running an endogeneity test for each individual relationship (path) in the structural model. After that, the process scales up: it tests pairs of relationships together, then triples, and continues through to the maximum number of relationships present in the model. This matters because endogeneity can hide in specific paths or emerge only when multiple paths are considered jointly, leading to misleading path significance if it’s not detected.
In practice, the procedure is iterative. A user first saves the output after testing one relationship and runs bootstrapping to obtain path coefficients and p-values. If the resulting path coefficient is insignificant, that path is treated as not showing endogeneity concerns. The transcript also notes a practical reporting step: copy the relevant output row into Excel for tracking, including the p-value and the Gaussian copula results. The output is interpreted in terms of which variable is exogenous versus endogenous (for example, “TL” as the endogenous variable and the Gaussian copula output as the test statistic/indicator).
Next, the user removes that tested relationship and adds a different path, then reruns bootstrapping. In the example, the second relationship produces a significant result, which flags endogeneity issues for that specific path—meaning the model may violate assumptions that underpin standard inference. The transcript emphasizes that resolving endogeneity is a separate topic, to be handled later, but the detection step is treated as complete once the problematic paths are identified.
After single-path testing, the workflow moves to combinations. The user runs the Gaussian copula endogeneity test for two relationships at a time, then checks the resulting path coefficients again. In the example, one relationship in the pair shows no endogeneity problems while the other still requires attention, illustrating why combination testing is useful: it helps map where endogeneity is concentrated and how it interacts across paths. The same logic is repeated for all possible pairs, then for all possible triples, and then for all possible quadruples (and so on), until every relevant combination has been assessed.
The final deliverable is a table summarizing endogeneity test results across single paths and multi-path combinations. The decision rule is straightforward: p-values above 0.05 indicate no endogeneity issue, while p-values at or below 0.05 suggest endogeneity concerns. With those results compiled, the transcript frames this as the second step in robustness checking—after which the next robustness dimension to examine is heterogeneity.
Cornell Notes
Gaussian copula endogeneity testing in SmartPLS4 is performed in stages: first for each individual path, then for every combination of paths (pairs, triples, up to the maximum number in the model). Each run uses bootstrapping to generate path coefficients and p-values, which are then recorded (e.g., copied into Excel) to build a clear map of where endogeneity appears. A significant Gaussian copula result for a given path flags endogeneity issues, while p-values greater than 0.05 indicate no endogeneity concern. Testing combinations matters because some endogeneity problems may be path-specific or may only become evident when multiple relationships are assessed together. The output is typically summarized in a table for all single and multi-path combinations.
How does the Gaussian copula endogeneity check work in SmartPLS4, and why is it done in stages?
What does it mean when a path coefficient is insignificant versus significant after bootstrapping?
What practical step is recommended for interpreting results across multiple runs?
How are multi-path combinations tested, and what changes compared with single-path testing?
What decision rule is used to conclude whether endogeneity is present?
Review Questions
- When would you test only single relationships, and when is it necessary to test combinations of relationships as well?
- What p-value threshold is used to decide whether endogeneity is present, and how should those results be organized for later interpretation?
- How does removing one relationship and adding another before rerunning bootstrapping help isolate endogeneity problems in a SmartPLS4 model?
Key Points
- 1
SmartPLS4 Gaussian copula endogeneity testing should be run first on individual paths, then on all possible combinations of paths (pairs, triples, etc.).
- 2
Bootstrapping is used for each run to generate path coefficients and p-values that indicate whether endogeneity concerns exist for the selected relationship(s).
- 3
Significant results for a tested path or combination flag endogeneity issues, while p-values above 0.05 indicate no endogeneity problem.
- 4
Copying output rows into Excel after each run helps build a consolidated table of endogeneity results across many paths and combinations.
- 5
Combination testing can reveal endogeneity patterns that may not be obvious when only one path is tested at a time.
- 6
Resolving detected endogeneity is treated as a separate follow-up step, after the detection and mapping are complete.