Data Distribution/Normality in PLS SEM using SmartPLS

TL;DR

PLS-SEM is more flexible than covariance-based SEM regarding distribution assumptions, but extreme non-normality can still be problematic.

Briefing Cornell Notes

Briefing

PLS-SEM work in SmartPLS often raises a practical question: should researchers test whether survey responses are normally distributed? The core takeaway is that PLS-SEM doesn’t rely on strict normality assumptions the way covariance-based SEM does, but normality still matters because extreme non-normality can distort results. Since survey data are recorded across fixed response categories (e.g., 1–7 agree/disagree scales), researchers can summarize how responses cluster across categories using frequency tables, bar charts, or histograms—then assess whether the overall pattern resembles a bell-shaped normal curve.

In the transcript’s example, most respondents select the middle categories (around 4 on a 7-point scale), with fewer responses at the extremes (1 and 7). That “bell-shaped” pattern is the hallmark of a normal distribution: values concentrate near the mean and thin out toward both tails. Real data, however, can deviate from normality in predictable ways, such as becoming skewed (asymmetry) or having heavy/light tails (captured by kurtosis). Rather than relying on a single visual check, the recommended approach is to quantify deviation from normality.

For PLS-SEM, the transcript notes that formal normality tests like the Shapiro–Wilk test or other goodness-of-fit tests (e.g., “cgov smof”) can be applied, but in practice many researchers focus on two distribution diagnostics: skewness and kurtosis. Skewness indicates whether the distribution leans left or right; kurtosis reflects how peaked or flat the distribution is and how heavy the tails are. Common rule-of-thumb thresholds are mentioned: skewness is typically expected to fall within about ±2, while kurtosis is often considered acceptable up to around 5 (with some references allowing values up to 10). The transcript also points to established SEM methodology references (including guidance attributed to Hair, Black, Babin, and Anderson, and a note about Joel Kier’s discussion of kurtosis ranges).

The workflow in SmartPLS is straightforward. After loading the dataset, researchers can inspect skewness and kurtosis values directly in the software’s output (the transcript points to the “skewness and kosis” section). If both metrics sit comfortably within the cited ranges—no extreme violations—analysis can proceed without concern for normality-driven distortions. If values fall outside those bounds, the transcript’s logic implies the need for caution and potentially further steps, because the distribution may be materially non-normal even if PLS-SEM itself is more forgiving than covariance-based SEM.

Overall, the message is balanced: PLS-SEM may not require normality by assumption, but distribution checks remain a safeguard against outlier-heavy or strongly asymmetric survey patterns that could undermine inference.

Cornell Notes

PLS-SEM in SmartPLS generally does not impose strict normality assumptions like covariance-based SEM, but survey data can still be meaningfully non-normal. Because responses are collected on fixed Likert-type categories (e.g., 1–7), researchers can visualize distributions with frequency tables or histograms and then quantify deviation from normality using skewness and kurtosis. Common guidelines treat skewness within about ±2 as acceptable, while kurtosis is often considered acceptable around 5 (with some references allowing up to 10). SmartPLS provides skewness and kurtosis outputs, letting researchers quickly check whether values show extreme violations. If skewness and kurtosis fall within reasonable ranges, the analysis can proceed with confidence that normality is not a major concern.

Why do researchers even check normality when using PLS-SEM in SmartPLS?

PLS-SEM is described as making no strict assumptions about data distribution, unlike covariance-based SEM. Still, extreme non-normality can cause problems, so distribution diagnostics are used as a safeguard. The transcript’s practical stance is: don’t ignore distribution—check whether it shows extreme skewness or kurtosis that could meaningfully distort results.

What does a “normal-looking” distribution mean for Likert-scale survey data?

A normal-looking distribution is bell-shaped and roughly symmetric around the mean. In the example with a 7-point agree/disagree scale, most responses cluster in the middle category (around 4), while very few fall at the extremes (1 and 7). That pattern indicates a concentration near the mean with thin tails on both sides.

Which metrics are used to assess deviation from normality, and what do they capture?

Skewness and kurtosis. Skewness measures asymmetry—whether the distribution leans left or right. Kurtosis measures tail heaviness and peakedness—how heavy or light the tails are compared with a normal distribution. Together they indicate whether the data deviates strongly from normality.

What threshold ranges are suggested for skewness and kurtosis?

The transcript gives rule-of-thumb ranges: skewness should be within about ±2. Kurtosis is typically acceptable up to around 5, though some references mention values up to 10. It also cites methodology guidance associated with Hair, Black, Babin, and Anderson, and a note that Joel Kier discusses kurtosis up to 10.

How can researchers check skewness and kurtosis inside SmartPLS?

After opening the dataset in SmartPLS, researchers can navigate to the section showing skewness and kurtosis values (the transcript points to “skewness and kosis”). They then compare those values to the cited acceptable ranges. If both metrics are well within bounds, normality is not considered a major issue and analysis can proceed.

Review Questions

What practical reason does the transcript give for checking skewness and kurtosis in PLS-SEM even though PLS-SEM makes no strict distribution assumptions?
How do skewness and kurtosis differ in what they reveal about a distribution’s shape?
In SmartPLS, what specific outputs should be inspected to assess normality-related concerns?

Key Points

1
PLS-SEM is more flexible than covariance-based SEM regarding distribution assumptions, but extreme non-normality can still be problematic.
2
Likert-scale survey responses can be summarized with frequency tables, bar charts, or histograms to see whether they form a bell-shaped pattern.
3
Skewness and kurtosis are the main quantitative diagnostics used to assess deviation from normality.
4
A common guideline treats skewness within about ±2 as acceptable, while kurtosis is often considered acceptable up to around 5 (with some references allowing up to 10).
5
SmartPLS provides skewness and kurtosis outputs, enabling a quick check before proceeding with analysis.
6
If skewness and kurtosis show no extreme violations, normality is unlikely to be a major concern for the PLS-SEM results.

Highlights

Even without strict normality assumptions, PLS-SEM still benefits from checking skewness and kurtosis to avoid extreme distribution problems.

A bell-shaped, roughly symmetric response pattern (most answers near the middle category) is the visual hallmark of normality for Likert data.

SmartPLS can display skewness and kurtosis directly, making normality screening a built-in step rather than a separate calculation.

Rule-of-thumb thresholds are repeatedly emphasized: skewness around ±2 and kurtosis around 5 (sometimes up to 10).

Topics

Normality Testing
PLS-SEM
SmartPLS
Skewness
Kurtosis

Mentioned

SmartPLS