Mann Whitney U Test in SPSS - Concept, Interpretation, and Reporting Mann Whitney U Test
Based on Research With Fawad's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Mann–Whitney U test is the independent-samples t test alternative when normality isn’t reasonable or when data are ordinal.
Briefing
Mann–Whitney U test is presented as the go-to non-parametric alternative to the independent-samples t test when group data are ordinal or continuous but not normally distributed—and when comparing means isn’t appropriate. Instead of testing whether two groups differ in their means, the Mann–Whitney U test evaluates whether the distributions’ ranks differ, which effectively corresponds to a comparison of medians for the two groups. Because the method converts scores into ranks, the original score distribution doesn’t need to be normal, and the data do not need to be continuous.
The session lays out practical situations where this test fits. One example involves a quality manager comparing service quality ratings received from suppliers versus customers. Service quality is measured with three ordered categories—excellent, moderate, and poor—so the data are ordinal. Another example uses a market researcher comparing attention to social media between men and women, again measured with a three-option ordinal statement. A third scenario is an HR manager comparing compensation across two departments (finance and HR): compensation is continuous but described as non-normal, which also makes Mann–Whitney U appropriate.
A key interpretation point is that statistical significance and effect size are not the same. After running the test in SPSS, the results include a p-value indicating whether the rank-based difference between groups is statistically significant, but SPSS does not directly provide an effect size in this workflow. The session therefore computes effect size using a z-based approach: effect size r is calculated as z divided by the square root of the total sample size (r = z / √N). In the worked example, there are 158 suppliers and 146 customers (N = 304). The output shows a significance value below 0.05, indicating a significant difference in perceived service quality between suppliers and customers. The z statistic is reported as 2.6, and dividing by √304 (about 17.43) yields an effect size around 0.14, which is interpreted as a small effect—meaning the difference exists but is not large.
Reporting guidance ties the statistical output back to the original question and hypothesis. The session demonstrates writing a results statement that the Mann–Whitney U test found significant differences in service quality perception between suppliers and customers (p < 0.05), while also noting that the effect size is small. To make the report more concrete, it also retrieves median values for each group in SPSS: both suppliers and customers have a median service-quality rating of 2. The U statistic is shown as 971, and the z statistic and effect size are used to support the conclusion that the groups differ significantly in ranks, even though the median values are the same—consistent with a small effect.
Overall, the workflow emphasizes choosing Mann–Whitney U when normality assumptions fail or when data are ordinal, interpreting rank-based differences via p-values, and quantifying practical impact through an effect size calculation.
Cornell Notes
Mann–Whitney U test serves as the non-parametric alternative to the independent-samples t test when group data are ordinal or continuous but non-normal. Rather than comparing means, it compares the ranks of observations across two independent groups, which aligns with a median-based interpretation. In SPSS, the test is run under Analyze → Nonparametric Tests → Legacy Dialogs → Two Independent Samples, selecting the grouping variable (e.g., supplier vs. customer) and the test variable (e.g., service quality). A p-value below 0.05 indicates a statistically significant difference in ranks between groups. Practical importance is assessed by computing effect size r = z / √N; in the example, r ≈ 0.14 indicates a small effect even when the result is significant.
When should researchers choose Mann–Whitney U test instead of an independent-samples t test?
What does Mann–Whitney U compare: means or medians?
How does SPSS output support interpretation of group differences?
Why compute effect size manually, and how is it calculated here?
How should results be reported, including medians and test statistics?
Review Questions
- In what two data conditions does Mann–Whitney U test become preferable to the independent-samples t test?
- How do you interpret a significant p-value alongside a small effect size r in the Mann–Whitney U context?
- What SPSS menu path is used to run the Mann–Whitney U test for two independent groups in the workflow described?
Key Points
- 1
Mann–Whitney U test is the independent-samples t test alternative when normality isn’t reasonable or when data are ordinal.
- 2
The test works by converting scores to ranks, so the original distribution shape doesn’t need to be normal.
- 3
Interpretation relies on the p-value for statistical significance and on effect size for practical importance.
- 4
Effect size can be computed as r = z / √N when SPSS doesn’t provide it directly in the shown workflow.
- 5
A small effect size can coexist with a statistically significant result, meaning differences may be real but not large.
- 6
Reporting should include a hypothesis-aligned statement, p-value, U and z (as available), effect size, and group medians for context.