Skip to main content

Dataset Comparison

This page provides a comprehensive side-by-side comparison of all four primary result groups. Each group represents an independent dataset with progressively less restrictive quality criteria. Researchers can use this comparison to assess how data cleaning decisions affect statistical conclusions.

Sample Overview

#Result GroupNDefinition
1Conservative Clean80Prolific APPROVED + all quality checks (IRI, duration >= 540s, reCAPTCHA, straightlining, auth)
2Flexible Clean127Prolific APPROVED + basic quality (all 3 IRIs + duration >= 480s)
3Prolific Accepted231All deduplicated V2 rows with Prolific APPROVED status
4All V2 Finished368Finished + duration >= 120s (extreme speeders excluded)

Core Metrics Comparison

All descriptive statistics, reliability coefficients, and correlations are shown for each result group. The Δ column shows the difference from Conservative Clean (the strictest dataset). Deltas > 0.05 are highlighted in amber.

MetricConservative CleanFlexible CleanProlific AcceptedAll V2 FinishedΔ FlexibleΔ ProlificΔ All
Barrier Grand Mean2.82392.81462.77902.7401-0.0093-0.0449-0.0838
Barrier SD0.63320.71780.71530.7714+0.0846+0.0821+0.1382
Readiness Grand Mean3.05283.09683.14003.2494+0.0440+0.0872+0.1966
Readiness SD0.58060.65720.67490.7150+0.0766+0.0943+0.1344
Maturity Grand Mean3.03773.06613.15713.2667+0.0284+0.1194+0.2290
Maturity SD0.70610.79600.80550.8058+0.0899+0.0994+0.0997
B-R Correlation-0.4856-0.4595-0.3423-0.3165+0.0261+0.1433+0.1691
B-M Correlation-0.2315-0.3197-0.2787-0.3100-0.0882-0.0472-0.0785
R-M Correlation0.58990.69820.71250.7341+0.1083+0.1226+0.1442
Alpha Barriers0.85800.87720.87700.9011+0.0192+0.0190+0.0431
Alpha Readiness0.87560.91810.92070.9305+0.0425+0.0451+0.0549
Alpha Maturity0.83300.88330.88970.8900+0.0503+0.0567+0.0570

Survey Demographics Comparison (Qualtrics)

Role distribution and organization size breakdown for each result group, based on self-reported survey responses (Q1, Q4). These are organizational demographics from the TABS instrument. Prolific platform demographics (age, sex, ethnicity, plus prescreener fields like industry, company size, and occupation) are collected separately and can be used to cross-validate these survey responses via Prolific Participant ID.

Tech vs Non-Tech Composition

Result GroupNTechnicalNon-TechnicalOther% Tech
Conservative Clean8015521318.8%
Flexible Clean12729791922.8%
Prolific Accepted229511334522.3%
All V2 Finished363912066625.1%

Organization Size Distribution

Result Group<100100–499500–9991000–49995000–999910000+
Conservative Clean929718611
Flexible Clean17401630816
Prolific Accepted405733482033
All V2 Finished569857733153

Filter Bias Analysis

This analysis tests whether stricter quality filters disproportionately exclude certain demographics. A Chi-square test for independence is computed across the four result groups for role, organization size, and profit model.

Demographic CategoryChi-Square (χ²)dfp-valueInterpretation
Role (Tech vs Non-Tech)3.5160.7425No significant difference (demographics stable)
Organization Size9.46150.8520No significant difference (demographics stable)
Profit Model2.6960.8462No significant difference (demographics stable)

Profit Model Distribution

Result GroupFor-ProfitNon-ProfitGovernment/Public Sector
Conservative Clean57158
Flexible Clean872713
Prolific Accepted1724118
All V2 Finished2776229

Effect Size Comparison

Cohen’s d effect sizes for key group comparisons across all four result groups. This shows how effect sizes shift as the sample becomes less restrictive.

Tech vs Non-Tech (Cohen’s d)

ConstructConservative CleanFlexible CleanProlific AcceptedAll V2 Finished
barriers-0.234-0.306-0.118-0.120
readiness0.5430.5750.5600.355
maturity0.1520.2950.3140.210

Large vs Small/Medium Org (Cohen’s d)

ConstructConservative CleanFlexible CleanProlific AcceptedAll V2 Finished
barriers0.5480.4760.2250.233
readiness-0.067-0.2680.0510.027

Interpretation Guide

Metrics that remain stable across all four groups suggest robust findings that are not sensitive to data cleaning decisions. Metrics that show large deltas (highlighted in amber) between Conservative Clean and less restrictive groups warrant further investigation, as the finding may depend on sample composition.

As a rule of thumb: if a Cohen’s d shifts by more than 0.1 between Conservative Clean and All V2 Finished, the effect size may be inflated or attenuated by lower-quality responses. Similarly, if Cronbach’s α drops below 0.70 in larger samples, it may indicate that less-engaged respondents are adding noise to the scale.

Related