When should we prefer a Large Sparse (LS) panel over a Small Rich (SR) panel for measuring advertising effectiveness?
| Panel Type | Size | Covariates | Data Quality |
|---|---|---|---|
| Large Sparse (LS) | 50,000 | Basic demographics (age, gender, region, income) | 85% ad tracking, 60% purchase linkage |
| Small Rich (SR) | 4,000 | + purchase history, online behavior, brand awareness | 98% ad tracking, 95% purchase linkage |
Confounding: Demographics drive BOTH ad exposure AND purchase probability
- Women are 2-5× more likely to see ads (targeting)
- Women are also 2-5× more likely to purchase (baseline behavior)
- Naive analysis: "Ads caused huge lift!" Reality: It's confounding
Measurement Error: Neither panel observes everything perfectly
- LS panel: Misses 15% of ad exposures, 40% of purchases
- SR panel: Misses 2% of ad exposures, 5% of purchases
Vary systematically across:
- Confounding strength (1× to 10× gender effect)
- True advertising effect (10-30% log-odds)
- Measurement error (on/off)
- Panel type (LS vs SR)
Evaluate on:
- Statistical metrics: Bias, RMSE, coverage
- Decision quality: Correct go/no-go decisions, expected utility loss
- Why? Rich covariates control for confounding better
- Look for: Lower RMSE, higher decision accuracy at confounding ≥ 3×
- Plot: [BIAS & VARIANCE BY CONFOUNDING STRENGTH]
- Why? Large sample size → narrow confidence intervals
- Look for: Similar utility but narrower intervals at confounding ≤ 1.5×
- Plot: [CONFIDENCE INTERVAL WIDTH COMPARISON]
- Why? LS already has lower quality + attenuation bias compounds
- Look for: Larger utility loss for LS when measurement error = TRUE
- Plot: [UTILITY LOSS: WITH vs WITHOUT MEASUREMENT ERROR]
- Why? Smaller effects harder to detect → need better confounding control
- Look for: SR advantage emerges at lower confounding when true effect is small
- Plot: [DECISION ACCURACY HEATMAP: CONFOUNDING × EFFECT SIZE]
Decision Accuracy: % of correct go/no-go decisions
- Threshold: £100k campaign must generate break-even lift of [VALUE]%
- Placeholder: LS: ___% correct | SR: ___% correct
Expected Utility: Average profit/loss from decisions
- True optimal decision utility: £[VALUE]
- Placeholder: LS achieves ___% of optimal | SR achieves ___% of optimal
Utility Loss: Cost of wrong decisions
- Type I error: Waste £100k on ineffective campaign
- Type II error: Miss profitable opportunity
- Placeholder: Mean loss - LS: £[VALUE] | SR: £[VALUE]
When to use Large Sparse panel:
- Confounding strength < [THRESHOLD]
- True effect size > [THRESHOLD]
- High data quality (measurement error < [THRESHOLD]%)
- Budget constraints (cheaper to maintain)
When to use Small Rich panel:
- Confounding strength > [THRESHOLD]
- Small/moderate effects need precise measurement
- Multiple confounders present
- Decision stakes are high (utility loss matters)
Recommendation: [TO BE COMPLETED AFTER RESULTS]
- Break-even lift percentage (from config)
- Cross-over threshold for confounding strength
- Mean decision accuracy by panel type
- Mean utility loss by panel type
- Specific scenarios where each panel wins
bias_variance_plot.png- Shows bias/RMSE by confoundingdecision_accuracy_plot.png- Correct decision rate comparisonutility_loss_plot.png- Expected utility loss by scenario
- Slide 3: Detailed methodology (DGP equations)
- Slide 4: Sensitivity analysis
- Slide 5: Business implications and recommendations