CHI '26 · Honorable mention · full-paper review · confidence medium-high

Examining Interpretation Strategies for Multiple Forecast Visualizations with Two and Four Forecasts

Lace M. Padilla , Racquel Fygenson , Connor Wilson , Kristi Potter , Spencer C. Castro

This is a solid CHI empirical paper with a clear gap, a well-scoped experimental baseline, and a useful strategy taxonomy for multiple forecast visualizations. Its main contribution is not a new display, but evidence that viewers often reason in ways designers may not expect, including winner-takes-all and artifact-driven interpretations.

Axes Lens

Rare contribution shape, typical evidence profile. The point here is not a score. It is to show what kind of claim the paper makes, and whether the evidence pattern is unusual or baseline in this 268 -review set.

Contribution shape

Knowledge form: descriptive knowledge typical · 92/268
Novelty type: empirical finding typical · 68/268
Abstraction level: task typical · 36/268
Generalization target: design family typical · 38/268
Validation mode: controlled experiment typical · 47/268

Evidence profile

Evidence strength: strong typical · 158/268
Claim alignment: strong typical · 231/268
Overclaim risk: low typical · 53/268

Review Summary

This paper is strongest as a careful descriptive contribution to the CHI visualization literature. Rather than proposing a new forecast display, it asks a more foundational question: how do people actually interpret multiple forecasts when they are shown several common visualization types? The answer is empirically interesting and practically relevant. The paper shows that many participants do not behave like a rational integrator of uncertainty; instead, a substantial portion adopt winner-takes-all reasoning, while others visually average or latch onto salient graphical features such as intersection points or end caps. That is a meaningful departure from the common design assumption that viewers will naturally combine forecasts in a principled way. The novelty lies in the systematic, context-free baseline across five visualization types and two forecast-count conditions, supported by two preregistered experiments and strategy coding from open responses. The evidence is aligned with the claims and the paper is appropriately bounded: it explicitly acknowledges that removing domain cues helps isolate perceptual effects, but also limits direct transfer to real-world settings like weather, finance, or health. In field terms, this is a useful empirical finding that can inform future design guidance, especially for uncertainty communication and forecast comparison. The main caution is that the contribution is interpretive and baseline-oriented rather than mechanistic or general-purpose; still, within that scope, the paper appears rigorous and well positioned.

What Changed

Canon before

Prior CHI and visualization work has treated forecast displays as supporting aggregation or comparison, but the paper frames a gap in understanding how people actually interpret multiple competing forecasts.

Departure from common sense

The paper’s core surprise is that many lay participants do not combine multiple forecasts into a single integrated judgment; instead, a substantial portion use a winner-takes-all strategy, selecting one forecast as the most likely outcome. That behavior departs from the intuitive expectation that viewers would average or reconcile the forecasts.

Actual novelty

The paper contributes a context-free empirical baseline for interpreting multiple forecast visualizations across five common visualization types and two forecast-count conditions, with a detailed strategy analysis that identifies winner-takes-all, visual averaging, and artifact-driven reasoning. The novelty is primarily in the systematic characterization of interpretation strategies rather than a new visualization artifact.

Evidence

Two preregistered experiments examined how people interpret multiple forecast visualizations across five visualization types and two forecast-count conditions. The paper reports that a significant portion of lay audiences adopt winner-takes-all reasoning, that many participants visually average forecasts, and that some rely on visual artifacts such as intersection points or end caps. The study also explicitly frames its contribution as a context-free baseline and notes domain-cue limitations.

“ In contrast, our approach that people often average pairs of numbers [6], to the point that establishes a general baseline for how interpretation strategies vary scholars describe it as “a rule that is (almost) universally invoked across common visualization techniques”

actual novelty · 009_chi-conference-on-human-factors-in-computing-systems-chi-26-and-forecast-properties · confidence 0.72

“3790969 lay audiences adopt a winner-takes-all approach when in- terpreting visualized forecasts, favoring a single forecast as the most likely outcome instead of combining forecasts.”

departure from common sense · 009_chi-conference-on-human-factors-in-computing-systems-chi-26-and-forecast-properties · confidence 0.80

“ The most common approach aligned with our To do so, we adopted a context-free design that establishes a base- prediction that participants would rely on the visible extent of the line understanding of how people interpret multiple forecasts in the mark, placing their upper and lower bounds at the furthest visible absence of domain cues”

limitation · 025_sd-bands-were-most-likely-to-elicit-a-winner-takes-all-approach · confidence 0.66

“ izations, we conducted two preregistered experiments using five Multiple forecast visualizations offer a lesser-studied method common visualization types: median plots, 95% confidence intervals, for communicating implicit uncertainty by showing variance or standard deviation bands, density plots, and hypothetical outcome agreement among multiple forecasts”

validation scope · 009_chi-conference-on-human-factors-in-computing-systems-chi-26-and-forecast-properties · confidence 0.70

Limits

Method limits

The evidence comes from two preregistered experiments in a context-free setting, so the findings are strongest as a baseline for perceptual and interpretive strategies rather than as a full account of domain-specific decision making. The strategy taxonomy is grounded in participant responses and open-ended analysis, but it remains tied to the chosen visualization set and experimental framing.

Deployment limits

The results should not be treated as a direct prescription for all real-world forecasting contexts. Domain-specific settings such as weather, finance, or health may introduce cues and incentives that alter interpretation strategies, so deployment claims should be limited to settings where the display is similarly decontextualized or where designers explicitly account for domain effects.

Boundary conditions

The paper’s own framing indicates that the baseline is context-free and that domain-specific scenarios can introduce strong biases and idiosyncratic reasoning. The reported winner-takes-all and artifact-based strategies are therefore best understood as boundary-sensitive patterns that may vary with domain cues, forecast agreement, and visualization type.

Position in field

This sits in the CHI visualization and uncertainty-communication literature as an empirical study of how people interpret multiple forecasts, extending prior work from display design toward interpretation strategy analysis. Its value is in clarifying user reasoning patterns that can inform future forecast visualization design and evaluation.

Abstract

Multiple forecast visualizations (MFVs) present curated sets of forecasts to support decision-making under uncertainty. However, the research community knows little about how people interpret and integrate competing forecasts. In this study, we investigate the strategies individuals use when predicting hypothetical future events with MFVs across five visualization types (median, 95\% CIs, standard deviation intervals, density plots, and hypothetical outcome plots) and multiple probability distributions in two preregistered experiments (\textit{n} = 500 each). Analysis of 18 participant strategies and open responses shows that whereas many participants attempted to visually average across forecasts, others adopted a winner-takes-all approach (\textit{e.g.,} selecting a single forecast as the most likely outcome), which deviates from rational agent expectations. We also observed reliance on visual artifacts, such as intersection points or end caps. These findings underscore the complexity of interpreting a range of forecasts and help explain why individuals may privilege particular predictions in real-world decision contexts.