CHI '26 · Honorable mention · full-paper review · confidence medium-high

Investigating the Interaction of Game Features and Spatial Skills with Performance and Perceived Difficulty in a Block-Based 3D Programming Puzzle Game

Yasitha Rajapaksha , John Thomas Bacher , Tiffany Barnes

This is a solid empirical CHI paper with a clear design contribution: it moves beyond generic claims about spatial ability by showing that specific 3D game features differentially affect performance and perceived difficulty. The strongest value is the combination of a validated difficulty measure and feature-level analysis, though the scope remains bounded by a modest online sample.

Axes Lens

Rare contribution shape, typical evidence profile. The point here is not a score. It is to show what kind of claim the paper makes, and whether the evidence pattern is unusual or baseline in this 268 -review set.

Contribution shape

Knowledge form: causal knowledge typical · 31/268
Novelty type: empirical finding typical · 68/268
Abstraction level: task typical · 36/268
Generalization target: task class typical · 63/268
Validation mode: controlled experiment typical · 47/268

Evidence profile

Evidence strength: strong typical · 158/268
Claim alignment: strong typical · 231/268
Overclaim risk: low typical · 53/268

Review Summary

This paper’s main contribution is not a new interaction technique or system, but a careful empirical mapping of how design choices in a 3D block-based programming puzzle game interact with spatial ability and player experience. The result that spatial skills predict performance but not perceived difficulty is a useful departure from a simplistic intuition that “better spatial thinkers will feel the task is easier.” Instead, the paper suggests that subjective difficulty is shaped more directly by specific game features and constraint structures, such as loops and orientation changes, than by spatial skill alone. That is a meaningful CHI insight because it helps separate performance from perceived challenge in a domain where both matter for learning and engagement. The validation of the custom perceived difficulty scale also strengthens the paper: the authors do not merely report an outcome variable, they show that it behaves as a reliable one-factor measure. At the same time, the evidence is best read as a bounded empirical finding rather than a broad theory. The study is online, uses 60 participants, and varies only a limited number of levels per feature, so the design space is sampled sparsely. That means the paper is strongest as a methodological and empirical argument for early-stage design guidance in similar block-based 3D learning games, not as a universal claim about all programming environments or all learners. The practical takeaway is clear: designers should not assume spatial skill alone will explain perceived difficulty; they should instead provide scaffolds for mental-model shifts and explicit cues for repetition and abstraction. Overall, this is a credible honorable-mention-level contribution because it combines a focused question, a validated measure, and interpretable feature effects, while remaining appropriately cautious about generalization.

What Changed

Canon before

Prior CHI work on block-based programming and spatial reasoning has often treated spatial skill as a general predictor of programming success, but less often isolates how specific 3D game features alter performance and perceived difficulty.

Departure from common sense

The paper shows that spatial skill is not a universal proxy for felt difficulty: it predicts objective performance, but the subjective difficulty ratings are driven more by specific level features and constraint structures than by spatial ability alone.

Actual novelty

By examining these relationships in a 3D block-based programming game, our study is among the first to provide empirical evidence on how level features, spatial ability, player characteristics, and constraint mechanisms jointly shape performance and perceived challenge.

Evidence

The paper reports an online study with 60 players and analyzes how feature changes affected performance and perceived difficulty. It validates a custom perceived difficulty scale as unidimensional and reliable, then shows that spatial skills predict performance but not perceived difficulty, while specific level features such as larger/more complex layouts, backward-facing characters, and loops affect outcomes differently.

“ By examining these relationships in a 3D block-based programming game, our study is among the first to provide empirical evidence on how level features, spatial ability, player characteristics, and constraint mechanisms jointly shape performance and perceived ch”

actual novelty · Abstract and Introduction (last paragraph of Introduction). · confidence 0.72

“ Spatial skills strongly predicted performance but did not predict perceived difficulty”

departure from common sense · Abstract; also discussed in Discussion (RQ2). · confidence 0.80

“ The sample size was modest (N=60) and lacked ethnic diversity, which limits the generalizability of the findings”

limitation · Limitations and Future Work (Section 7). · confidence 0.92

“1 Perceived Difficulty Scale We first validated our custom perceived difficulty scale, which proved to be a highly reliable unidimensional measure .”

validation scope · Results 4.1 Perceived Difficulty Scale. · confidence 0.88

Limits

Method limits

The study uses a modest sample (N=60), limited feature-level variation, and an online setting, so statistical power and causal breadth are constrained. The custom perceived difficulty scale is validated internally, but broader psychometric comparison remains limited.

Deployment limits

Findings are most directly applicable to early-stage design of 3D block-based programming puzzles and similar novice-learning contexts; transfer to other genres, age groups, or classroom settings should be tested before adoption.

Boundary conditions

Effects were observed in a 3D block-based programming puzzle game with two to three levels per feature and an online participant pool. The reported relationships may depend on the specific feature manipulations, puzzle structure, and the measured spatial-skill distribution.

Position in field

This paper sits at the intersection of educational games, programming pedagogy, and spatial cognition, contributing a feature-level empirical account of how 3D puzzle design choices interact with learner differences to shape both objective performance and subjective difficulty.

Abstract

Block-based programming puzzle games are popular as engaging, visual tools for introducing novice learners to programming. Many incorporate 3D environments, where players navigate and solve spatial challenges. However, little research has examined how spatial reasoning skills interact with game features to shape player experience. This study investigates the interplay between game features, spatial ability, in-game performance, and perceived difficulty within BOTs: 3D programming game where players plan spatial movements to solve puzzles. In an online study with 60 players, we examined how feature changes affected performance and perceived difficulty. Spatial skills strongly predicted performance but did not predict perceived difficulty. Larger or more complex layouts increased performance costs, with backward-facing player characters producing the largest spike in performance demands. Loops reliably increased perceived difficulty. Our findings highlight concrete needs for early spatial scaffolds, clearer support for mental-model shifts, and better cues for recognizing repetition and abstraction.