CHI '26 · Honorable mention · full-paper review · confidence medium-high

Toward Independent Online Shopping of the Visually Impaired Through Voice-based Computer-Using Agent

Subin Shin , Jeesun Oh , Suhyun Kim , Seoyeon Eom , Sangwon Lee

This is a timely accessibility paper whose main value is not a new algorithm but a well-scoped qualitative demonstration that voice-based computer-using agents can change how visually impaired users shop online. The contribution is strongest as an early design and experience study, with clear limits on generalization.

Axes Lens

Rare contribution shape, typical evidence profile. The point here is not a score. It is to show what kind of claim the paper makes, and whether the evidence pattern is unusual or baseline in this 268 -review set.

Contribution shape

Knowledge form: descriptive knowledge typical · 92/268
Novelty type: empirical finding typical · 68/268
Abstraction level: practice typical · 85/268
Generalization target: user population typical · 75/268
Validation mode: qualitative study typical · 63/268

Evidence profile

Evidence strength: moderate typical · 105/268
Claim alignment: medium typical · 32/268
Overclaim risk: medium typical · 210/268

Review Summary

This paper’s significance lies in reframing online shopping accessibility around agentic, voice-mediated interaction rather than around incremental improvements to screen-reader navigation. The evidence indicates that participants could complete an end-to-end shopping journey through natural conversation with a CUA, which is a meaningful departure from the common-sense assumption that accessibility support must remain linear and page-by-page. The novelty is primarily empirical and design-oriented: the authors position the work as one of the first studies of LMM-based agents in this visually intensive shopping context, and the study is grounded in a Semi-Automatic Wizard-of-Oz protocol with 12 visually impaired participants plus debriefing interviews. That makes the paper valuable as an early descriptive account of needs, breakdowns, and design implications, not as a technical validation of a production-ready system. The limitations are important and well aligned with the evidence: the participant pool is restricted to acquired visual impairment, the setting is controlled, and the study does not address noisy environments, public-space privacy, or mobile/network variability. So the paper’s strongest contribution is to establish a plausible interaction direction and a set of accessibility-centered design implications, while its claims should be read as exploratory and context-bound rather than broadly generalizable.

What Changed

Canon before

Prior CHI accessibility work for online shopping has largely centered on screen-reader compatibility, alt text, and linear navigation support rather than voice-based agents that can directly operate graphical interfaces for independent shopping.

Departure from common sense

The paper’s core shift is that accessibility need not be limited to screen-reader-style linear access; instead, a voice-based CUA can support an end-to-end shopping journey, including searching, comparing, and purchasing, in a more autonomous and conversational way.

Actual novelty

The paper claims one of the first empirical studies of LMM-based computer-using agents for visually impaired online shopping, grounding the contribution in qualitative evidence from a Wizard-of-Oz study and design implications for disability-centered shopping support.

Evidence

The paper combines a qualitative Wizard-of-Oz study with 12 visually impaired participants and debriefing interviews to examine voice-based CUA shopping experiences. The evidence supports a descriptive contribution about needs, interaction breakdowns, and design implications rather than a technical benchmark or causal claim. The novelty is positioned as early empirical exploration of LMM-based agents in this accessibility setting.

“ Our paper contributes by presenting one of the first empirical studies exploring how LMM-based agents can facilitate accessibility in visually intensive online shopping contexts”

actual novelty · Abstract/Introduction contribution statement (embedded in 007_share-on) · confidence 0.70

“ 6 Design Implications In this section, we present implications derived from our findings to design a more inclusive and disability-centered online shopping environment with voice-based CUA for visually impaired u”

departure from common sense · Section 5.1 Discussion (embedded in 007_share-on) · confidence 0.74

“ First, our study excluded users with more diverse conditions—such as mild low vision, central vision loss, or congenital impairment—since all participants were individuals with acquired visual impairment, either completely blind or with low vision at a level approaching total blindnes”

limitation · Section 7 Limitation and Future Work (embedded in 007_share-on) · confidence 0.90

“ Thus, our study qualitatively explores the experiences and needs of visually impaired users as they shop online through voice interaction with a CUA We conducted a Semi-Automatic Wizard-of-Oz study with 12 visually impaired participants, in which they performed shopping tasks with a voice-based CUA system, followed by debriefing interviews”

validation scope · Abstract + Method/Participants/Procedures (embedded in 007_share-on) · confidence 0.78

Limits

Method limits

The study is qualitative and small-N, using a Semi-Automatic Wizard-of-Oz setup rather than a fully deployed autonomous system. The evidence supports experience-centered insights, but not performance claims about robustness, scalability, or comparative superiority across broader accessibility contexts.

Deployment limits

The paper’s implications are most directly applicable to voice-based shopping support for visually impaired users in controlled settings and may not transfer unchanged to noisy, mobile, or highly variable real-world shopping environments.

Boundary conditions

The findings are bounded by the participant pool and setting: acquired visual impairment, lab-based tasks, stable connectivity, and a voice-based CUA workflow. The paper itself notes that broader impairment profiles and environmental variability were not covered.

Position in field

This work sits at the intersection of accessibility, conversational interaction, and emerging computer-using agents. Its field contribution is to move beyond conventional screen-reader-centered shopping access toward a disability-centered interaction model for visually intensive commerce.

Abstract

Visually impaired individuals face barriers in online shopping because product details are often conveyed through images, and alternative text is frequently insufficient. The recent advent of Computer-Using Agents (CUA) based on Large Multimodal Models, which can directly manipulate graphical user interfaces, offers new opportunities for such accessibility. However, there is a lack of research that considers how voice-based systems should be designed to support visually impaired users in complex online shopping contexts. Thus, our study qualitatively explores the experiences and needs of visually impaired users as they shop online through voice interaction with a CUA We conducted a Semi-Automatic Wizard-of-Oz study with 12 visually impaired participants, in which they performed shopping tasks with a voice-based CUA system, followed by debriefing interviews. This paper proposes design implications for creating a more inclusive and disability-centered online shopping environment with voice-based CUA for visually impaired users.