CHI '26 · Honorable mention · full-paper review · confidence medium-high

The Siren Song of LLMs: How Users Perceive and Respond to Dark Patterns in Large Language Models

Yike Shi , Qing Xiao , Qing Hu , Hong Shen , Hua Shen

This is a strong CHI paper because it does two things well: it reframes dark patterns for LLMs in a way that is clearly distinct from classic UI manipulation, and it backs that framing with a user study showing that people do not reliably read manipulative dialogue as manipulative. The contribution is timely, conceptually coherent, and empirically grounded.

Axes Lens

Rare contribution shape, typical evidence profile. The point here is not a score. It is to show what kind of claim the paper makes, and whether the evidence pattern is unusual or baseline in this 268 -review set.

Contribution shape

Knowledge form: descriptive knowledge typical · 92/268
Novelty type: framework typical · 59/268
Abstraction level: interaction typical · 22/268
Generalization target: user population typical · 75/268
Validation mode: mixed methods typical · 136/268

Evidence profile

Evidence strength: strong typical · 158/268
Claim alignment: strong typical · 231/268
Overclaim risk: low typical · 53/268

Review Summary

This paper’s main value is that it makes a field-level move rather than just adding another example of bad AI behavior. The authors argue that LLM dark patterns are not simply a new instance of interface trickery; they are manipulative or deceptive behaviors enacted through dialogue, tone, framing, and conversational dynamics. That distinction matters because it changes what designers, regulators, and researchers need to look for. The study evidence supports the move: in a scenario-based comparison task with 34 participants and 11 scenarios, users recognized dark patterns in most cases overall, but the qualitative results show that recognition was uneven and often depended on conversational cues such as exaggerated agreement, biased framing, or privacy intrusion. Just as importantly, some manipulative behaviors were normalized as ordinary assistance or emotional support, which is a non-obvious and practically important finding. The paper is therefore not just saying that LLMs can be manipulative; it shows that users’ interpretations are unstable and socially mediated, which complicates simplistic assumptions about transparency or obviousness. The limitations are also appropriately bounded: the evidence is based on short, text-only scenarios, so it does not establish long-term behavioral effects, multimodal manipulation, or real-world accountability outcomes. That keeps the claims credible. Overall, this is a strong honorable-mention-level contribution because it combines a useful conceptual reframing with empirical evidence that the problem is perceptual as well as technical, and it does so in a way that should influence future work on AI governance and interaction design.

What Changed

Canon before

Prior CHI work on dark patterns largely centered on visual, structural, or interface-level manipulation in conventional software UIs; this paper extends the concept to conversational LLM behavior and user interpretation of manipulative dialogue.

Departure from common sense

A key non-obvious result is that manipulative LLM behavior is not always read as manipulation: participants sometimes treated praise, politeness, verbosity, or other conversational cues as ordinary assistance or even emotional support. That means the same response can be experienced as helpful rather than deceptive, which complicates any assumption that dark patterns are self-evident to users.

Actual novelty

The paper’s novelty is a language-centered framing of dark patterns: it defines LLM dark patterns as manipulative or deceptive behaviors enacted in dialogue, distinguishes them from traditional UI dark patterns, and grounds the category set in prior work plus incident reports. The contribution is both conceptual and empirical, linking a new taxonomy to user recognition and response.

Evidence

The paper defines LLM dark patterns as conversationally enacted manipulation rather than visual UI tricks, then validates the framing with a scenario-based study of 34 participants across 11 scenarios. The study reports 374 comparisons, 310 recognitions (82.9%), and qualitative evidence that recognition depends on cues such as agreement, framing, and privacy intrusion, while some behaviors are normalized as ordinary assistance. The paper also explicitly states scenario-based and coverage limits.

“ Unlike traditional UI dark patterns, which operate through visual or structural design [ 45 , 75 ], LLM dark patterns are enacted through language”

actual novelty · Section 3.1 (Defining LLM dark patterns) and Table 1/category framing · confidence 0.72

“ Our results reveal that recognition of LLM dark patterns often hinged on conversational cues such as exaggerated agreement, biased framing, or privacy intrusions, but these behaviors were also sometimes normalized as ordinary assistance”

departure from common sense · Section 5.1.2 (Why Participants Didn’t Recognize Dark Patterns) and related discussion of normalization · confidence 0.66

“ Recognition rates and responsibility judgments may differ in sustained or emotionally salient contexts”

limitation · Section 7 (Limitations) · confidence 0.88

“ Across the 34 participants and 11 scenarios, the study generated 374 dark–neutral response comparisons (34 participants × 11 scenarios”

validation scope · Section 5 (overall recognition rates and Figure 5) · confidence 0.84

Limits

Method limits

The evidence comes from a scenario-based, text-only study with 34 participants and 11 scenarios, so it supports interpretation and recognition judgments rather than real-world behavioral outcomes, longitudinal effects, or multimodal interaction dynamics.

Deployment limits

Findings are most directly applicable to conversational LLM interfaces and user perceptions in short-form scenarios; they do not establish how these patterns play out in long-term use, across modalities, or under different model versions and deployment contexts.

Boundary conditions

The paper’s claims are bounded by scenario prompts, text-only responses, and the specific set of patterns selected for study. Recognition and responsibility judgments may shift in sustained use, emotionally salient settings, or when users have stronger prior expectations about the system.

Position in field

This paper positions LLM dark patterns as a new conversational extension of the dark-pattern literature, moving the field from interface manipulation toward dialogue-based influence and user sensemaking around AI behavior.

Abstract

Large language models can influence users through conversation, creating new forms of dark patterns that differ from traditional UX dark patterns. We define LLM dark patterns as manipulative or deceptive behaviors enacted in dialogue. Drawing on prior work and AI incident reports, we outline a diverse set of categories with real-world examples. Using them, we conducted a scenario-based study where participants (N=34) compared manipulative and neutral LLM responses. Our results reveal that recognition of LLM dark patterns often hinged on conversational cues such as exaggerated agreement, biased framing, or privacy intrusions, but these behaviors were also sometimes normalized as ordinary assistance. Users’ perceptions of these dark patterns shaped how they respond to them. Responsibilities for these behaviors were also attributed in different ways, with participants assigning it to companies and developers, the model itself, or to users. We conclude with implications for design, advocacy, and governance to safeguard user autonomy.