CHI '26 · Honorable mention · full-paper review · confidence medium-high

Just-In-Time Objectives: A General Approach for Specialized AI Interactions

Michelle S. Lam , Omar Shaikh , Hallie Xu , Alice Guo , Diyi Yang , Jeffrey Heer , James A. Landay , Michael S. Bernstein

DOI PDF Program page

This is a strong CHI systems paper because it reframes personalization around inferred, momentary objectives rather than explicit prompts. The architecture is conceptually clean, the evaluation is broader than a single demo, and the reported gains suggest the idea is practically meaningful, though latency and inference reliability remain important constraints.

Axes Lens

Rare contribution shape, typical evidence profile. The point here is not a score. It is to show what kind of claim the paper makes, and whether the evidence pattern is unusual or baseline in this 268 -review set.

Contribution shape

Knowledge form: technical knowledge typical · 50/268
Novelty type: system architecture typical · 35/268
Abstraction level: system typical · 61/268
Generalization target: task class typical · 63/268
Validation mode: mixed methods typical · 136/268

Evidence profile

Evidence strength: strong typical · 158/268
Claim alignment: strong typical · 231/268
Overclaim risk: medium typical · 210/268

Review Summary

This paper’s main contribution is not a new language model capability but a new interaction abstraction: the system infers a user’s immediate objective from observed behavior and then treats that objective as the steering signal for downstream generation and evaluation. That is a meaningful shift from the common prompt-first framing in LLM interfaces, because it makes the objective itself a first-class artifact rather than assuming users can or should articulate it up front. The evidence packet supports both the conceptual move and the implementation: the abstract explicitly describes passive objective induction and objective-conditioned steering, and the evaluation spans participant-submitted browser traces, a larger-scale experiment over many contexts, and in-person sessions on participants’ own writing tasks. The reported 66–86% win rates over typical LLMs, along with higher-quality specialized tools in use sessions, indicate that the approach is not merely plausible but empirically effective in the studied settings. At the same time, the paper is appropriately bounded by practical constraints: it currently takes 1–3 minutes to induce and apply objectives, depends on accurate inference from limited context, and raises privacy/security concerns if broader context is collected. So the strongest reading is that this is a well-grounded systems contribution with clear novelty in how it operationalizes intent, strong evidence for the targeted task family, and real deployment caveats that prevent overgeneralization.

What Changed

Canon before

Prior LLM interaction work typically relies on fixed prompts, explicit user instruction, or manually specified objectives to steer outputs; this paper shifts the objective itself into an inferred, transient object derived from observed behavior.

Departure from common sense

The paper argues against the default assumption that a user must explicitly state a stable goal before an LLM can be specialized. Instead, it infers an in-the-moment objective from behavior and uses that single objective to drive generation and evaluation, which is a notable departure from prompt-centric interaction design.

Actual novelty

The core novelty is the just-in-time objectives architecture: objectives are induced automatically from passive observation, then made actionable by steering downstream AI systems through both generation and evaluation against the inferred objective. That combination of inference plus objective-conditioned steering is the paper’s main contribution.

Evidence

The paper presents a system architecture for inducing just-in-time objectives from user behavior and using them to specialize LLM outputs. Validation spans participant-submitted browser traces, a larger-scale experiment over many input contexts, and in-person use sessions on participants’ own writing tasks. Reported outcomes include objective accuracy/usefulness and 66–86% win rates over typical LLMs, plus higher-quality specialized tools than a standard chat baseline.

“ We contribute a generalizable architecture for user-specific LLM specialization that automatically induces just-in-time (JIT) objectives from interaction traces, then applies the JIT objective to steer AI behavior by shaping LLM generation and evaluatio”

actual novelty · Abstract + Section 3 (architecture description) · confidence 0.78

“ The core architecture depends on a single upstream step of objective induction that infers user goal(s) based on observed context and instantiates each goal as a just-in-time objective , which can later be applied to optimize generation ( gen_objective ) and evaluation ( eval_objective )”

departure from common sense · Abstract + Introduction (motivation and approach) · confidence 0.80

“ First, a just-in-time approach necessarily incurs a time cost since users must wait for the objectives to be induced and applied, which currently takes 1-3 minutes”

limitation · System Limitations (Section 3.3) and related evaluation limitations · confidence 0.77

“ In a series of experiments on participants’ own tasks, JIT objectives enable LLM outputs that achieve 66–86% win rates over typical LLMs”

validation scope · Evaluation sections (Study 1/2 and in-lab sessions) · confidence 0.72

Limits

Method limits

The method depends on inferring objectives from limited behavioral context and currently incurs a 1–3 minute delay while objectives are induced and applied. The evaluation also appears centered on participants’ own tasks and writing-oriented use cases, so generality beyond those settings is not fully established.

Deployment limits

Deployment is constrained by latency, by the need to infer objectives accurately from sparse context, and by privacy/security concerns if broader context capture is used. The approach may be less suitable for time-sensitive interactions or settings where passive observation is unacceptable.

Boundary conditions

The paper’s own framing suggests the approach works best when there is enough behavioral signal to infer a meaningful in-the-moment objective and when users can tolerate a short wait. It is less clear how well it transfers to tasks with weak behavioral traces, highly dynamic goals, or contexts requiring immediate responses.

Position in field

This sits at the intersection of LLM interaction design, personalization, and adaptive tool generation. Its contribution is less about a new model and more about a new way to represent and operationalize user intent as a transient objective that can specialize downstream AI behavior.

Abstract

Large language models promise a broad set of functions, but when not given a specific objective, they default to generic results. We demonstrate that inferring the user's in-the-moment objective, then rapidly optimizing for that singular objective, enables LLMs to produce specialized tools, interfaces, and responses. Our work introduces just-in-time objectives, which model a user's goals to specialize LLM systems on the fly. We contribute an architecture for automatically inducing such objectives by passively observing user behavior, then steering downstream AI systems through generation and evaluation against this objective. Inducing just-in-time objectives (e.g., “Clarify the abstract’s research contribution”) enables automatic generation of tools, e.g., those that critique a draft based on relevant HCI methodologies, anticipate related researchers' reactions, or surface ambiguous terminology. In a series of experiments on participants' own tasks, JIT objectives enable LLM outputs that achieve 66–86% win rates over typical LLMs. In-person use sessions confirm that JIT objectives produce specialized tools that are unique to each participant and are rated as significantly higher quality than a standard LLM chat tool.