https://github.com/andyed/approach-retreat

Cursor approach-retreat dynamics on search result pages. SERP-specific companion to ClickSense.
Host: GitHub
URL: https://github.com/andyed/approach-retreat
Owner: andyed
License: mit
Created: 2026-04-08T03:15:46.000Z (4 months ago)
Default Branch: main
Last Pushed: 2026-05-02T03:34:20.000Z (3 months ago)
Last Synced: 2026-05-02T05:29:50.217Z (3 months ago)
Topics: cognitive-modeling, information-retrieval, mouse-tracking, relevance-f
Language: HTML
Homepage: https://andyed.github.io/approach-retreat/
Size: 79.9 MB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
README

# Approach/Retreat

Tells your SERP what users *did* with each result, beyond clicks and
without an eye tracker. Drop-in instrumentation for ranked-list pages:
search result pages, recommendation feeds, comparison tables, product grids.

Two channels:

1. **Approach-retreat episodes** *(desktop, cursor)*: per-result enter /
   dwell / exit behaviour, classified into a four-class taxonomy:
   **clicked**, **deferred** (considered, returned to, eventually skipped),
   **evaluated-rejected** (approached, decided against), and
   **not-approached**. The first three map directly onto the (0/1/2)
   graded-relevance vocabulary that learning-to-rank consumes natively
   (clicked = 2, deferred = 1, evaluated-rejected = 0); not-approached
   carries no evidence and is excluded. Also produces the seven-feature
   M4-7 vector used by the click-prediction and deferred-class classifiers.

2. **Viewport dynamics** *(any device, scroll-only)*: per-AOI residence,
   MRC/IAB viewability, and scroll kinematics. No cursor required; works
   wherever `scroll` events + DOM bounding boxes are available, including
   mobile and feed surfaces.

Sister library to [ClickSense](https://github.com/andyed/clicksense), which
captures the per-click commitment moment. Approach-retreat captures the
evaluation phase that precedes it. For the research backing (task model,
four-class taxonomy derivation, validation against AdSERP and ACD), see
[`docs/research.md`](docs/research.md).

## See it in action

Three AdSERP trials replayed against the original screenshots. The labels
(CLK / DEF / REJ / NA) were inferred from cursor episodes alone, with no
gaze data at inference time. Boxes are AOIs from the dataset; labels are
this library's output.


- *Canonical rejected:* DEF 9 · REJ 4
- *Multi-AOI drama:* CLK 1 · DEF 9 · REJ 1 · NA 4
- *Canonical deferred:* DEF 11 · REJ 1

Backgrounds are raw AdSERP screenshots: what the participant saw, pixel
for pixel. Full replay index (86 curated trials):
[andyed.github.io/approach-retreat/replay/](https://andyed.github.io/approach-retreat/replay/).

The companion viewer at
[andyed.github.io/attentional-foraging/](https://andyed.github.io/attentional-foraging/)
renders the same trials through a foveated-perception simulator (showing
what the participant could *resolve* at each fixation). Different view of
the same data.

## Run it yourself

Two live deployments of the library you can poke at right now. Same code,
different surfaces: clone the repo, fork the demos, or just open a URL
and press `d` to watch the debug overlay light up as you move your cursor.

**[andyed.github.io/approach-retreat/](https://andyed.github.io/approach-retreat/)**:
5 SERP layouts × 4 Q&A queries, 20 bookmarkable combinations. Pick a
layout, pick a question, browse the answers like a search engine. Press
`d` on any SERP for the in-page debug overlay showing live episode
classification per result. Same `ar_episode` / `ar_click` /
`ar_session_summary` schema across all 20: the layout is the variable,
the instrumentation is the constant.

**[movies.mindbendingpixels.com](https://movies.mindbendingpixels.com)**:
the same library running in a different domain (a film-recommendation
mission flow) against the same PostHog event contract. Portability
proof: v0.2.1 and v0.3.0 of this library shipped data-correctness fixes
discovered on that deployment (see [CHANGELOG](CHANGELOG.md)).

Telemetry is live on both. Append `?ph=0` to any URL to opt out of
capture. All site source is in [`site/`](site/): fork it, drop in your
own results, ship your own deployment.

---

## Install

```bash
npm install andyed/approach-retreat
```

Or via script tag:

```html

```

## Quick start

```js
import { ApproachRetreat } from 'approach-retreat';

const ar = new ApproachRetreat({
  resultSelector: '[data-result]',
  onEpisode: (episode) => console.log(episode),
  onClick: ({ position, episode }) => {
    console.log(`Clicked position ${position} after ${episode.dwell_ms}ms`);
  },
});
```

Mark your results:

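```html
<!-- Tag names are illustrative; the attribute matches resultSelector above. -->
<div data-result>
  <h3>Result title</h3>
  <p>Snippet text...</p>
</div>
```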

### Tag the surface type with `data-etype` (recommended)

If your SERP mixes ads and organics in the result column, tag each so your
dashboard can slice ad-vs-organic behaviour:

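```html
<!-- Tag names are illustrative; data-etype values are the conventional ones. -->
<div data-result data-etype="organic">...</div>
<div data-result data-etype="dd_top">...</div>
<div data-result data-etype="native_ad">...</div>
```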

Conventional values: `organic` (first-class result), `dd_top` (top-of-page
ad carousel cell), `native_ad` (inline-text ad). Any `data-*` attribute is
passed through to PostHog as a `target_data_`-prefixed property. The
library is etype-agnostic at the machinery level: what you give up by
going untagged is dashboard-level slicing, not capture.

## What the library emits

### Episode (cursor channel)

Every completed visit to a result emits a 23-field episode from
`Episode.toJSON()` (19 cursor fields + 4 banded-viewport fields, the
latter null when `trackViewportBands: false`).

```js
{
  // --- Identity + outcome ---
  position: 2,
  outcome: 'deferred',          // clicked | deferred | evaluated_rejected | not_approached
  visited: true,
  clicked: false,
  retreated: true,
  visit_number: 2,              // 1 = first visit, 2+ = re-approach

  // --- Timing (ms, performance.now() base) ---
  dwell_ms: 847,
  entered_at: 1412.38,
  exited_at: 2259.77,
  clicked_at: null,

  // --- Cursor dynamics ---
  approach_velocity: 0.34,      // px/ms at entry
  approach_angle: 1.21,         // radians, atan2(vy, vx) at entry
  peak_velocity: 0.89,
  min_velocity: 0.02,
  retreat_distance: 186,        // px from AOI center at max retreat
  sample_count: 51,

  // --- Scroll context ---
  direction: 'forward',         // forward | regressive
  entry_scroll: 420,
  hwm_at_entry: 420,
}
```
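
Episodes for the same position can be rolled up into per-result signals. The sketch below is not the library's `getSignals()` implementation; it relies only on episode fields documented above (`position`, `dwell_ms`, `retreat_distance`, `clicked`), and `summarizeEpisodes` is a hypothetical helper name.

```js
// Hypothetical helper: roll finalized episodes up into one record per position.
// Uses only documented episode fields; not the library's getSignals().
function summarizeEpisodes(episodes) {
  const byPosition = new Map();
  for (const ep of episodes) {
    let s = byPosition.get(ep.position);
    if (!s) {
      s = { position: ep.position, visits: 0, total_dwell_ms: 0,
            retreat_sum: 0, clicked: false };
      byPosition.set(ep.position, s);
    }
    s.visits += 1;
    s.total_dwell_ms += ep.dwell_ms;
    s.retreat_sum += ep.retreat_distance;
    s.clicked = s.clicked || ep.clicked;
  }
  // Emit in rank order and replace the running sum with a mean.
  return [...byPosition.values()]
    .sort((a, b) => a.position - b.position)
    .map(({ retreat_sum, ...rest }) => ({
      ...rest,
      mean_retreat_distance: retreat_sum / rest.visits,
    }));
}

const summary = summarizeEpisodes([
  { position: 2, dwell_ms: 300, retreat_distance: 100, clicked: false },
  { position: 2, dwell_ms: 847, retreat_distance: 186, clicked: false },
  { position: 0, dwell_ms: 500, retreat_distance: 40, clicked: true },
]);
console.log(summary);
```

In a page you would pass `ar.getEpisodes()` instead of the literal array.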

#### Raw trajectory (opt-in)

Set `includeSamplesInEpisodeJson: true` to add a `samples` array (one
`{x,y,t,vx,vy}` per native mousemove sample, ~60 Hz). Research-grade
material: keep it local unless you're shipping it through the PostHog
adapter.

### Viewport analytics (cursor-free channel)

One record per AOI per session, computed from scroll events plus DOM
bounding boxes alone. Runs anywhere `scroll` is logged.

```js
ar.getViewportAnalytics();
// [{
//   position: 0,
//   // Impression (MRC/IAB)
//   iab_viewable: true,            // ≥ 50% pixels visible for ≥ 1s continuous
//   ms_at_50pct_or_more: 2400,
//   // Residence (continuous)
//   vt_any_ms: 6200,
//   vt_center_ms: 1800,
//   avg_viewport_y_px: 340,
//   max_overlap_frac: 1.0,
//   // Kinematics (scroll trajectory while visible)
//   min_abs_velocity_px_per_s: 0,
//   n_reversals: 2,
// }, ...]
```

| Tier | Field | Meaning |
|---|---|---|
| Impression (MRC/IAB) | `iab_viewable` | True iff ≥ 50% pixel overlap held for ≥ 1 continuous second. Display rule. |
| Impression | `ms_at_50pct_or_more` | Cumulative ms at ≥ 50% overlap, no continuity constraint. |
| Residence | `vt_any_ms` | Cumulative ms with any viewport overlap. "Did the user ever see it?" |
| Residence | `vt_center_ms` | Cumulative ms with AOI center within ±100 px of viewport center. |
| Residence | `avg_viewport_y_px` | Mean AOI-center viewport-y during visibility. |
| Impression / peak | `max_overlap_frac` | Peak fraction visible. 1.0 = fully in view at some point. |
| Kinematics | `min_abs_velocity_px_per_s` | Slowest scroll speed while AOI was visible. Stabilization marker. |
| Kinematics | `n_reversals` | Scroll-direction reversals while AOI was visible. EWM-reload signal. |

**Banded decomposition** (`vp_top_ms` / `vp_mid_ms` / `vp_bot_ms`) is also
available via `ar.getViewportBands()`. It is retained for dashboard
heatmaps but adds no detectable AUC on top of the continuous six (see
research index for sourcing).

**Config:** `trackViewportAnalytics` (default `true`),
`viewportCenterTolPx` (default 100), `iabViewableThresholdMs` (default
1000 for the MRC display rule; set 2000 for video).
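
The difference between `iab_viewable` (continuous run) and `ms_at_50pct_or_more` (cumulative) is easiest to see on a small trace. The following is a minimal sketch, not the library's implementation: `impressionMetrics` and the `{ t, overlap }` sample shape are hypothetical, and each sample's overlap fraction is assumed to hold until the next sample.

```js
// Sketch: derive the two impression fields from timestamped overlap samples.
// Assumes piecewise-constant overlap between samples; the final sample
// contributes no duration. Names and input shape are illustrative only.
function impressionMetrics(samples, { thresholdMs = 1000, minFrac = 0.5 } = {}) {
  let cumulativeMs = 0;   // total time at >= minFrac, no continuity constraint
  let runMs = 0;          // length of the current continuous run
  let longestRunMs = 0;
  for (let i = 0; i + 1 < samples.length; i++) {
    const dt = samples[i + 1].t - samples[i].t;
    if (samples[i].overlap >= minFrac) {
      cumulativeMs += dt;
      runMs += dt;
      longestRunMs = Math.max(longestRunMs, runMs);
    } else {
      runMs = 0;          // continuity broken
    }
  }
  return {
    iab_viewable: longestRunMs >= thresholdMs,
    ms_at_50pct_or_more: cumulativeMs,
  };
}

const interrupted = impressionMetrics([
  { t: 0, overlap: 0.6 },
  { t: 800, overlap: 0.2 },   // first run ends after 800 ms
  { t: 1000, overlap: 0.9 },
  { t: 1700, overlap: 0.0 },  // second run lasts 700 ms
  { t: 2000, overlap: 0.0 },
]);
const steady = impressionMetrics([
  { t: 0, overlap: 0.5 },
  { t: 1000, overlap: 0.0 },
]);
console.log(interrupted, steady);
```

The interrupted trace accumulates 1500 ms at or above 50% overlap yet never holds it for a continuous second, so it is not viewable under the display rule; the steady trace is.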

### Library-side classification + signals

```js
ar.classify();
// { clicked: [{position, ...}], deferred: [...],
//   evaluated_rejected: [...], not_approached: [...] }

ar.getSignals();
// [{ position, outcome, total_dwell_ms, mean_retreat_distance, ... }, ...]

ar.getEpisodes();   // full list, one entry per finalized visit
ar.flush();         // finalize in-flight episodes without clearing history
```

### Canonical seven-feature M4-7 vector

`ar.getApproachFeatures()` emits the canonical feature vector consumed by
the click-prediction (M3) and deferred-class (M5) classifiers. One vector
per result position per session. The companion paper (submitted)
documents the click-buffer leakage screen that distinguishes the seven
buffer-robust features from `final_dist` and `retreat_dist`; see
[`docs/research.md`](docs/research.md) for the deployment caveat.







*Every feature is a geometric property of one deliberation episode's
cursor-to-AOI distance `d(t)`: no gaze, no click. **(a) Commitment**,
proximity: `min_dist`, `mean_dist`, `dwell_in_proximity_ms`.
**(b) Decisiveness**, approach rate: `mean_approach_velocity`,
`max_approach_velocity`. **(c) Vacillation**, monotonicity:
`direction_changes`, `frac_decreasing`. Synthetic trace.*

```js
ar.getApproachFeatures();
// [
//   { position: 0,
//     min_dist: 2.0,
//     mean_dist: 143.15,
//     dwell_in_proximity_ms: 466,
//     mean_approach_velocity: 238,
//     max_approach_velocity: 966,
//     direction_changes: 11,
//     frac_decreasing: 0.62,
//     // Caveat fields: see research index
//     final_dist: 200.0,
//     retreat_dist: 198.0,
//     sample_count: 64 },
//   { position: 1, ... },
// ]
```
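
To make the three feature groups concrete, here is a rough sketch that computes all seven from a distance trace. The formulas are assumptions chosen to match the feature names, not the library's definitions: the proximity radius (`PROXIMITY_PX`), the px/s velocity unit, and the `{ t, d }` input shape are all hypothetical. It expects a non-empty trace.

```js
// Sketch of the seven features from a distance trace [{ t, d }, ...]
// (t in ms, d in px). Formulas and PROXIMITY_PX are illustrative
// assumptions, not the library's definitions.
const PROXIMITY_PX = 50; // hypothetical proximity radius

function approachFeatures(trace) {
  let sumDist = 0, minDist = Infinity, dwellMs = 0;
  let decreasing = 0, changes = 0, prevSign = 0;
  const approachSpeeds = []; // px/s, only over steps where d shrinks
  for (let i = 0; i < trace.length; i++) {
    const { t, d } = trace[i];
    sumDist += d;
    minDist = Math.min(minDist, d);
    if (i === 0) continue;
    const dt = t - trace[i - 1].t;
    const dd = d - trace[i - 1].d;
    // Time counts as "in proximity" when the step starts inside the radius.
    if (trace[i - 1].d <= PROXIMITY_PX) dwellMs += dt;
    const sign = Math.sign(dd);
    if (sign < 0) {
      decreasing += 1;
      if (dt > 0) approachSpeeds.push((-dd / dt) * 1000);
    }
    if (sign !== 0) {
      // A direction change is a flip between approaching and retreating.
      if (prevSign !== 0 && sign !== prevSign) changes += 1;
      prevSign = sign;
    }
  }
  const steps = Math.max(trace.length - 1, 1);
  const mean = (xs) => (xs.length ? xs.reduce((a, b) => a + b, 0) / xs.length : 0);
  return {
    min_dist: minDist,
    mean_dist: sumDist / trace.length,
    dwell_in_proximity_ms: dwellMs,
    mean_approach_velocity: mean(approachSpeeds),
    max_approach_velocity: approachSpeeds.length ? Math.max(...approachSpeeds) : 0,
    direction_changes: changes,
    frac_decreasing: decreasing / steps,
  };
}

const features = approachFeatures([
  { t: 0, d: 200 }, { t: 100, d: 100 }, { t: 200, d: 40 },
  { t: 300, d: 60 }, { t: 400, d: 20 },
]);
console.log(features);
```

Because every output is an aggregate over the whole episode, thinning the trace changes the inputs but not the shape of the computation.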

### Sampling rate (`maxSampleHz`)

The cursor feature path runs on `mousemove`, which fires at ~60 Hz (more
on high-refresh displays). Each event accumulates the approach features
and does an over-result hit-test that forces one synchronous layout read
per result: ~60×/s of forced layout for as long as the page is open.

`maxSampleHz` caps that. It defaults to **15**: a fixed-rate throttle
that keeps at most one `mousemove` per `1000 / maxSampleHz` ms (~66.7 ms
at 15 Hz) and drops the rest before any state is touched. This is
*uniform time-decimation*: a kept event is processed exactly as at full
rate; velocity and Δt are simply measured over the wider gap.

```js
const ar = new ApproachRetreat({
  maxSampleHz: 15,   // default: cap the mousemove feature path at 15 Hz
});

// Disable the throttle (process every native event) for research /
// replication against a native-rate-trained model:
const arFullRate = new ApproachRetreat({ maxSampleHz: 0 }); // or Infinity
```

The §5.1 cursor sampling-rate ablation downsampled the AdSERP cursor
stream from ~59 Hz to 1 Hz and re-ran the M4 LOSO click-prediction: AUC
held flat at 0.847 ± 0.001 across the whole range. The seven approach
features are per-episode aggregates and rate-invariant by construction,
so 15 Hz carries large accuracy headroom while cutting the layout cost
~4×. `click` is a separate listener and is **never** throttled.
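
The fixed-rate throttle described in this section can be sketched in a few lines. This is an illustration of uniform time-decimation under stated assumptions, not the library's internal code; `makeSampleGate` is a hypothetical name, and the simulated timestamps stand in for `mousemove` event times.

```js
// Sketch of a fixed-rate throttle: keep at most one event per
// 1000 / maxSampleHz ms and drop the rest. 0 or Infinity disables it.
function makeSampleGate(maxSampleHz) {
  const disabled = !maxSampleHz || !Number.isFinite(maxSampleHz);
  const minGapMs = disabled ? 0 : 1000 / maxSampleHz;
  let lastKept = -Infinity;
  return function shouldKeep(tMs) {
    if (tMs - lastKept < minGapMs) return false; // dropped, no state touched
    lastKept = tMs;
    return true;
  };
}

// Simulate one second of 100 Hz events (one every 10 ms).
const timestamps = Array.from({ length: 100 }, (_, i) => i * 10);
const kept = timestamps.filter(makeSampleGate(20));   // 50 ms minimum gap
const keptAll = timestamps.filter(makeSampleGate(0)); // throttle disabled
console.log(kept.length, keptAll.length);
```

A kept event is handled exactly as it would be at full rate; only the gap between consecutive kept events widens.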

## Sending events to your analytics

### PostHog adapter (bundled)

Three event types, all `ar_`-prefixed:

| Event | Fires on | Key fields |
|---|---|---|
| `ar_episode` | every finalized episode | the 19 cursor fields + optional `ar_trajectory` (10% sample rate by default) |
| `ar_click` | every click on a result | pre-click velocity, angle, direction, retreat distance, dwell |
| `ar_session_summary` | `visibilitychange` / `pagehide` | four-class counts, positions per class, time-to-first-click |

Every event is merged with session context: `ar_session_id`, `ar_layout`,
`ar_query_id`, viewport (`w`, `h`, `dpr`), UA, referrer, page path.

**Kill switch.** Append `?ph=0` to any URL to skip PostHog entirely.

`ar_click` carries the ClickSense v0.2 target vocabulary (`target_tag` /
`target_id` / `target_label` / `target_href` / `target_text` /
`target_aria_label` / `target_title` / `target_name` / `target_path` /
`target_data_`-prefixed properties), so you can JOIN
`click_confidence ↔ ar_click` on `target_href` or `target_name` when both
libraries run on the same page.
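
If you export both event streams and want to try the join outside your analytics tool, a minimal sketch looks like this. The flat event shape and the `id` fields are assumptions about your export, and `joinOnHref` is a hypothetical helper; a production join would normally also constrain on session and time.

```js
// Sketch: inner-join exported ClickSense and approach-retreat click events
// on target_href. The flat event shape is an assumption about the export.
function joinOnHref(clickConfidenceEvents, arClickEvents) {
  const byHref = new Map();
  for (const ev of clickConfidenceEvents) {
    if (!ev.target_href) continue;              // nothing to join on
    if (!byHref.has(ev.target_href)) byHref.set(ev.target_href, []);
    byHref.get(ev.target_href).push(ev);
  }
  const joined = [];
  for (const ar of arClickEvents) {
    for (const cs of byHref.get(ar.target_href) ?? []) {
      joined.push({ target_href: ar.target_href, click_confidence: cs, ar_click: ar });
    }
  }
  return joined;
}

const joined = joinOnHref(
  [{ id: 'cs1', target_href: '/a' }, { id: 'cs2', target_href: '/b' }],
  [{ id: 'ar1', target_href: '/a' }, { id: 'ar2', target_href: '/c' }],
);
console.log(joined);
```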

### Other adapters

- `approach-retreat/adapters/posthog`: PostHog event flattening.
- `approach-retreat/adapters/callback`: buffer + flush (`sendBeacon`,
  custom transport).
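
The buffer + flush pattern can be illustrated independently of the adapter. The sketch below is not the callback adapter's API: `createBufferedSink`, `maxBatch`, and the injected `send` function are hypothetical names chosen so the example runs without a browser.

```js
// Sketch of a buffer-and-flush sink. `send` is injected so the same code can
// wrap navigator.sendBeacon in a browser or a stub in tests.
function createBufferedSink({ send, maxBatch = 20 }) {
  let buffer = [];
  function flush() {
    if (buffer.length === 0) return;
    const batch = buffer;
    buffer = [];                    // swap first so re-entrant pushes are safe
    send(JSON.stringify(batch));
  }
  function push(event) {
    buffer.push(event);
    if (buffer.length >= maxBatch) flush();
  }
  return { push, flush };
}

const sent = [];
const sink = createBufferedSink({ send: (body) => sent.push(body), maxBatch: 2 });
sink.push({ event: 'ar_episode', position: 0 });
sink.push({ event: 'ar_episode', position: 1 }); // reaches maxBatch, flushes
sink.push({ event: 'ar_click', position: 1 });
sink.flush();                                    // flush the remainder
console.log(sent.length);
```

In a browser, `send` could be `(body) => navigator.sendBeacon('/collect', body)` (with `/collect` standing in for your own endpoint), with `sink.flush` called from a `pagehide` listener.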

## Composing with ClickSense

Both libraries run on the same page without conflict. ClickSense
captures the commitment moment (per-click confidence); approach-retreat
captures the evaluation phase that precedes it.

```js
import { ClickSense } from 'clicksense';
import { ApproachRetreat } from 'approach-retreat';

const cs = new ClickSense({
  enableApproachDynamics: true,
  onCapture: (capture) => console.log(capture),   // replace with your handler
});
const ar = new ApproachRetreat({
  resultSelector: '[data-result]',
  onEpisode: (episode) => console.log(episode),   // replace with your handler
});
```

## Relevance scoring

```js
const scores = ar.computeRelevance();
// [{ position: 0, score: 0.72, signals: {...} }, ...]
```

Default weights: dwell time (40%), re-approaches (30%), clicks (30%),
minus a small penalty for repeated retreats. The four-class taxonomy maps
cleanly onto the (0/1/2) graded-relevance vocabulary that learning-to-rank
consumes natively (clicked = 2, deferred = 1, evaluated-rejected = 0;
not-approached excluded as no-evidence).
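
As a minimal sketch of that mapping, the function below turns an object shaped like the `ar.classify()` output into label rows. `toGradedLabels` and the `query_id` field are hypothetical; only the grade values come from the mapping stated above.

```js
// Sketch: turn the four-class output into 0/1/2 graded-relevance labels.
// Input mirrors the shape shown for ar.classify(); not_approached is
// skipped because it carries no evidence.
const GRADES = { clicked: 2, deferred: 1, evaluated_rejected: 0 };

function toGradedLabels(classified, queryId) {
  const rows = [];
  for (const [outcome, grade] of Object.entries(GRADES)) {
    for (const item of classified[outcome] ?? []) {
      rows.push({ query_id: queryId, position: item.position, grade });
    }
  }
  return rows.sort((a, b) => a.position - b.position);
}

const labels = toGradedLabels({
  clicked: [{ position: 1 }],
  deferred: [{ position: 0 }, { position: 3 }],
  evaluated_rejected: [{ position: 2 }],
  not_approached: [{ position: 4 }],
}, 'q1');
console.log(labels);
```

Position 4 produces no row, so a ranker trained on these labels never treats an unseen result as a negative.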

## Privacy

The library captures cursor + scroll events that the page's own
JavaScript already has access to, so it needs no new permissions. Raw
trajectory samples are opt-in (`includeSamplesInEpisodeJson: true`);
without that flag, only aggregate-per-episode statistics leave the browser.

For deployment-grade privacy posture (consent, retention, opt-out),
follow your existing PostHog (or other analytics) configuration. The
library does not introduce a new data plane; it adds events to the one
you already operate.

For a published treatment of the same telemetry primitives' privacy
implications, see Leiva, Arapakis & Iordanou, "My Mouse, My Rules"
(CHIIR 2021).

---

## For researchers

The library is the runnable form of the cognitive task model in the
companion paper (submitted). The full research index (task model
derivation, four-class taxonomy validation, click-buffer leakage screen,
LAB / WILD numbers with provenance, foundation-model rebuttal, and the
Leiva/Arapakis lineage) lives at
**[`docs/research.md`](docs/research.md)**.

Ancillary docs:

- [`docs/theory.md`](docs/theory.md): concise theoretical writeup.
- [`docs/one-pager.md`](docs/one-pager.md): task model vs 638-feature bag,
  four-class taxonomy, retreat geometry as deliberation indicator.
- [`docs/positioning.md`](docs/positioning.md): four-lane map of related
  work.
- [`docs/bbox-attribution-lineage.md`](docs/bbox-attribution-lineage.md):
  sequence of AOI extraction flavors (band → bbox-organic → typed →
  typed_gapfill → cellsplit), which is in use, and how to read flavor
  tags in cited numbers. **Read before citing any AUC from this repo.**
- [`docs/history.md`](docs/history.md): Lucidity 2001 → Optimoz 2001 →
  Uzilla 2003 → ClickSense 2026 → approach-retreat 2026 lineage with
  Slashdot front-page screenshot.
- [`docs/validation/attcur-bruckner.md`](docs/validation/attcur-bruckner.md):
  public head-to-head against Brückner, Arapakis & Leiva (SIGIR 2021).
  Approach-retreat features beat the scalar mouse-length baseline by
  +0.125 AUC (0.821 vs 0.696) on their own ad-click-prediction benchmark.
- [`docs/validation/m5-calibration.md`](docs/validation/m5-calibration.md):
  end-to-end calibration methodology for the deferred-class detector.
- [`docs/validation/viewport-bands-calibration.md`](docs/validation/viewport-bands-calibration.md):
  bootstrap protocol for the retreat + bands AUC.
- [`docs/validation/feature-ablation-cross-stage.md`](docs/validation/feature-ablation-cross-stage.md):
  full LOFO + group ablation matrix across the paper's four modeling
  stages (click classifier, deferred classifier, three LambdaMART
  rankers). The cross-stage view that the paper's §4.1/§4.3/§4.6
  paragraphs imply but couldn't fit in the page budget.

> **AllSERP companion paper.** *AllSERP: Exhaustive Per-Element Enrichment
> of the Versatile AdSERP Dataset*, [arXiv:2605.04949](https://arxiv.org/abs/2605.04949)
> (2026). Documents the typed AOI extraction used here for AOI labels in
> the replay viewer. Local PDF: [`allserp-paper.pdf`](./allserp-paper.pdf).

## License

MIT