Each participant response captures, for one clip:
| selected_emotion | enum (14 emotions + 'mixed' response mode) | One of the 14-emotion taxonomy, OR 'mixed_more_than_one' — which is a response mode (not an emotion) meaning the reader sees more than one at once |
| secondary_emotions | multi-enum | Required when primary = mixed_more_than_one, ≥ 2 codes. These are the actual emotions the reader sees concurrently |
| selected_cues | multi-enum (9) | Which signals the reader used; multi-select |
| confidence_rating | 1–5 Likert | Self-reported confidence in the reading |
| free_text_reasoning | text ≤ 280 chars | Optional first-person reasoning |
Each session captures, per participant per collection, immutable demographic snapshots: age, country (ISO-2), primary spoken language, English confidence (1–5), gender, the country the participant grew up in, and self-rated emotion-reading difficulty (1–4).
Each clip carries curator-assigned metadata — social_complexity (simple/moderate/complex), verbal_dependency (low/medium/high), and a target person description identifying whose emotion is being read.
Each clip optionally has one approved AI annotation (Claude Sonnet 4.6 in current configuration) with the AI's primary emotion, secondary emotions, cue selections, public-facing rationale, and ambiguity caveat.
14-emotion taxonomy
- happy_amused
- proud
- moved
- affectionate
- surprised
- disappointed
- sad
- angry_frustrated
- contempt
- anxious_nervous
- scared
- embarrassed_awkward
- confused
- neutral
9-cue taxonomy
- facial_expression
- tone_of_voice
- verbal_content
- body_language
- situation_context
- timing_pacing
- others_reaction
- something_else
- not_sure