Ready annotator_a
Video ID: 8-UdwBkXJBI
Category: speech_dominant
As the prank reaches its climax and the friend in the light blue scarf tilts the cup upward to pour the white liquid, what specific reaction does the victim in the red shirt immediately exhibit?
A. He laughs nervously and wipes his face with a handkerchief.
B. He shoves the prankster away and chases him toward the bus.
C. His eyes go wide and his mouth agape in shock as he clutches his chest. ✓ Correct
D. He remains stoic, adjusting his glasses while ignoring the mess.
E. The visual detail in the question is incorrect
F. The audio detail in the question is incorrect
Answer timestamp: [20s-30s]; Modality: vision; Category: temporal

At the moment the curly-haired man with his hands on his hips steps forward to grab the cup, what immediate physical response does the victim in the red shirt display?
A. He laughs nervously and wipes his face with a handkerchief.
B. He shoves the prankster away and chases him toward the bus.
C. His eyes go wide and his mouth agape in shock as he clutches his chest.
D. He remains stoic, adjusting his glasses while ignoring the mess.
E. The visual detail in the question is incorrect ✓ Correct
F. The audio detail in the question is incorrect
Answer timestamp: [20s-30s]; Modality: vision; Category: temporal
Misleading Information
Category: person_action
Description: By swapping the actor (scarf-wearer vs. curly-haired man) and the action (pouring vs. grabbing), the question tests whether the model tracks who actually performs the pivotal prank. The curly-haired man is present but passive (hands on hips, laughing), so a model relying on generic 'group interaction' patterns might incorrectly attribute the action to any character in the scene.

During the interview segment where Jerry Aldini describes how each camper will stalk and kill their own bear, what distinct sound effect abruptly cuts off the conversation?
A. A loud, synthesized 8-bit video game jingle begins to play.
B. The sound of splashing liquid dominates the air before fading.
C. A loud, synthetic, low-pitched electronic buzz silences the scene. ✓ Correct
D. The background music swells into a dramatic orchestral crescendo.
E. The visual detail in the question is incorrect
F. The audio detail in the question is incorrect
Answer timestamp: [70s-80s]; Modality: audio; Category: temporal

Following the moment the interviewer asks if the children can handle the Shakespeare in the Round program, what distinct sound effect abruptly interrupts the dialogue?
A. A loud, synthesized 8-bit video game jingle begins to play.
B. The sound of splashing liquid dominates the air before fading.
C. A loud, synthetic, low-pitched electronic buzz silences the scene.
D. The background music swells into a dramatic orchestral crescendo.
E. The visual detail in the question is incorrect
F. The audio detail in the question is incorrect ✓ Correct
Answer timestamp: [70s-80s]; Modality: audio; Category: temporal
Misleading Information
Category: speech_context
Description: This swaps the specific topic of conversation (killing bears vs. Shakespeare). Jerry mentions both topics in the same interview sequence, but the sound effect (the buzz) occurs only after the 'bear' comment. A model that knows only that the interview gets interrupted, without listening for the specific trigger phrase, will fail to distinguish between these two moments.
