arXiv:2510.03490v1 Announce Type: cross
Abstract: We introduce the SEER (Span-based Emotion Evidence Retrieval) Benchmark to test Large Language Models' (LLMs) ability to identify the specific spans of text that express emotion. Unlike traditional emotion recognition tasks that assign a single label to an entire sentence, SEER targets the underexplored task of emotion evidence detection: pinpointing which exact phrases convey emotion. This span-level approach is crucial for applications like empathetic dialogue and clinical support, which need to know how emotion is expressed, not just what the emotion is. SEER includes two tasks: identifying emotion evidence within a single sentence, and identifying evidence across a short passage of five consecutive sentences. It contains new annotations for both emotion and emotion evidence on 1200 real-world sentences. We evaluate 14 open-source LLMs and find that, while some models approach average human performance on single-sentence inputs, their accuracy degrades in longer passages. Our error analysis reveals key failure modes, including overreliance on emotion keywords and false positives in neutral text.
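To make the span-level task concrete, the sketch below shows one common way evidence-span predictions could be scored against gold annotations. The abstract does not specify SEER's data format or evaluation metric, so the character-offset span representation and overlap F1 used here are illustrative assumptions, not the paper's method; the example sentence and spans are invented.

```python
# Hypothetical sketch: character-overlap F1 for emotion evidence spans.
# SEER's actual format and metric are not given in the abstract; this is
# just one standard way to score span-level predictions.

from typing import List, Set, Tuple

Span = Tuple[int, int]  # (start, end) character offsets, end exclusive


def spans_to_char_set(spans: List[Span]) -> Set[int]:
    """Expand spans into the set of character positions they cover."""
    covered: Set[int] = set()
    for start, end in spans:
        covered.update(range(start, end))
    return covered


def span_overlap_f1(predicted: List[Span], gold: List[Span]) -> float:
    """Character-overlap F1 between predicted and gold evidence spans."""
    pred_chars = spans_to_char_set(predicted)
    gold_chars = spans_to_char_set(gold)
    if not pred_chars and not gold_chars:
        return 1.0  # both empty: vacuously correct
    if not pred_chars or not gold_chars:
        return 0.0  # one side empty: no overlap possible
    overlap = len(pred_chars & gold_chars)
    precision = overlap / len(pred_chars)
    recall = overlap / len(gold_chars)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    sentence = "I kept smiling all day, though the news still stung."
    # Gold evidence spans: "kept smiling all day" and "the news still stung"
    gold = [(2, 22), (31, 51)]
    # A model that only flags the emotion keyword "smiling" (a failure mode
    # of the kind the abstract describes) scores low recall.
    predicted = [(7, 14)]
    print(f"Overlap F1: {span_overlap_f1(predicted, gold):.2f}")
```

A keyword-only prediction like the one above captures only a fraction of the gold evidence characters, which is the sort of over-reliance on emotion keywords the error analysis highlights.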