Probing for Knowledge Attribution in Large Language Models AI updates on arXiv.org

_ February 28, 2026_ Tech Jacks Solutions_ 0 Comments

arXiv:2602.22787v1 Announce Type: cross
Abstract: Large language models (LLMs) often generate fluent but unfounded claims, or hallucinations, which fall into two types: (i) faithfulness violations – misusing user context – and (ii) factuality violations – errors from internal knowledge. Proper mitigation depends on knowing whether a model’s answer is based on the prompt or its internal weights. This work focuses on the problem of contributive attribution: identifying the dominant knowledge source behind each output. We show that a probe, a simple linear classifier trained on model hidden representations, can reliably predict contributive attribution. For its training, we introduce AttriWiki, a self-supervised data pipeline that prompts models to recall withheld entities from memory or read them from context, generating labelled examples automatically. Probes trained on AttriWiki data reveal a strong attribution signal, achieving up to 0.96 Macro-F1 on Llama-3.1-8B, Mistral-7B, and Qwen-7B, transferring to out-of-domain benchmarks (SQuAD, WebQuestions) with 0.94-0.99 Macro-F1 without retraining. Attribution mismatches raise error rates by up to 70%, demonstrating a direct link between knowledge source confusion and unfaithful answers. Yet, models may still respond incorrectly even when attribution is correct, highlighting the need for broader detection frameworks. Read More

Author

Gallery

Contacts

Probing for Knowledge Attribution in Large Language Models AI updates on arXiv.org

Tech Jacks Solutions

Leave a comment Cancel reply

Our Address

Our Mailbox

Our Phone

Gallery

Contacts

Probing for Knowledge Attribution in Large Language Models AI updates on arXiv.org

Tech Jacks Solutions

Cost-of-Pass: An Economic Framework for Evaluating Language Models AI updates on arXiv.org

Thousands of Public Google Cloud API Keys Exposed with Gemini Access After API Enablement The Hacker Newsinfo@thehackernews.com (The Hacker News)

Leave a comment Cancel reply

Our Address

Our Mailbox

Our Phone