ViLBias: Detecting and Reasoning about Bias in Multimodal Contentcs.AI updates on arXiv.org

_ October 2, 2025_ Tech Jacks Solutions_ 0 Comments

arXiv:2412.17052v4 Announce Type: replace
Abstract: Detecting bias in multimodal news requires models that reason over text–image pairs, not just classify text. In response, we present ViLBias, a VQA-style benchmark and framework for detecting and reasoning about bias in multimodal news. The dataset comprises 40,945 text–image pairs from diverse outlets, each annotated with a bias label and concise rationale using a two-stage LLM-as-annotator pipeline with hierarchical majority voting and human-in-the-loop validation. We evaluate Small Language Models (SLMs), Large Language Models (LLMs), and Vision–Language Models (VLMs) across closed-ended classification and open-ended reasoning (oVQA), and compare parameter-efficient tuning strategies. Results show that incorporating images alongside text improves detection accuracy by 3–5%, and that LLMs/VLMs better capture subtle framing and text–image inconsistencies than SLMs. Parameter-efficient methods (LoRA/QLoRA/Adapters) recover 97–99% of full fine-tuning performance with $<5%$ trainable parameters. For oVQA, reasoning accuracy spans 52–79% and faithfulness 68–89%, both improved by instruction tuning; closed accuracy correlates strongly with reasoning ($r = 0.91$). ViLBias offers a scalable benchmark and strong baselines for multimodal bias detection and rationale quality. Read More

Author

Gallery

Contacts

ViLBias: Detecting and Reasoning about Bias in Multimodal Contentcs.AI updates on arXiv.org

Tech Jacks Solutions

Leave a comment Cancel reply

Our Address

Our Mailbox

Our Phone

Gallery

Contacts

ViLBias: Detecting and Reasoning about Bias in Multimodal Contentcs.AI updates on arXiv.org

Tech Jacks Solutions

What Makes a Language Look Like Itself? Towards Data Science

A Gentle Introduction to MCP Servers and Clients KDnuggets

Leave a comment Cancel reply

Our Address

Our Mailbox

Our Phone