Detail publikačního výsledku

Assessing Web Search Credibility and Response Groundedness in Chat Assistants

VYKOPAL, I.; PIKULIAK, M.; OSTERMANN, S.; ŠIMKO, M.

Originální název

Assessing Web Search Credibility and Response Groundedness in Chat Assistants

Anglický název

Assessing Web Search Credibility and Response Groundedness in Chat Assistants

Druh

Stať ve sborníku mimo WoS a Scopus

Originální abstrakt

Chat assistants increasingly integrate web search functionality, enabling them to retrieve and cite external sources. While this promises more reliable answers, it also raises the risk of amplifying misinformation from low-credibility sources. In this paper, we introduce a novel methodology for evaluating assistants’ web search behavior, focusing on source credibility and the groundedness of responses with respect to cited sources. Using 100 claims across five misinformation-prone topics, we assess GPT-4o, GPT-5, Perplexity, and Qwen Chat. Our findings reveal differences between the assistants, with Perplexity achieving the highest source credibility, whereas GPT-4o exhibits elevated citation of non-credible sources on sensitive topics. This work provides the first systematic comparison of commonly used chat assistants for fact-checking behavior, offering a foundation for evaluating AI systems in high-stakes information environments.

Anglický abstrakt

Chat assistants increasingly integrate web search functionality, enabling them to retrieve and cite external sources. While this promises more reliable answers, it also raises the risk of amplifying misinformation from low-credibility sources. In this paper, we introduce a novel methodology for evaluating assistants’ web search behavior, focusing on source credibility and the groundedness of responses with respect to cited sources. Using 100 claims across five misinformation-prone topics, we assess GPT-4o, GPT-5, Perplexity, and Qwen Chat. Our findings reveal differences between the assistants, with Perplexity achieving the highest source credibility, whereas GPT-4o exhibits elevated citation of non-credible sources on sensitive topics. This work provides the first systematic comparison of commonly used chat assistants for fact-checking behavior, offering a foundation for evaluating AI systems in high-stakes information environments.

Klíčová slova

misinformation analysis, fact-checking, web-search, retrieval, grounded response, LLM response analysis, factuality

Klíčová slova v angličtině

misinformation analysis, fact-checking, web-search, retrieval, grounded response, LLM response analysis, factuality

Autoři

VYKOPAL, I.; PIKULIAK, M.; OSTERMANN, S.; ŠIMKO, M.

Vydáno

01.01.2026

Nakladatel

Association for Computational Linguistics

Místo

Stroudsburg, PA, USA

ISBN

979-8-89176-380-7

Kniha

Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)

Strany od

2539

Strany do

2560

Strany počet

22

URL

BibTex

@inproceedings{BUT201837,
  author="Ivan {Vykopal} and  {} and  {} and Marián {Šimko}",
  title="Assessing Web Search Credibility and Response Groundedness in Chat Assistants",
  booktitle="Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)",
  year="2026",
  pages="2539--2560",
  publisher="Association for Computational Linguistics",
  address="Stroudsburg, PA, USA",
  doi="10.18653/v1/2026.eacl-long.115",
  isbn="979-8-89176-380-7",
  url="https://aclanthology.org/2026.eacl-long.115/"
}