May 9, 2024Open Access

OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs

Key Points

Key points are not available for this paper at this time.

Abstract

The increased use of large language models (LLMs) across a variety of real-world applications calls for mechanisms to verify the factual accuracy of their outputs. Difficulties lie in assessing the factuality of free-form responses in open domains. Also, different papers use disparate evaluation benchmarks and measurements, which renders them hard to compare and hampers future progress. To mitigate these issues, we propose OpenFactCheck, a unified factuality evaluation framework for LLMs. OpenFactCheck consists of three modules: (i) CUSTCHECKER allows users to easily customize an automatic fact-checker and verify the factual correctness of documents and claims, (ii) LLMEVAL, a unified evaluation framework assesses LLM's factuality ability from various perspectives fairly, and (iii) CHECKEREVAL is an extensible solution for gauging the reliability of automatic fact-checkers' verification results using human-annotated datasets. OpenFactCheck is publicly released at https://github.com/yuxiaw/OpenFactCheck.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Wang et al. (Thu,) studied this question.

www.synapsesocial.com/papers/68e6aec4b6db643587630e26 — DOI: https://doi.org/10.48550/arxiv.2405.05583

Authors

Yuxia Wang

Minghan Wang

Hasan Iqbal

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion