ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented LLMs

Published in EMNLP 2024

Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Zihao Lin, et al., Hayato Yamana

Recommended citation: Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Zihao Lin, et al., Hayato Yamana. "ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented LLMs." EMNLP 2024.