ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented LLMs
Published in EMNLP 2024
Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Zihao Lin, et al., Hayato Yamana
Recommended citation: Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Zihao Lin, et al., Hayato Yamana. "ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented LLMs." EMNLP 2024.
