Rui Men
Latest
HellaSwag-Pro: A Large-Scale Bilingual Benchmark for Evaluating the Robustness of LLMs in Commonsense Reasoning