Yichang Zhang
Latest
HellaSwag-Pro: A Large-Scale Bilingual Benchmark for Evaluating the Robustness of LLMs in Commonsense Reasoning