Home
News
People
Publications
Gallary
Contact
Yuexiang Xie
Latest
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
β
-DPO: Direct Preference Optimization with Dynamic
β
Cite
×