Home
News
People
Publications
Gallary
Contact
Yuexiang Xie
Latest
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
$\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$
Cite
×