Search

Home
News
People
Publications
Gallary
Contact

Yuexiang Xie

Latest

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
$\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$

© 2026 USTC LDS All rights reserved

Published with Wowchemy — the free, open source website builder that empowers creators.

Cite