AI & ML interests

None defined yet.

Recent Activity

trl-lib 's collections 7

Comparing DPO with IPO and KTO
A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO.