trans_learn:reading_group_2024_spring

Differences

This shows you the differences between two versions of the page.

Previous revision: trans_learn:reading_group_2024_spring [2024/06/19 05:25] haohuawang
Current revision: trans_learn:reading_group_2024_spring [2024/07/17 05:08] (current) chekaki
Line 45:
   * Presenter: Haohua Wang
   * Paper: Direct Preference Optimization: Your Language Model is Secretly a Reward Model
-  * Slides:{{ :trans_learn:20240619-DPO-slides.pdf |}}
+  * Slides:{{ :|}}
  
  
trans_learn/reading_group_2024_spring.1718789113.txt.gz · Last modified: 2024/06/19 05:25 by haohuawang