Popular articles

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

1 month ago

Better & Faster Large Language Models via Multi-token Prediction

2 months ago