人気の記事一覧

SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

3か月前

sDPO: Don't Use Your Data All at Once

4か月前