人気の記事一覧

Are Protein Language Models Compute Optimal?

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Sakuga-42M Dataset: Scaling Up Cartoon Research

1か月前

Pretraining on the Test Set Is All You Need

1か月前

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

2週間前

Scaling MLPs: A Tale of Inductive Bias

2週間前

Observational Scaling Laws and the Predictability of Language Model Performance

4週間前

Scaling MLPs: A Tale of Inductive Bias

1か月前

Scaling Laws for Transfer

1か月前

Grandmaster-Level Chess Without Search

1か月前

The Quantization Model of Neural Scaling

1か月前