人気の記事一覧

Are Protein Language Models Compute Optimal?

3か月前

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

4か月前

Sakuga-42M Dataset: Scaling Up Cartoon Research

4か月前

Pretraining on the Test Set Is All You Need

5か月前

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

4か月前

Scaling MLPs: A Tale of Inductive Bias

4か月前

Observational Scaling Laws and the Predictability of Language Model Performance

4か月前

Scaling MLPs: A Tale of Inductive Bias

4か月前

Scaling Laws for Transfer

4か月前

Grandmaster-Level Chess Without Search

5か月前

The Quantization Model of Neural Scaling

5か月前