人気の記事一覧

Are Protein Language Models Compute Optimal?

1か月前

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

1か月前

Sakuga-42M Dataset: Scaling Up Cartoon Research

2か月前

Pretraining on the Test Set Is All You Need

3か月前

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

1か月前

Scaling MLPs: A Tale of Inductive Bias

1か月前

Observational Scaling Laws and the Predictability of Language Model Performance

2か月前

Scaling MLPs: A Tale of Inductive Bias

2か月前

Scaling Laws for Transfer

2か月前

Grandmaster-Level Chess Without Search

2か月前

The Quantization Model of Neural Scaling

2か月前