12の機械学習について知るべき役に立つこと

2020年6月24日 12:57

12 Useful Things to Know about Machine Learning
12の機械学習について知るべき役に立つこと
Machine learning algorithms can figure out how to perform important tasks by generalizing from examples. This is often feasible and cost-effective where manual programming is not. As more data becomes available, more ambitious problems can be tackled. As a result, machine learning is widely used in computer sincere and other fields. However, developing successful machine learning applications requires a substantial amount of “black art” that is hard to find in textbooks.
機械学習アルゴリズムは、例から一般化することにより、重要なタスクを実行する方法を把握することができます。これは手作業のプログラミングがそうでないところで、しばしば実現可能で対費用効果的です。多くのデータが利用可能になるので、より野心的な問題に取り組むことができます。結果として、機械学習は、コンピューターはもちろん、その他のフィールドで広く利用されます。ただし、成功した機械学習アプリケーションの開発は教科書で見つけるのは難しい「黒魔術」の相当な量が必要です。

I recently read an amazing technical paper by Professor Pedro Domingos of University of Washington titled “A Few Useful Things to Know about Machine Learning.” It summarizes 12 key lessons that machine learning researchers and practitioners have learned include pitfalls to avoid, important issues to focus on, and answers to common questions. I’d like to share these lessons in this article because they are extremely useful when thinking about tackling your next machine learning problems.
私は最近、ワシントン大学のペドロドミンゴス博士著の驚くべき技術論文を読みました。そのタイトルは「機械学習について知るべき２－３の役に立つこと」です。「機械学習研究者と専門家が学んできた12 の重要な教訓は、避けるべき落とし穴や、集中すべき重要課題とよく寄せられる質問への回答を含んでいます。あなたの次の機械学習問題への取り組みについて考えるとき、それらは非常に役立つので、この記事でこれらの教訓を共有したいと思います。

1. Learning = Representation + Evaluation + Optimization
学習 = 表現 + 評価 + 最適化
All machine learning algorithms generally consist of combinations of just 3 components:
すべての機械学習アルゴリズムは一般的に3 つの構成要素だけの組み合わせで構成されます。
 Representation: A classifier must be represented in some formal language that the computer can handle. Conversely, choosing a representation for a learner is tantamount to choosing the set of classifiers that it can possibly learn. This set is called the hypothesis space of the learner. If a classifier is not in the hypothesis space, it cannot be learned. A related question is how to represent the input, i.e., what features to use.
表現:分類子は、コンピューターが処理できるいくつかの正式な言語で表現されなければなりません。逆に、学習者のための表現の選択はそれがおそらく学ぶことができる識別子のセットを選択するに等しいです。このセットは、学習者の仮説空間と呼ばれます。分類子が仮説空間にない場合は、学習することはできません。関連質問はその入力の表現方法、すなわち、どのような機能を使用するかです。
 Evaluation: An evaluation function is needed to distinguish good classifiers from bad ones. The evaluation function used internally by the algorithm may differ from the external one that we want the classifier to optimize, for ease of optimization and due to the issues discussed in the next section.
評価: 評価関数は、悪い分類子から良いものを区別するために必要です。アルゴリズムによって内部的に使用する評価関数は、最適化を容易にするためやは次のセクションで議論される問題のため、最適化すべき分類子を我々が望む 1 つの外部の評価関数と異なるかもしれません。
 Optimization: Finally, we need a method to search among the classifiers in the language for the highest-scoring one. The choice of optimization technique is key to the efficiency of the learner, and also helps determine the classifier produced if the evaluation function has more than one optimum. It is common for new learners to start out using off-the-shelf optimizers, which are later replaced by custom-designed ones.
最適化: 最後に、我々は最高得点のものの言語での分類子間を検索する方法が必要です。最適化手法の選択は、学習者の効率化への鍵であり、もし評価関数が1 つ以上の最適を持つなら生成される分類子を決めるたすけにもなります。新しい学習者が市販のオプティマイザーを使用し始められるのが一般的です。そのことは後でカスタム設計されたものに置き換えられます。

ここから先は

28,727字 / 1ファイル

¥ 100

期間限定 PayPay支払いすると抽選でお得に！

ログイン

この記事が気に入ったらサポートをしてみませんか？