画像生成AI「Stable Diffusion」のLicense要約を和訳して読む

2022年8月24日 12:06

このnoteは何か

画像生成AIの一つ「Stable Diffusion」のLicense要約のブログをDeepLで和訳したnoteです。

結論をいうと
・出力する画像は基本自由に使っていいよ、でも違法/有害なことしないでね

という私は解釈をしました。しかし、私はソフトウェアエンジニアでありますがAIに関する専門家ではありません。その前提で、一つの参考として読んでいただけると嬉しいです。

Stable Diffusion

Stable Diffusion is a text-to-image latent diffusion model created by the researchers and engineers from CompVis, Stability AI and LAION. It is trained on 512x512 images from a subset of the LAION-5B database. LAION-5B is the largest, freely accessible multi-modal dataset that currently exists.

In this post, we want to show how to use Stable Diffusion with the 🧨 Diffusers library, explain how the model works and finally dive a bit deeper into how diffusers allows one to customize the image generation pipeline.

Note: It is highly recommended to have a basic understanding of how diffusion models work. If diffusion models are completely new to you, we recommend reading one of the following blog posts:

The Annotated Diffusion Model

Getting started with 🧨 Diffusers

Now, let's get started by generating some images

https://huggingface.co/blog/stable_diffusion

Stable Diffusionは、CompVis、Stability AI、LAIONの研究者と技術者によって作られたテキストから画像への潜伏拡散モデルです。LAION-5Bデータベースのサブセットから512x512の画像で学習されます。LAION-5Bは、現在存在する最大かつ自由にアクセス可能なマルチモーダルデータセットです。

この投稿では、DiffusersライブラリでStable Diffusionを使用する方法を示し、モデルがどのように動作するかを説明し、最後にDiffusersで画像生成パイプラインをカスタマイズする方法を少し深く掘り下げたいと思います。

注意：拡散モデルがどのように機能するかについて、基本的な理解を持っていることを強く推奨します。もし、拡散モデルが全く初めてであれば、以下のブログ記事のいずれかを読むことをお勧めします。

・アノテーションされた拡散モデル
・ディフューザーを使い始める
では、さっそく画像を生成してみましょう

DeepL翻訳

ここまではStable Diffusionの説明です。以下がLicenseの文章になります。

License

Before using the model, you need to accept the model license in order to download and use the weights.

The license is designed to mitigate the potential harmful effects of such a powerful machine learning system. We request users to read the license entirely and carefully. Here we offer a summary:

You can't use the model to deliberately produce nor share illegal or harmful outputs or content,

We claim no rights on the outputs you generate, you are free to use them and are accountable for their use which should not go against the provisions set in the license, and

You may re-distribute the weights and use the model commercially and/or as a service. If you do, please be aware you have to include the same use restrictions as the ones in the license and share a copy of the CreativeML OpenRAIL-M to all your users.

Note: we are concurrently developing an additional blog post focusing on the model card and license of this system.

https://huggingface.co/blog/stable_diffusion

モデルを使用する前に、ウェイトをダウンロードして使用するために、モデルのライセンスに同意する必要があります。

このライセンスは、このような強力な機械学習システムの潜在的な弊害を軽減するために設計されています。我々は、ユーザーがライセンスを完全にかつ慎重に読むことを要求します。ここでは、その要約を提供します。

1. あなたは、違法または有害な出力やコンテンツを意図的に生成したり、共有するためにモデルを使用することはできません。

2. 私たちは、あなたが生成した出力に関する権利を主張しません。あなたはそれらを自由に使用することができ、ライセンスで設定された規定に反してはならないその使用について説明責任を負います。

3. あなたは、ウェイトを再配布し、モデルを商業的および/またはサービスとして使用することができます。その場合、ライセンスにあるものと同じ使用制限を含め、CreativeML OpenRAIL-Mのコピーをあなたのすべてのユーザーに共有しなければならないことに注意してください。

注：このシステムのモデルカードとライセンスに焦点を当てた追加のブログポストを同時に開発中です。

DeepL翻訳

私の解釈

これを読んだ私の解釈としては

・このページはLicenseの原文の要約
・出力する画像は基本自由に使っていいよ、でも違法/有害なことしないでね
・学習モデルの再配布や、商用利用も可能。その場合、同じライセンスを使う & CreativeML OpenRAIL-Mのコピーを共有する

という解釈です。補足としては

・「the weights =(ウェイト)」は入力値から評価値を計算する過程の係数のことを指しているという理解です。雑な表現だと「学習済モデル」でしょうか (間違っていたらごめんなさい)
・ CreativeML OpenRAIL-MのコピーというのはLicense原文を指しているという理解です。

おまけ

このnoteのサムネは「Stable Diffusion License DeepL」というTextからStable Diffusionで生成した画像を使用しました

参考ページ

・https://huggingface.co/blog/stable_diffusion
・https://huggingface.co/spaces/CompVis/stable-diffusion-license
・https://note.com/fladdict/n/n13c1413c40de
・https://qiita.com/Naggi-Goishi/items/70e4bef5092352642ef6
・https://note.com/npaka/n/ndd549d2ce556

よろしければサポートお願いします！サポートいただいたお金はハッカソンの運営に使わせていただきたいと思います。