【Googleの革新：テキストからの画像生成技術】英語解説を日本語で読む【2023年10月14日｜@AI Revolution】

2023年10月15日 12:09

Googleは新機能「SGE」を発表し、テキストプロンプトから直接画像を生成することができるようになりました。この機能は「Imagin」というテキストから画像への拡散モデルに基づき、Googleの大規模言語モデル「LaMDA」で動作します。ユーザーは100以上の言語で指示を入力し、さまざまな画像スタイルを選択することができます。SGEは他のAIツールに比べて優れており、Google検索に統合されています。ただし、まだ実験的な段階にあり、制限や注意点が設けられています。
公開日：2023年10月14日
※動画を再生してから読むのがオススメです。

So, Google has just launched a new feature that lets you generate images directly from the search bar using text prompts.

さて、Googleはテキストプロンプトを使用して検索バーから直接画像を生成できる新機能を発表しました。

This part of their search generative experience (SGE) initiative aims to make searches more creative.

これは彼らの検索生成体験（SGE）イニシアティブの一部で、検索をよりクリエイティブにすることを目指しています。

SGE, powered by imagin, a text to image diffusion model, creates realistic images understanding text deeply.

SGEはimaginというテキストから画像への変換モデルを搭載しており、テキストを深く理解してリアルな画像を生成します。

Imagin operates on Google's LaMDA, a large language model managing natural conversational language.

ImaginはGoogleのLaMDAで動作し、自然な会話言語を扱う大規模な言語モデルです。

Now, SGE accommodates over 100 languages, so feel free to use any language you prefer.

今、SGEは100以上の言語に対応しており、お好きな言語を自由に使用できます。

You have the option to select the image's aspect ratio, like square, portrait, or landscape, and pick from various image styles, including 3D, oil painting, or cartoon.

画像のアスペクト比を選択したり、正方形、縦長、または横長のような異なるスタイルの画像を選択したりするオプションがあります。

Suppose you want to create a picture of a giant cobra snake on a farm, with the snake composed of corn.

たとえば、巨大なコーンのヘビで構成された農場の絵を作成したい場合、詳細を検索バーに入力してエンターキーを押すだけです。

Just type these details into the search bar and hit enter.

生成された画像は、AIモデルがプロンプトをどれだけ理解してリアルなコーンヘビを典型的な農場の背景に生成したかを示します。

The resulting image showcases how well the AI model grasped the prompt, generating a realistic yet corn snake amidst a typical farm setting.

生成された画像は、AIモデルがプロンプトをどれだけうまく把握しているかを示しており、典型的な農場の風景の中で現実的なコーンヘビが生成されています。

Should you wish to alter the image, maybe having the snake in blue or depicting the farm in winter, it's a breeze.

もしも画像を変更したい場合、例えば蛇を青くしたり、農場を冬景色に描いたりする場合は、簡単です。

Just add or change details in your query.

クエリで詳細を追加または変更するだけです。

For instance, entering a giant blue cobra snake on a farm in winter, the snake is made out of corn will give you a new image, now with a blue snake and a snow-covered farm while maintaining the corn texture.

たとえば、「巨大な青いコーンのヘビが農場で冬に作られ、ヘビはコーンでできています」と入力すると、新しい画像が表示され、青いヘビと雪が積もった農場が表示され、コーンの質感が保たれます。

SGE provides substantial control and flexibility in image generation.

SGEは画像生成においてかなりの制御と柔軟性を提供します。

The choice of words is entirely up to you, and the AI model strives to produce an image aligning with your description.

言葉の選択は完全にあなた次第であり、AIモデルはあなたの説明に合った画像を生成することを目指しています。

How does SGE stand against other AI tools for creating images?

SGEは画像を作成するための他のAIツールに対してどのように立ち向かうのでしょうか？

There are many others like Microsoft's Bing image creator with DALL·E 3, Midjourney with Stable Diffusion, Crayon with DALL·E Mini, Night Cafe with Dream by Womo, all using unique AI methods to turn text into images.

マイクロソフトのBingイメージクリエーターのDALL·E 3や、Stable Diffusionを使用したMidjourney、DALL·E Miniを使用したCrayon、Dream by Womoを使用したNight Cafeなど、すべてが独自のAIメソッドを使用してテキストを画像に変換しています。

Each has its strengths and weaknesses.

それぞれに長所と短所があります。

For instance, Bing image creator is user-friendly and lets you easily modify images once made.

たとえば、Bing画像生成は使いやすく、作成した画像を簡単に変更できます。

Midjourney delivers high-quality images and works in many languages.

Midjourneyは高品質の画像を提供し、多言語で動作します。

Crayon is easy and quick to use, plus you can download images at no cost.

Crayonは簡単で迅速に使用でき、画像を無料でダウンロードできます。

Night Cafe lets you add a fun touch by animating images with music.

Night Cafeは音楽を使用して画像をアニメーション化する楽しい要素を追加できます。

Lastly, Do E has a robust model capable of tackling complicated prompts.

最後に、Do Eは複雑なプロンプトに対処できる強力なモデルを持っています。

But I think SGE has some advantages over these tools as well.

ただし、SGEにもこれらのツールに対するいくつかの利点があると思います。

Firstly, it's part of Google search, so you don't need a separate app or website.

まず第一に、それはGoogle検索の一部であるため、別個のアプリやウェブサイトは必要ありません。

Just type your query in your browser like usual.

通常通りブラウザにクエリを入力するだけです。

Secondly, SGE utilizes imagin, a sophisticated text-to-image diffusion model, to create lifelike images with a good understanding of language.

第二に、SGEはimaginという高度なテキストから画像への変換モデルを使用して、言語をよく理解したリアルな画像を生成します。

Imagin learns from a vast amount of text and images found online, including Google search results, so it's well-informed.

ImaginはGoogle検索結果を含むオンラインで見つかる大量のテキストと画像から学習するため、情報を豊富に持っています。

It can also process multimodal queries, letting you mix text, images, and emojis in your requests.

また、マルチモーダルクエリを処理でき、リクエストにテキスト、画像、絵文字を混在させることができます。

This way, you can get very detailed and realistic images based on your descriptions.

この方法で、詳細な説明に基づいて非常に詳細かつリアルな画像を得ることができます。

Thirdly, SGE can work with Bard, Google's AI chatbot, letting you generate images while chatting with Bard.

第三に、SGEはGoogleのAIチャットボットであるBardと連携し、Bardとチャットしながら画像を生成できます。

And now, you can ask Bard to help draft written content within SGE, like requesting an article on a specific topic, with SGE providing relevant images.

そして今、BardにSGE内での文章の起草を手伝ってもらうことができます。特定のトピックに関する記事のリクエストをしたり、SGEが関連する画像を提供したりすることができます。

This combo of Bard and SGE can be a big help for content creators, writers, or students looking for some assistance or inspiration.

BardとSGEの組み合わせは、コンテンツクリエーターやライター、またはアシスタンスやインスピレーションを求めている学生にとって大きな助けになるかもしれません。

But before you start using SGE, it's important to know a few things.

SGEを使い始める前に、いくつかの重要なことを知っておくことが重要です。

Firstly, SGE is in its experimental phase, so it might have some bugs or errors.

まず第一に、SGEは実験的な段階にあるため、いくつかのバグやエラーが発生する可能性があります。

It's possible that the AI might not always understand your requests or create the image you're envisioning since it's still improving with ongoing feedback and data.

AIが常にあなたのリクエストを理解するわけではないか、またはイメージを想像通りに生成しない可能性もあるため、フィードバックとデータの持続的な改善が行われています。

Secondly, SGE has put measures in place to prevent misuse.

第二に、SGEは誤用を防ぐための措置を講じています。

It won't create images that are harmful, offensive, or misleading.

有害、攻撃的、または誤解を招く可能性のある画像は生成しません。

And it also won't create realistic images of people's faces or notable individuals without permission, to uphold privacy and prevent false information.

また、許可なく人物の顔や有名人のリアルな画像を生成しません。これはプライバシーを守り、誤った情報を防ぐためです。

Thirdly, SGE includes features for labeling and watermarking to show where the images come from, promoting transparency and accountability.

第三に、SGEには画像の出典を示すためのラベリングとウォーターマーキングの機能が含まれており、透明性と責任を促進します。

You can see the text prompt used to create the image by hovering over or clicking on it.

画像をホバーまたはクリックすることで、画像を作成するために使用されたテキストプロンプトを確認できます。

Lastly, only those who are 18 or older can use SGE, as some of the content it generates might not be suitable for younger users or might need parental supervision.

最後に、18歳以上のみがSGEを使用できます。生成されるコンテンツの一部は若年のユーザーには適していない可能性があるため、親の監督が必要かもしれません。

Knowing these points will help you as you use SGE.

これらのポイントを知っておくことは、SGEを使用する際に役立ちます。

I hope this video provides the insight you need.

このビデオが必要な情報を提供できれば幸いです。

If you enjoy it, please like it and subscribe for more.

お楽しみいただければ、いいねとチャンネル登録をお願いします。

Share your thoughts on SGE and the types of images you aim to create with it in the comments below.

SGEやそれで作成しようとしている画像の種類についてのご意見を、以下のコメントでお聞かせください。

Thank you so much for watching, and I'll see you in the next one.

ご視聴いただき、ありがとうございました。次回もお会いしましょう。

この記事が気に入ったらサポートをしてみませんか？