見出し画像

【RTX 3060】SD3 Medium を試す【ローカル】

2024年6月13日 22:39

はじめに

Stable Diffusion 3 Medium のローカル利用が可能になったので試してみました。利用した環境は下記になります。

OS：Windows 12 23H2
CPU：Ryzen5 3600
システムメモリ：32G
GPU：RTX 3060 12G
モデル：sd3_medium_incl_clips_t5xxlp16.safetensors
アプリ：ComfyUI

結果

960x1280解像度画像を生成するのに、一枚あたり31秒程度でした。

Japanese high school girl, school uniform,striking a victory pose

Japanese high school girl, school uniform, loafer, socks,

Japanese high school girl, school uniform,loafers, socks, sitting in the classrom

A young woman with long, curly brown hair and piercing green eyes sits amidst the crumbling remains of an ancient city. Her worn leather cloak lies beside her on the dusty ground, and a faint light illuminates her face from behind. The air is heavy with the scent of decay and forgotten history

Shibuya district. She wears a bold outfit consisting of ripped jeans and a black leather jacket adorned with metal studs, paired with a pair of chunky platform boots and oversized sunglasses. Her features are striking, with full lips painted a deep red color and dark eyeliner that adds depth to her eyes. The overall aesthetic is edgy and fashion-forward, capturing the essence of Shibuya's vibrant street style

まとめ

（典型的な画像生成AIの弱点は何も変化がないため）良くもなく、悪くもないという印象です。ただし、モデルが大きいため、システムメモリ 32GB だと少し厳しいので、デメリットの方が大きいかもしれません。

推論中に利用されるGPUメモリは vae 利用時に 9GB 程度、ステップ処理中は 5GB 程度でした。（960x1280解像度、lowvram mode）

ただし、リアル系の画像場合は次の理由があるので、リアルさがモデルの善し悪しの判断にはならない事に注意が必要です。

この記事が気に入ったらサポートをしてみませんか？