AI_Program4 のバックアップ(No.13) - PukiWiki

[ トップ ] [ 一覧 | 検索 | 履歴 | ログイン ]

私的AI研究会 > AI_Program4

生成 AI プログラミング４ == 編集中 == †

　これまで検証してきた結果をもとに、Python で生成 AI プログラムを書く

▲　目　次

生成 AI プログラミング４ == 編集中 ==
参考資料

※ 最終更新:2025/08/05　

diffusersではじめめる Stable Diffusion （応用編３） †

　画像生成のプログラムを書く

動作環境 †

このプロジェクトは以下の Anaconda 仮想環境とプロジェクト・フォルダで動作する
```
(base) PS > conda activate sd_test
(sd_test) PS > cd workspace_3/sd_test
```

概要 †

この章で作成するプログラム一覧と実行速度の目安

Step		プログラム	GPU					CPU
Step		プログラム	RTX 4070Ti	RTX 4060	RTX 4060L	RTX 3050	GTX 1050	i7-1260P
50	顔の崩れを修正する１	sd_050.py		00:03	00:10	00:05		02:12
51	顔の崩れを修正する２「ADetailer」	sd_051.py		00:03	00:03	00:05		03:35

　・単位　（時：）分：秒

Step 50：顔の崩れを修正する１ †

はじめに
・全身の画像などの顔の面積が小さいときの画像生成では顔が崩れてしまうことが多い
・顔認識を利用して顔を抽出して拡大再生成した画像を埋め込む方法を実践してみる

処理の流れ
① 元画像から顔認識パッケージ「face_recognition」で科をの領域を検出
② 顔の領域を 512x512 ピクセルサイズに拡大し、「StableDiffusionImg2ImgPipeline」で画像を生成
③ 元の画像の同じ領域に埋め込む
④ 埋め込んだ画像の周辺処理のため 8bit グレイスケールのマスク画像を作成
⑤ マスク画像の周辺をぼかした画像を使って元画像と新しく生成した画像を合成して完成

元画像顔の抽出顔の修正完成画像

周辺の補正前自動で作成したマスクマスクで補正した完成画像

顔認識パッケージ「face_recognition」について
・以前「顔認証アプリケーション基礎編」で検証した（2022/6～）→ 顔認証 (Face recognition)
・現在は「pip face_recognition」コマンドからインストール可能となっている
・Windows 環境下では同時にインストールされる「dlib」パッケージを事前にインストールしておく必要がある（2025/8 現在）
・「dlib」パッケージは「conda-forge」からインストールできる

追加のパッケージ・インストール

 conda install dlib -c conda-forge

 pip install face_recognition

プログラムを実行する（実行時間：約 2秒 RTX 4070 Ti 12GB）

 python sd_050.py

(sd_test) PS > python sd_050.py

Stable Diffusion with diffusers(050)  Ver 0.06: Starting application...

 --result_image             :   results/image_050.png
 --cpu                      :   False
 --log                      :   3
 --model_dir                :   /StabilityMatrix/Data/Models/StableDiffusion
 --model_path               :   SD1.5/beautifulRealistic_brav5.safetensors
 --image_path               :   images/sd_050_test.jpg
 --max_size                 :   0
 --prompt                   :   masterpiece, high quality, very_high_resolution, large_filesize, full color, an extremely cute face, woman, symmetrical, HDR, real, realistic
 --seed                     :   12345678
 --width                    :   512
 --height                   :   512
 --step                     :   20
 --scale                    :   8.5
 --strength                 :   0.4
 --neg_prompt               :   lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry, artist name, multiple legs, malformation

Fetching 11 files: 100%|███████████████████████████████| 11/11 [00:00<?, ?it/s]
Loading pipeline components...: 100%|████████████| 6/6 [00:00<00:00, 19.56it/s]
100%|██████████████████████████████████████████| 20/20 [00:02<00:00,  9.50it/s]
result_file: results/image_050.png

Finished.

画像ファイル「image_050.png」が生成される
実行例

元画像顔の抽出顔の修正完成画像

モジュール・ソースコード

▼「sd_050.py」

# -*- coding: utf-8 -*-
##--------------------------------------------------
##  Stable Diffusion with diffusers(050)   Ver 0.06
##
##               2025.07.31 Masahiro Izutsu
##--------------------------------------------------
## sd_050.py    顔の崩れを修正する
##  Ver 0.06    2025.07.31  sd_081 IP-Adapter 対応

# タイトル
title = 'Stable Diffusion with diffusers(050)  Ver 0.06'

import warnings
warnings.simplefilter('ignore')

# インポート＆初期設定
import os
import torch
from PIL import Image
from PIL import ImageDraw, ImageFilter
import face_recognition
from diffusers import StableDiffusionUpscalePipeline
from diffusers import StableDiffusionImg2ImgPipeline
from diffusers import logging

import my_logging
import sd_tools as sdt

logging.set_verbosity_error()

# 定数定義
DEF_MODEL_CNTL = 'control_v11p_sd15_inpaint_fp16.safetensors'
DEF_MODEL_BASE = 'SD1.5/beautifulRealistic_brav5.safetensors'
DEF_IMAGE_PATH = 'images/sd_050_test.jpg'
DEF_PROMPT = 'masterpiece, high quality, very_high_resolution, large_filesize, full color, an extremely cute face, woman, symmetrical, HDR, real, realistic'
DEF_NEG_PROMPT = 'lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry, artist name, multiple legs, malformation'
#FACE_RECOGNITION_MODEL_ID = "hog"                                              # 速度重視の場合
FACE_RECOGNITION_MODEL_ID = "cnn"                                               # 精度重視の場合
UPSCALE_MODEL_ID = "stabilityai/stable-diffusion-x4-upscaler"

# コマンドライン定義
opt_list = [
            ['pros_sel','','sd_050'],                                                                       #  0
            ['result_image', 'results/image_050.png', 'path to output image file'],                         #  1
            ['cpu', 'store_true', 'cpu mode'],                                                              #  2
            ['log', '3', 'Log level(-1/0/1/2/3/4/5) Default value is \'3\''],                               #  3
            ['model_dir', '/StabilityMatrix/Data/Models/StableDiffusion', 'Model directory'],               #  4
            ['model_path', DEF_MODEL_BASE, 'Model Path'],                                                   #  5
            ['image_path', DEF_IMAGE_PATH, 'Sourcs image file path'],                                       #  6
            ['max_size', 0, 'image max size (0=source)'],                                                   #  7
            ['prompt', DEF_PROMPT, 'Prompt text'],                                                          #  8
            ['seed', 12345678, 'Seed parameter (-1 = rundom)'],                                             #  9
            ['width', 512, 'image size width'],                                                             # 10
            ['height', 512, 'image size height'],                                                           # 11
            ['step', 20, 'infer step'],                                                                     # 12
            ['scale', 8.5, 'gaidanse scale'],                                                               # 13
            ['strength', 0.4, 'strength value'],                                                            # 15
            ['neg_prompt', DEF_NEG_PROMPT, 'Negative Prompt text'],                                         # 16
           ]
# 画像確認
def image_log(pil_image, wait_s = -1):
    if wait_s >= 0:
        sdt.image_save2(pil_image, save_path = '', dispname = 'Check image', maxsize = 800, wait_s = wait_s)

# 画像を 512x512 アップスケール
def upscale(image, prompt, device):
    if device == 'cpu':
        pipeline  = StableDiffusionUpscalePipeline.from_pretrained(UPSCALE_MODEL_ID)
    else:
        pipeline  = StableDiffusionUpscalePipeline.from_pretrained(UPSCALE_MODEL_ID, torch_dtype = torch.float16)
    pipeline.to(device)

    low_image = image.convert("RGB")
    low_image = low_image.resize((128, 128))
    new_image = pipeline(prompt = prompt, image = low_image).images[0]
    return new_image

# 顔検出
def face_detection(file_name, offset=20):
    image = face_recognition.load_image_file(file_name)

    #顔部分を検出
    face_locs = face_recognition.face_locations(image, number_of_times_to_upsample = 1, model = FACE_RECOGNITION_MODEL_ID)

    face_org_rects = []
    face_rects = []
    if len(face_locs) == 0:
        return face_rects, face_org_rects                                       # 検出できない

    for face_loc in face_locs:
        top, right, bottom, left  = face_loc
        face_org_rects.append((left, top, right, bottom))

        # 範囲が狭いとモデルが顔を認識できない時があるため、検出範囲の矩形をoffset分広げる。
        top -= offset
        right += offset
        bottom += offset
        left -= offset

        # 検出範囲を正方形にする
        w = right - left
        h = bottom - top
        if w > h:
            bottom += w-h
        else:
            right += h-w

        face_rects.append((left, top, right, bottom))

    return face_rects, face_org_rects

# 顔のスタイル変換
def style_change(model_path, image, prompt, neg_prompt, guidance_scale = 9.5, strength = 0.4, seed = 0, device = 'cpu'):
    if device == 'cpu':
        pipeline  = StableDiffusionImg2ImgPipeline.from_single_file(model_path)
    else:
        pipeline  = StableDiffusionImg2ImgPipeline.from_single_file(model_path, torch_dtype = torch.float16)
    pipeline.to(device)

    generator = torch.Generator(device).manual_seed(seed)
    new_image = pipeline(
                        prompt = prompt,
                        negative_prompt = neg_prompt,
                        image = image,
                        guidance_scale = guidance_scale,
                        strength = strength,
                        generator = generator
                        ).images[0]

    return new_image

# マスク作成
def create_mask(image_width, image_height, rect_width, rect_height, rect_x, rect_y, offset = 10):
    image = Image.new('L', (image_width, image_height), 'black')                # 8bit グレイスケール 黒の画像を作成
    draw = ImageDraw.Draw(image)

    # offset分大きい真っ白の矩形を描画
    draw.rectangle([rect_x-offset, rect_y-offset, rect_x + rect_width + offset, rect_y + rect_height + offset], fill = 'white')

    # offset分小さい真っ黒の矩形を描描画
    draw.rectangle([rect_x+offset, rect_y+offset, rect_x + rect_width - offset, rect_y + rect_height - offset], fill = 'black')

    return image

# 画像の顔修正する
def face_style_change(model_path, file_name, prompt, neg_prompt, guidance_scale = 9.5, strength = 0.3, seed = 0, device = 'cpu', bUp = False):
    face_rects, face_org_rects = face_detection(file_name, offset = 30)
    if face_rects == [] or face_rects == []:
        return None, None, None                                                 # 顔検出なし

    face_rect = face_rects[0]
    face_org_rect = face_org_rects[0]

    left, top, right, bottom = face_rect
    left_org, top_org, right_org, bottom_org = face_org_rect
    w = right - left
    h = bottom - top

    #オリジナル画像から顔部分を切り出す
    init_img = Image.open(file_name)
    new_img = init_img.copy()
    face = new_img.crop(face_rect)

    # 顔をアップスケール
    if bUp:
        upscaled_face = upscale(face, prompt='face', device = device)           # upscale
    else:
        upscaled_face = face.resize((512, 512))                                 # resize
    image_log(upscaled_face, 1)

    # スタイル変更
    new_face = style_change(model_path, upscaled_face, prompt, neg_prompt, guidance_scale = guidance_scale, strength = strength, seed = seed, device = device)
    image_log(new_face, 1)

    # 元の画像に貼り付け
    new_img.paste(new_face.resize((w, h)), (left, top))
#    image_log(new_img, 0)

    # 顔の領域
    draw = ImageDraw.Draw(init_img)
    rectcolor = (0, 0, 255)                                                     # 矩形の色(RGB)
    linewidth = 2                                                               # 線の太さ
    draw.rectangle([(left_org, top_org), (right_org, bottom_org)], outline=rectcolor, width=linewidth)
#    image_log(init_img, 0)

    # エッジ部分の修正のためのマスクを作成
    image_width, image_height = new_img.size
    mask = create_mask(image_width, image_height, h, w, left, top, offset=8)
#    image_log(mask, 0)

    return init_img, new_img, mask

# 画像生成
def image_generation(model_path, image_path, prompt, seed, num_inference_steps=20, width=512, height=512, guidance_scale=8.5, strength=0.4, neg_prompt = '', device='cpu'):
    work_path = sdt.get_work_path(logger = None)
    os.makedirs(work_path, exist_ok = True)                                     # 作業フォルダ作成
    src_path, mask_path = sdt.get_source_mask_path(image_path, logger = None)   # ソース/マスク画像作成

    image, new_img, mask = face_style_change(model_path, image_path, prompt, neg_prompt, guidance_scale = guidance_scale, strength = strength, seed = seed, device = device, bUp = False)
    if image is None or new_img is None or mask is None:
        return None                                                             # Error

    # マスクのエッジをソフトフォーカスにして元の画像と合成しエッジを修正
    mask = mask.filter(ImageFilter.GaussianBlur(10))
    new_img = Image.composite(image, new_img, mask)

    sdt.image_save2(image, save_path = src_path, dispname = src_path, maxsize = 800, wait_s = 1)
    sdt.image_save2(mask, save_path = mask_path, dispname = '', maxsize = 800, wait_s = 1)

    return new_img

# ** main関数 **
def main(opt, logger = None):
    # パラメータ設定
    device = sdt._get_device(opt, logger)
    result_image_path = sdt._get_result_image_path(opt, logger)
    result_path = sdt._get_result_path(opt, logger)
    prompt = sdt._get_prompt(opt, logger)
    src_image = sdt._get_source_image(opt, logger)
    model_path = sdt._get_model_path(opt, logger)
    height, width = sdt._get_image_size(opt, logger)
    seed = sdt._get_seed_value(opt, logger)
    num_inference_steps = sdt._get_inference_steps(opt, logger)
    guidance_scale = sdt._get_guidance_scale(opt, logger)
    strength = sdt._get_strength(opt, logger)
    neg_prompt = sdt._get_negative_prompt(opt, logger)
    image_path = sdt._get_source_image_path(opt, logger)

    # 出力フォルダ
    os.makedirs(result_path, exist_ok = True)

    # 画像生成
    image = image_generation(model_path, image_path, prompt, seed, num_inference_steps, width, height, guidance_scale, strength, neg_prompt = neg_prompt, device = device)

    if image is None:
        logger.info(f'{sdt.RED｝There is no face in the image !!{sdt.NOCOLOR｝')

    else:
        sdt.image_save2(image, result_image_path, result_image_path)
        logger.info(f'result_file: {result_image_path｝')


# main関数エントリーポイント(実行開始)
if __name__ == "__main__":
    parser = sdt.parse_args(None, opt_list)
    opt = parser.parse_args()
    sdt._get_device(opt)
    sdt.display_info(opt, title)

    # アプリケーション・ログ設定
    module = os.path.basename(__file__)
    module_name = os.path.splitext(module)[0]
    logger = my_logging.get_module_logger_sel(module_name, int(opt.log))

    main(opt, logger)

    logger.info('\nFinished.\n')

　※ 上記ソースコードは表示の都合上、半角コード '}' が全角 '｝'になっていることに注意

Step 51：顔の崩れを修正する２「ADetailer」 †

概要
・「Stable Diffusion」の拡張機能である「ADetailer」を「diffusers」で動かす
・以下は「SD1.5」モデル専用のパッケージ「asdff」を使用する
・性別の指定のため「--ext」パラメータを用意する
　男性の場合：--ext 'boy'
　女性の場合：--ext 'girl' またはオプション指定なし

追加のパッケージ・インストール
```
 pip install asdff
```

プログラムを実行する（実行時間：約 3秒 RTX 4070 Ti 12GB）

 python sd_051.py

(sd_test) PS > python sd_051.py

Stable Diffusion with diffusers(sd_051)     Ver 0.06: Starting application...

 --result_image             :   results/sd_051.png
 --cpu                      :   False
 --log                      :   3
 --model_dir                :   /StabilityMatrix/Data/Models/StableDiffusion
 --model_path               :   SD1.5/beautifulRealistic_brav5.safetensors
 --image_path               :   D:/anaconda_win/workspace_3/sd_test/images/sd_050_test.jpg
 --max_size                 :   0
 --prompt                   :   masterpiece, best quality, 1girl
 --seed                     :   12345678
 --step                     :   30

= ADetailer =   prompt: 'masterpiece, best quality, 1girl'  model: 'face_yolov8s.pt'
Fetching 11 files: 100%|███████████████████████████████| 11/11 [00:00<?, ?it/s]
Loading pipeline components...: 100%|████████████| 6/6 [00:00<00:00, 19.93it/s]

0: 640x512 1 face, 36.0ms
Speed: 2.3ms preprocess, 36.0ms inference, 61.3ms postprocess per image at shape (1, 3, 640, 512)
100%|██████████████████████████████████████████| 12/12 [00:01<00:00,  9.07it/s]

0: 640x512 1 face, 23.9ms
Speed: 1.3ms preprocess, 23.9ms inference, 2.0ms postprocess per image at shape (1, 3, 640, 512)
100%|██████████████████████████████████████████| 12/12 [00:01<00:00,  9.17it/s]
result_file: results/sd_051-sd_050_test.png

Finished.

画像ファイル「sd_051-XXXXXXXX.png」が生成される（XXXXXXXX は入力ファイル名）

生成画像例（Step 50 と同じ元画像の場合）

モジュール・ソースコード

▼「sd_051.py」

# -*- coding: utf-8 -*-
##--------------------------------------------------
##  Stable Diffusion with diffusers(051)  Ver 0.06
##
##               2025.08.02 Masahiro Izutsu
##--------------------------------------------------
## sd_051.py
##  Ver 0.00    2025.08.02  asdff(ADetailer)
##  Ver 0.06    2025.08.03  sd_100.py 統合版対応

# asdff
# https://github.com/Bing-su/asdff
# https://github.com/theblackhatmagician/adetailer_sdxl?tab=readme-ov-file
# https://github.com/Bing-su/adetailer?tab=readme-ov-file

import warnings
warnings.simplefilter('ignore')

# インポート＆初期設定
import os
import argparse
import torch
from functools import partial
from asdff import AdPipeline, yolo_detector
from huggingface_hub import hf_hub_download
from diffusers import logging
from diffusers.utils import load_image

import my_logging
import sd_tools as sdt

logging.set_verbosity_error()                                                   # 不要なエラー出力の抑制


# 定数定義
MODEL_DIR = 'Bingsu/adetailer'
MODEL_FACE = 'face_yolov8s.pt'                                                  # 2D /リアルな顔
MODEL_HAND = 'hand_yolov8n.pt'                                                  # 2D /リアルな手
MODEL_PERSON = 'person_yolov8s-seg.pt'                                          # 2D/リアルな人物

def_result_image = 'results/sd_051.png'
def_image_path = ''
def_model_dir = '/StabilityMatrix/Data/Models/StableDiffusion'
def_model_path = 'SD1.5/beautifulRealistic_brav5.safetensors'
def_model = def_model_dir + '/' + def_model_path
def_prompt = 'masterpiece, best quality, 1girl'
def_prompt_m = 'masterpiece, best quality, 1boy'
def_seed = 12345678
def_step = 30
def_ext = ''


# タイトル
title = 'Stable Diffusion with diffusers(sd_051)     Ver 0.06'

# コマンドライン・オプション (argparse) 名前/初期値/ヘルプ
opt_list = [
            ['pros_sel','','sd_048'],                                                          #  0
            ['result_image', def_result_image, 'path to output image file'],                                #  1
            ['cpu', 'store_true', 'cpu mode'],                                                              #  2
            ['log', '3', 'Log level(-1/0/1/2/3/4/5) Default value is \'3\''],                               #  3
            ['model_dir', def_model_dir, 'Model directory'],                                                #  4
            ['model_path', def_model_path, 'Model Path'],                                                   #  5
            ['image_path', def_image_path, 'Sourcs image file path'],                                       #  6
            ['max_size', 0, 'image max size (0=source)'],                                                   #  7
            ['prompt', def_prompt, 'Prompt text'],                                                          #  8
            ['seed', def_seed, 'Seed parameter (-1 = rundom)'],                                             #  9
            ['step', def_step, 'infer step'],                                                               # 10
            ['ext', def_ext, 'Extensions \'\' or \'girl\' or \'boy\''],
           ]


# ** 画像生成 **

# 画像生成（画像ファイルパス入力）拡張版
def image_generation2_ex(ext, image_path, device='cpu', prompt=def_prompt, model_path=def_model, seed=def_seed, num_inference_steps=def_step, ad_model=MODEL_FACE):
    # prompt 変更
    if  ext == 'boy':
        prompt = def_prompt_m
    elif ext == 'girl':
        prompt = def_prompt

    src_image = load_image(image_path)
    image = image_generation(src_image, device, prompt, model_path, seed, num_inference_steps, ad_model)
    return image

# 画像生成（PIL 画像イメージ入力）拡張版
def image_generation_ex(ext, src_image, device='cpu', prompt=def_prompt, model_path=def_model, seed=def_seed, num_inference_steps=def_step, ad_model=MODEL_FACE):
    # prompt 変更
    if  ext == 'boy':
        prompt = def_prompt_m
    elif ext == 'girl':
        prompt = def_prompt

    image = image_generation(src_image, device, prompt, model_path, seed, num_inference_steps, ad_model)
    return image



# 画像生成（画像ファイルパス入力）
def image_generation2(image_path, device='cpu', prompt=def_prompt, model_path=def_model, seed=def_seed, num_inference_steps=def_step, ad_model=MODEL_FACE):
    src_image = load_image(image_path)
    image = image_generation(src_image, device, prompt, model_path, seed, num_inference_steps, ad_model)
    return image

# 画像生成（PIL 画像イメージ入力）
def image_generation(src_image, device='cpu', prompt=def_prompt, model_path=def_model, seed=def_seed, num_inference_steps=def_step, ad_model=MODEL_FACE):
    print(f"{sdt.CYAN｝= ADetailer =  {sdt.NOCOLOR｝ prompt: '{prompt｝'  model: '{ad_model｝'")

    # パイプラインを作成
    if device == 'cpu':
        pipeline = AdPipeline.from_single_file(model_path)
    else:
        pipeline = AdPipeline.from_single_file(model_path, torch_dtype=torch.float16)

    pipeline.to(device)

    person_model_path = hf_hub_download(MODEL_DIR, ad_model)
    person_detector = partial(yolo_detector, model_path = person_model_path)

    common = {"prompt": prompt, "num_inference_steps": num_inference_steps｝

    # 画像を生成
    response = pipeline(common=common, detectors=[person_detector, pipeline.default_detector], images=[src_image])
    images = response[0]
    image = None if images == [] else images[0]

    return image


# ** main関数 **
def main(opt, logger):
    # パラメータ設定
    device = sdt._get_device(opt, logger)
    result_image_path = sdt._get_result_image_path(opt, logger)
    result_path = sdt._get_result_path(opt, logger)
    prompt = sdt._get_prompt(opt, logger) if opt.ext != 'boy' else def_prompt_m
    src_image = sdt._get_source_image(opt, logger)
    seed = sdt._get_seed_value(opt, logger)
    num_inference_steps = sdt._get_inference_steps(opt, logger)
    model_path = sdt._get_model_path(opt, logger)
    image_path = sdt._get_source_image_path(opt, logger)

    # 出力フォルダ
    os.makedirs(result_path, exist_ok = True)

    # 画像生成
    image = image_generation(src_image, device, prompt, model_path, seed, num_inference_steps)

    if image is None:
        logger.info(f'{sdt.RED｝There is no face in the image !!{sdt.NOCOLOR｝')

    else:
        s = os.path.splitext(result_image_path)
        s0 = os.path.splitext(os.path.basename(image_path))[0]
        save_path = s[0] + '-' + s0 + s[1]
        sdt.image_save2(image, save_path, save_path)
        logger.info(f'result_file: {save_path｝')

    return


# main関数エントリーポイント(実行開始)
if __name__ == "__main__":
    import my_dialog

    parser = sdt.parse_args(None, opt_list)
    opt = parser.parse_args()
    sdt._get_device(opt)

    if len(opt.image_path) == 0:
        opt.image_path = my_dialog.select_image_file(initdir = './images')
        if len(opt.image_path) == 0:
            exit(0)

    sdt.display_info(opt, title)

    # アプリケーション・ログ設定
    module = os.path.basename(__file__)
    module_name = os.path.splitext(module)[0]
    logger = my_logging.get_module_logger_sel(module_name, int(opt.log))

    main(opt, logger)

    logger.info('\nFinished.\n')

　※ 上記ソースコードは表示の都合上、半角コード '}' が全角 '｝'になっていることに注意

忘備録 †

更新履歴 †

2025/07/26 初版

参考資料 †

Image-to-Image/ControlNet/IP-Adapter

Diffusers

Inpainting
- GitHub: Inpainting
- ドキュメント版 Inpainting に沿って試してみる

face-recognition

Programming
- Python, Pillowで二枚の画像をマスク画像に従って合成
- 【Python/Pillow(PIL)】画像データの新規作成

書籍など
- 日経ソフトウエア 2025年7月号「ローカル生成AIプログラミング」
- Interface 2025年3月号「画像による異常検出＆ローカルLLM作り - 仕事のための生成AI」