YOLOv7_Colab2

YOLO V7 on Google Colaboratory2 †

　「じゃんけん(グー・チョキ・パー)」のカスタム・データセットを作成し、Google Colaboratory 上の「YOLO V7」を使用して学習モデルを作成、じゃんけんの「グー・チョキ・パー」をリアルタイム検出する。

　

▼　目　次

▲　目　次

YOLO V7 on Google Colaboratory2
カスタムデータによる学習２「じゃんけんの判定１」
プロジェクトの目標

開発環境・使用ツール

学習データの収集＜ローカルマシン＞

ラベリング＜ローカルマシン＞

ラベリング（アノテーション）結果＜ローカルマシン＞

データセットの作成＜ローカルマシン＞

学習 Training＜クラウド・サービス＞

学習結果で推論を実行する画像編＜クラウド・サービス＞

学習結果で推論を実行する画像編＜ローカルマシン＞

学習結果で推論を実行する動画編＜クラウド・サービス＞

学習結果で推論を実行する動画編＜ローカルマシン＞

学習結果で推論を実行するカメラ編＜ローカルマシン＞（参考）

学習済みモデルを ONNX 形式にコンバートする＜クラウド・サービス＞

OpenVINO™ で実行する＜ローカルマシン＞

ここまでのまとめと今後の課題
わかったこと

これからの課題

クラウドサービス「Google Colaboratory」について

ローカルマシンで学習（Training）
NVIDIAのGPUアーキテクチャ

使用するハードウェア

現状確認

NVIDIA ドライバーのインストール

「Pytorch」インストール

パッケージの追加

学習 Training を実行

発生したエラー・メッセージとその原因

学習パラメータ考察

更新履歴

参考資料

※ 最終更新:2024/01/14　

↑

カスタムデータによる学習２「じゃんけんの判定１」 †

↑

プロジェクトの目標 †

自前のデータセットを作成して学習モデルを作成する方法を検証してみる。
Local PCのwebカメラを使って、じゃんけん(グー・チョキ・パー)を AIに検出させる。モデルの作成は「Google Colab」の GPUパワーを利用し出来上がったモデルを Local PCへ移植する。
またLocal PCの「OpenVINO™」上で動かすために学習済みモデルを「Google Colab」上で onnx フォーマットにコンバートする。

※ 図引用 & 参考サイト

↑

開発環境・使用ツール †

開発環境

開発環境	特記事項
Google Colab	学習(モデル生成)時に使用。GPUを利用できるため学習スピードが早い
Local PC Webカメラ	静止画撮影、ラベリング、物体検出時に使用する Windows(Linuxも可) pythonの仮想環境はanacondaを利用

仕様ツール

種別	ツール名	用途
Google Colab	YOLO v7	学習(モデル生成)
Local PC	YOLO v7	物体検出フリーウェア(GNU GPL V3)
	カメラアプリ	静止画撮影 Windows標準アプリ
	labelimg	教師データのラベリングフリーウェア(MIT License)

開発フロー

ローカルPC への事前準備
・アップデートファイルをダウンロードする
　update_20230616.zip (1.22GB) <アップデート・データ> をダウンロード

・「update」フォルダ内をすべてプロジェクトのホームディレクトリに配置する（上書き保存）
　「Windows」の場合 ➡ 「/anacondawin」直下
　「Linux」の場合　 ➡ 「~/」ホームディレクトリ直下

↑

学習データの収集＜ローカルマシン＞ †

カメラアプリの設定
・写真の画質 → 640x480
・低速度撮影 → オン
・写真タイマー → 2秒

静止画撮影
・「写真撮影」ボタンを押すと 2秒周期の連続で撮影する。撮影終了はもう一度「写真撮影」ボタンを押す。
・「グー・チョキ・パー」をいろいろな角度から写真を撮影する。
・「グー・チョキ・パー」それぞれ単体の写真を各200枚ずつ、計600枚写真を撮る。

フォルダに分けてファイル名を付け直す

　janken                        → 撮影データフォルダ
　│
　├─ goo                      → 「グー」の写真フォルダ
　│　　├─ goo_001.jpg
　│　　├─ goo_002.jpg
　│　　︙　　︙
　│　　└─ goo_200.jpg
　├─ choki                    → 「チョキ」の写真フォルダ
　│　　├─ choki_001.jpg
　│　　├─ choki_002.jpg
　│　　︙　　︙
　│　　└─ choki_200.jpg
　└─ par                      → 「パー」の写真フォルダ
　　　　├─ par_001.jpg
　　　　├─ par_002.jpg
　　　　︙　　︙
　　　　└─ par_200.jpg

↑

ラベリング＜ローカルマシン＞ †

「labelimg」のインストール
・ローカルPCの anaconda python仮想環境(py38a) にインストールする

 PS > conda activate py38a
(py38a) PS > pip install labelImg

▼　- log -　NVIDIA GeForce GTX 1050 Ti

Collecting labelImg
  Downloading labelImg-1.8.6.tar.gz (247 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 247.7/247.7 kB 7.4 MB/s eta 0:00:00
  Preparing metadata (setup.py) ... done
Collecting pyqt5
  Downloading PyQt5-5.15.9-cp37-abi3-win_amd64.whl (6.8 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.8/6.8 MB 33.7 MB/s eta 0:00:00
Collecting lxml
  Downloading lxml-4.9.2-cp38-cp38-win_amd64.whl (3.9 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 3.9/3.9 MB 31.1 MB/s eta 0:00:00
Collecting PyQt5-sip<13,>=12.11
  Downloading PyQt5_sip-12.12.1-cp38-cp38-win_amd64.whl (78 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 78.2/78.2 kB 4.2 MB/s eta 0:00:00
Collecting PyQt5-Qt5>=5.15.2
  Downloading PyQt5_Qt5-5.15.2-py3-none-win_amd64.whl (50.1 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 50.1/50.1 MB 20.4 MB/s eta 0:00:00
Building wheels for collected packages: labelImg
  Building wheel for labelImg (setup.py) ... done
  Created wheel for labelImg: filename=labelImg-1.8.6-py2.py3-none-any.whl size=261578 sha256=3a9c2569531e82e0da8b254cb569a2df0aee1d0f949dbfdd1b7f3a9dfabca906
  Stored in directory: c:\users\izuts\appdata\local\pip\cache\wheels\c3\9e\49\8368f5bc5347b5e54aef95b7b03ec56af7e23ea4c16c82109c
Successfully built labelImg
Installing collected packages: PyQt5-Qt5, PyQt5-sip, lxml, pyqt5, labelImg
Successfully installed PyQt5-Qt5-5.15.2 PyQt5-sip-12.12.1 labelImg-1.8.6 lxml-4.9.2 pyqt5-5.15.9

「labelimg」の設定
・撮影データフォルダ「work/labelimg/janken/」に分類クラスファイル「predefined_classes.txt」を作成しておく
```
goo
choki
par
```
・「labelImg」を起動する
```
(py38a) PS > cd /anaconda_win/work/labelImg/janken/
(py38a) PS > labelImg par predefined_classes.txt par
```
・左サイドバーの「フォーマット」をYOLOに設定する
・「View」メニューバーの「Auto Save mode」「Display Labels」にチェックを入れる

ラベリング（アノテーション）をする

・左手で 'w' キーを押して枠モードにし、右手マウスで枠を囲う、'd' キーを押して自動Save & Next Image に移動・・・を繰り返す
・「Use default label」をONにし、「goo」「choki」「par」とラベリングしていくと作業が早い
・「グー・チョキ・パー」600枚をラベリングする

↑

ラベリング（アノテーション）結果＜ローカルマシン＞ †

ラベルファイルのフォーマット
・オブジェクトごとに１行ずつ、記述する
```
class x_center y_center width height
```
・先頭列がクラス表す、クラス番号は0から始まる
・ボックス座標は、正規化された xywh形式（0〜1）
・ボックスがピクセル単位の場合は、画像の幅と画像の高さで割る

・「par_001.jpg」のラベルファイル(ラベリングデータ)は「par_001.txt」
```
2 0.458594 0.502083 0.642188 0.812500
↑ 
クラス2 が定義されていることを表す
```

「classes.txt」は使用しないが、クラス分け出来ているか確認することができる。先頭行からクラス0、クラス1、クラス2

goo       ← クラス0が定義されていることを表す
choki     ← クラス1が定義されていることを表す
par       ← クラス2が定義されていることを表す

↑

データセットの作成＜ローカルマシン＞ †

ラベリング（アノテーション）が終わったファイルをデータセットとしてフォルダに振り分ける

　janken_dataset                  → データセット フォルダ
　│
　├─ train                      → トレーニング用データ フォルダ
　│　　├─ images
　│　　│　　├─ XXXXXXX.jpg    → 画像ファイル フォルダ
　│　　︙　　︙
　│　　│
　│　　└─ labels
　│　　　　　├─ XXXXXXX.txt    → ラベルファイル フォルダ
　│　　　　　︙
　│
　└─ varid                      → 学習状況の検証用データフォルダ
　　　　├─ images
　　　　│　　├─ XXXXXXX.jpg    → 画像ファイル フォルダ
　　　　︙　　︙
　　　　│
　　　　└─ labels
　　　　　　　├─ XXXXXXX.txt    → ラベルファイル フォルダ
　　　　　　　︙

「train/」：「valid/」は一般的に８：２で振り分ける（１６０枚：４０枚）
ランダムに４０ファイルを抽出するプログラム

▼「file_select.py」

# -*- coding: utf-8 -*-
##------------------------------------------
##   File select            Ver 0.01
##
##               2023.06.12 Masahiro Izutsu
##------------------------------------------
## file_select.py

import random
import shutil

# 重複なし乱数
def rand_ints_nodup(a, b, k):
  ns = []
  while len(ns) < k:
    n = random.randint(a, b)
    if not n in ns:
      ns.append(n)
  return ns

# 抽出ファイル名のリスト
def get_select_list(ns, heder):
    s = []
    nm = len(ns)
    for n in range(nm):
        s.append(heder + '%03d' % ns[n])
    return s

#-----Main routine-----
#
if __name__ == "__main__":
    sel_no = 40
    max_no = 200

    # ランダムに抽出
    ns = rand_ints_nodup(1, max_no, sel_no)
    goo = get_select_list(ns, 'goo_')
    ns = rand_ints_nodup(1, max_no, sel_no)
    choki = get_select_list(ns, 'choki_')
    ns = rand_ints_nodup(1, max_no, sel_no)
    par = get_select_list(ns, 'par_')

    s_path = 'janken/'
    d0_path = 'dataset/valid/images/'
    d1_path = 'dataset/valid/labels/'

    # ファイル移動
    for n in range(sel_no):
        s0_f = s_path + 'goo/' + goo[n] + '.jpg'
        print(s0_f, '>>', d0_path)
        shutil.move(s0_f, d0_path)

        s1_f = s_path + 'goo/' + goo[n] + '.txt'
        d1_f = d1_path + goo[n]
        print(s1_f, '>>', d1_path)
        shutil.move(s1_f, d1_path)

        s0_f = s_path + 'choki/' + choki[n] + '.jpg'
        print(s0_f, '>>', d0_path)
        shutil.move(s0_f, d0_path)

        s1_f = s_path + 'choki/' + choki[n] + '.txt'
        print(s1_f, '>>', d1_path)
        shutil.move(s1_f, d1_path)

        s0_f = s_path + 'par/' + par[n] + '.jpg'
        print(s0_f, '>>', d0_path)
        shutil.move(s0_f, d0_path)

        s1_f = s_path + 'par/' + par[n] + '.txt'
        print(s1_f, '>>', d1_path)
        shutil.move(s1_f, d1_path)
        print(n)

    print('-- End ---')

トレーニングの構成を指示する「janken_dataset.yaml」ファイルを作成する

train: ./data/janken_dataset/train/images
val: ./data/janken_dataset/valid/images

nc: 3
names: ['goo', 'choki', 'par']

↑

学習 Training＜クラウド・サービス＞ †

GoogleColab を起動する
前回セクション「Google Colaboratory に「YOLO V7」を実装」を未実行の場合は → ここのページを実行
ノートブック「物体検出_yolov7.ipynb」を選択する
左サイドバーの「ファイル」を選択することで Googleドライブをマウントする
データ・セット「janken_dataset」をフォルダごとGoogleドライブの「yolov7/data」にアップする
```
yolov7　
  ┗ data
     ┗ janken_dataset
```
学習するのに必要な情報を記述したyamlファイル「janken_dataset.yaml」を「yolov7」の直下にアップする
```
yolov7　
  ┠ data
  ┃  ┗ janken_dataset
  ┗ janken_dataset.yaml
```
カレントディレクトリを Googleドライブ「yolov7」へ移動する
```
cd /content/drive/MyDrive/yolov7
```
・結果表示
```
/content/drive/MyDrive/yolov7
```

下記のコマンドで学習を実行する

!python train.py --workers 8 --batch-size 16 --data janken_dataset.yaml --cfg cfg/training/yolov7x.yaml --weights 'yolov7x.pt' --name yolov7x_custom --hyp data/hyp.scratch.p5.yaml --epochs 300 --device 0

▼　- log -　GoogleColab Tesla T4

2023-06-12 20:22:01.334600: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
2023-06-12 20:22:02.217058: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT
YOLOR 🚀 v0.1-122-g3b41c2c torch 2.0.1+cu118 CUDA:0 (Tesla T4, 15101.8125MB)

Namespace(weights='yolov7x.pt', cfg='cfg/training/yolov7x.yaml', data='janken_dataset.yaml', hyp='data/hyp.scratch.p5.yaml', epochs=300, batch_size=16, img_size=[640, 640], rect=False, resume=False, nosave=False, notest=False, noautoanchor=False, evolve=False, bucket='', cache_images=False, image_weights=False, device='0', multi_scale=False, single_cls=False, adam=False, sync_bn=False, local_rank=-1, workers=8, project='runs/train', entity=None, name='yolov7x_custom', exist_ok=False, quad=False, linear_lr=False, label_smoothing=0.0, upload_dataset=False, bbox_interval=-1, save_period=-1, artifact_alias='latest', freeze=[0], v5_metric=False, world_size=1, global_rank=-1, save_dir='runs/train/yolov7x_custom', total_batch_size=16)
tensorboard: Start with 'tensorboard --logdir runs/train', view at http://localhost:6006/
hyperparameters: lr0=0.01, lrf=0.1, momentum=0.937, weight_decay=0.0005, warmup_epochs=3.0, warmup_momentum=0.8, warmup_bias_lr=0.1, box=0.05, cls=0.3, cls_pw=1.0, obj=0.7, obj_pw=1.0, iou_t=0.2, anchor_t=4.0, fl_gamma=0.0, hsv_h=0.015, hsv_s=0.7, hsv_v=0.4, degrees=0.0, translate=0.2, scale=0.9, shear=0.0, perspective=0.0, flipud=0.0, fliplr=0.5, mosaic=1.0, mixup=0.15, copy_paste=0.0, paste_in=0.15, loss_ota=1
wandb: Install Weights & Biases for YOLOR logging with 'pip install wandb' (recommended)
Overriding model.yaml nc=80 with nc=3

                 from  n    params  module                                  arguments                     
  0                -1  1      1160  models.common.Conv                      [3, 40, 3, 1]                 
  1                -1  1     28960  models.common.Conv                      [40, 80, 3, 2]                
  2                -1  1     57760  models.common.Conv                      [80, 80, 3, 1]                
  3                -1  1    115520  models.common.Conv                      [80, 160, 3, 2]               
  4                -1  1     10368  models.common.Conv                      [160, 64, 1, 1]               
  5                -2  1     10368  models.common.Conv                      [160, 64, 1, 1]               
  6                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
  7                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
  8                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
  9                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
 10                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
 11                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
 12[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 13                -1  1    103040  models.common.Conv                      [320, 320, 1, 1]              
 14                -1  1         0  models.common.MP                        []                            
 15                -1  1     51520  models.common.Conv                      [320, 160, 1, 1]              
 16                -3  1     51520  models.common.Conv                      [320, 160, 1, 1]              
 17                -1  1    230720  models.common.Conv                      [160, 160, 3, 2]              
 18          [-1, -3]  1         0  models.common.Concat                    [1]                           
 19                -1  1     41216  models.common.Conv                      [320, 128, 1, 1]              
 20                -2  1     41216  models.common.Conv                      [320, 128, 1, 1]              
 21                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 22                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 23                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 24                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 25                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 26                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 27[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 28                -1  1    410880  models.common.Conv                      [640, 640, 1, 1]              
 29                -1  1         0  models.common.MP                        []                            
 30                -1  1    205440  models.common.Conv                      [640, 320, 1, 1]              
 31                -3  1    205440  models.common.Conv                      [640, 320, 1, 1]              
 32                -1  1    922240  models.common.Conv                      [320, 320, 3, 2]              
 33          [-1, -3]  1         0  models.common.Concat                    [1]                           
 34                -1  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 35                -2  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 36                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 37                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 38                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 39                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 40                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 41                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 42[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 43                -1  1   1640960  models.common.Conv                      [1280, 1280, 1, 1]            
 44                -1  1         0  models.common.MP                        []                            
 45                -1  1    820480  models.common.Conv                      [1280, 640, 1, 1]             
 46                -3  1    820480  models.common.Conv                      [1280, 640, 1, 1]             
 47                -1  1   3687680  models.common.Conv                      [640, 640, 3, 2]              
 48          [-1, -3]  1         0  models.common.Concat                    [1]                           
 49                -1  1    328192  models.common.Conv                      [1280, 256, 1, 1]             
 50                -2  1    328192  models.common.Conv                      [1280, 256, 1, 1]             
 51                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 52                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 53                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 54                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 55                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 56                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 57[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 58                -1  1   1640960  models.common.Conv                      [1280, 1280, 1, 1]            
 59                -1  1  11887360  models.common.SPPCSPC                   [1280, 640, 1]                
 60                -1  1    205440  models.common.Conv                      [640, 320, 1, 1]              
 61                -1  1         0  torch.nn.modules.upsampling.Upsample    [None, 2, 'nearest']          
 62                43  1    410240  models.common.Conv                      [1280, 320, 1, 1]             
 63          [-1, -2]  1         0  models.common.Concat                    [1]                           
 64                -1  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 65                -2  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 66                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 67                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 68                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 69                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 70                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 71                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 72[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 73                -1  1    410240  models.common.Conv                      [1280, 320, 1, 1]             
 74                -1  1     51520  models.common.Conv                      [320, 160, 1, 1]              
 75                -1  1         0  torch.nn.modules.upsampling.Upsample    [None, 2, 'nearest']          
 76                28  1    102720  models.common.Conv                      [640, 160, 1, 1]              
 77          [-1, -2]  1         0  models.common.Concat                    [1]                           
 78                -1  1     41216  models.common.Conv                      [320, 128, 1, 1]              
 79                -2  1     41216  models.common.Conv                      [320, 128, 1, 1]              
 80                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 81                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 82                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 83                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 84                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 85                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 86[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 87                -1  1    102720  models.common.Conv                      [640, 160, 1, 1]              
 88                -1  1         0  models.common.MP                        []                            
 89                -1  1     25920  models.common.Conv                      [160, 160, 1, 1]              
 90                -3  1     25920  models.common.Conv                      [160, 160, 1, 1]              
 91                -1  1    230720  models.common.Conv                      [160, 160, 3, 2]              
 92      [-1, -3, 73]  1         0  models.common.Concat                    [1]                           
 93                -1  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 94                -2  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 95                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 96                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 97                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 98                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 99                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
100                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
101[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
102                -1  1    410240  models.common.Conv                      [1280, 320, 1, 1]             
103                -1  1         0  models.common.MP                        []                            
104                -1  1    103040  models.common.Conv                      [320, 320, 1, 1]              
105                -3  1    103040  models.common.Conv                      [320, 320, 1, 1]              
106                -1  1    922240  models.common.Conv                      [320, 320, 3, 2]              
107      [-1, -3, 59]  1         0  models.common.Concat                    [1]                           
108                -1  1    656384  models.common.Conv                      [1280, 512, 1, 1]             
109                -2  1    656384  models.common.Conv                      [1280, 512, 1, 1]             
110                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
111                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
112                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
113                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
114                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
115                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
116[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
117                -1  1   1639680  models.common.Conv                      [2560, 640, 1, 1]             
118                87  1    461440  models.common.Conv                      [160, 320, 3, 1]              
119               102  1   1844480  models.common.Conv                      [320, 640, 3, 1]              
120               117  1   7375360  models.common.Conv                      [640, 1280, 3, 1]             
121   [118, 119, 120]  1     56144  models.yolo.IDetect                     [3, [[12, 16, 19, 36, 40, 28], [36, 75, 76, 55, 72, 146], [142, 110, 192, 243, 459, 401]], [320, 640, 1280]]
Model Summary: 467 layers, 70828568 parameters, 70828568 gradients

Transferred 630/644 items from yolov7x.pt
Scaled weight_decay = 0.0005
Optimizer groups: 108 .bias, 108 conv.weight, 111 other
train: Scanning 'data/janken_dataset/train/labels' images and labels... 480 found, 0 missing, 0 empty, 0 corrupted: 100% 480/480 [02:09<00:00,  3.70it/s]
train: New cache created: data/janken_dataset/train/labels.cache
val: Scanning 'data/janken_dataset/valid/labels' images and labels... 120 found, 0 missing, 0 empty, 0 corrupted: 100% 120/120 [00:32<00:00,  3.69it/s]
val: New cache created: data/janken_dataset/valid/labels.cache

autoanchor: Analyzing anchors... anchors/target = 3.37, Best Possible Recall (BPR) = 1.0000
Image sizes 640 train, 640 test
Using 2 dataloader workers
Logging results to runs/train/yolov7x_custom
Starting training for 300 epochs...

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
     0/299     14.3G   0.06686   0.01977   0.02194    0.1086        39       640: 100% 30/30 [00:59<00:00,  1.98s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95:   0% 0/4 [00:00<?, ?it/s]/usr/local/lib/python3.10/dist-packages/torch/functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at ../aten/src/ATen/native/TensorShape.cpp:3483.)
  return _VF.meshgrid(tensors, **kwargs)  # type: ignore[attr-defined]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:14<00:00,  3.58s/it]
                 all         120         120      0.0544       0.283      0.0567     0.00864

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
     1/299     14.3G   0.05559    0.0168   0.02139   0.09378        49       640: 100% 30/30 [00:41<00:00,  1.37s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:03<00:00,  1.24it/s]
                 all         120         120        0.18       0.242       0.164      0.0259

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
     2/299     14.3G   0.05013   0.01553   0.02074    0.0864        50       640: 100% 30/30 [00:40<00:00,  1.35s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:02<00:00,  1.51it/s]
                 all         120         120       0.198       0.423       0.259      0.0494

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
     3/299     14.3G    0.0501    0.0149    0.0204    0.0854        65       640: 100% 30/30 [00:40<00:00,  1.37s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:02<00:00,  1.68it/s]
                 all         120         120       0.255       0.333        0.25      0.0635

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
     4/299     14.3G   0.04585   0.01236   0.01767   0.07589        49       640: 100% 30/30 [00:40<00:00,  1.34s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:02<00:00,  1.69it/s]
                 all         120         120       0.244       0.533       0.309       0.119

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
     5/299     14.4G   0.04897   0.01137   0.01814   0.07849        41       640: 100% 30/30 [00:40<00:00,  1.34s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:02<00:00,  1.57it/s]
                 all         120         120       0.324       0.632        0.38       0.118

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
     6/299     14.3G    0.0458   0.01117    0.0178   0.07477        39       640: 100% 30/30 [00:40<00:00,  1.34s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:02<00:00,  1.59it/s]
                 all         120         120       0.287       0.608       0.321       0.134

        :
        :

「Google colab」の接続が切れる

・90分ルールというのがあり何も操作せずに90分経つとリセットされる（この学習はおよそ120分かかる）
・無料版で GPUを使いすぎると強制的に切断される。再接続するには数時間から12時間以上の待つ必要がある
　遮断中は「GPUバックエンドに接続できません」のメッセージが表示され時間が経過する場で GPUを使うことができない

・接続が切れた場合、再度接続しカレントディレクトリを再設定する

cd /content/drive/MyDrive/yolov7

・引数に「–resume」を追加することで、前回の途中から学習をする（正常に終了せるまで何度か繰り返す）

!python train.py --workers 8 --batch-size 16 --data janken_dataset.yaml --cfg cfg/training/yolov7x.yaml --weights 'yolov7x.pt' --name yolov7x_custom --hyp data/hyp.scratch.p5.yaml --epochs 300 --device 0  --resume

▼　- log -　GoogleColab Tesla T4

2023-06-14 05:43:14.944705: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 AVX512F FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
2023-06-14 05:43:15.941306: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT
Resuming training from ./runs/train/yolov7x_custom/weights/last.pt
YOLOR 🚀 v0.1-122-g3b41c2c torch 2.0.1+cu118 CUDA:0 (Tesla T4, 15101.8125MB)

Namespace(weights='./runs/train/yolov7x_custom/weights/last.pt', cfg='', data='janken_dataset.yaml', hyp='data/hyp.scratch.p5.yaml', epochs=300, batch_size=16, img_size=[640, 640], rect=False, resume=True, nosave=False, notest=False, noautoanchor=False, evolve=False, bucket='', cache_images=False, image_weights=False, device='0', multi_scale=False, single_cls=False, adam=False, sync_bn=False, local_rank=-1, workers=8, project='runs/train', entity=None, name='yolov7x_custom', exist_ok=False, quad=False, linear_lr=False, label_smoothing=0.0, upload_dataset=False, bbox_interval=-1, save_period=-1, artifact_alias='latest', freeze=[0], v5_metric=False, world_size=1, global_rank=-1, save_dir='runs/train/yolov7x_custom', total_batch_size=16)
tensorboard: Start with 'tensorboard --logdir runs/train', view at http://localhost:6006/
hyperparameters: lr0=0.01, lrf=0.1, momentum=0.937, weight_decay=0.0005, warmup_epochs=3.0, warmup_momentum=0.8, warmup_bias_lr=0.1, box=0.05, cls=0.3, cls_pw=1.0, obj=0.7, obj_pw=1.0, iou_t=0.2, anchor_t=4.0, fl_gamma=0.0, hsv_h=0.015, hsv_s=0.7, hsv_v=0.4, degrees=0.0, translate=0.2, scale=0.9, shear=0.0, perspective=0.0, flipud=0.0, fliplr=0.5, mosaic=1.0, mixup=0.15, copy_paste=0.0, paste_in=0.15, loss_ota=1
wandb: Install Weights & Biases for YOLOR logging with 'pip install wandb' (recommended)

                 from  n    params  module                                  arguments                     
  0                -1  1      1160  models.common.Conv                      [3, 40, 3, 1]                 
  1                -1  1     28960  models.common.Conv                      [40, 80, 3, 2]                
  2                -1  1     57760  models.common.Conv                      [80, 80, 3, 1]                
  3                -1  1    115520  models.common.Conv                      [80, 160, 3, 2]               
  4                -1  1     10368  models.common.Conv                      [160, 64, 1, 1]               
  5                -2  1     10368  models.common.Conv                      [160, 64, 1, 1]               
  6                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
  7                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
  8                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
  9                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
 10                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
 11                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]                
 12[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 13                -1  1    103040  models.common.Conv                      [320, 320, 1, 1]              
 14                -1  1         0  models.common.MP                        []                            
 15                -1  1     51520  models.common.Conv                      [320, 160, 1, 1]              
 16                -3  1     51520  models.common.Conv                      [320, 160, 1, 1]              
 17                -1  1    230720  models.common.Conv                      [160, 160, 3, 2]              
 18          [-1, -3]  1         0  models.common.Concat                    [1]                           
 19                -1  1     41216  models.common.Conv                      [320, 128, 1, 1]              
 20                -2  1     41216  models.common.Conv                      [320, 128, 1, 1]              
 21                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 22                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 23                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 24                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 25                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 26                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 27[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 28                -1  1    410880  models.common.Conv                      [640, 640, 1, 1]              
 29                -1  1         0  models.common.MP                        []                            
 30                -1  1    205440  models.common.Conv                      [640, 320, 1, 1]              
 31                -3  1    205440  models.common.Conv                      [640, 320, 1, 1]              
 32                -1  1    922240  models.common.Conv                      [320, 320, 3, 2]              
 33          [-1, -3]  1         0  models.common.Concat                    [1]                           
 34                -1  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 35                -2  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 36                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 37                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 38                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 39                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 40                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 41                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 42[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 43                -1  1   1640960  models.common.Conv                      [1280, 1280, 1, 1]            
 44                -1  1         0  models.common.MP                        []                            
 45                -1  1    820480  models.common.Conv                      [1280, 640, 1, 1]             
 46                -3  1    820480  models.common.Conv                      [1280, 640, 1, 1]             
 47                -1  1   3687680  models.common.Conv                      [640, 640, 3, 2]              
 48          [-1, -3]  1         0  models.common.Concat                    [1]                           
 49                -1  1    328192  models.common.Conv                      [1280, 256, 1, 1]             
 50                -2  1    328192  models.common.Conv                      [1280, 256, 1, 1]             
 51                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 52                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 53                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 54                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 55                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 56                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 57[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 58                -1  1   1640960  models.common.Conv                      [1280, 1280, 1, 1]            
 59                -1  1  11887360  models.common.SPPCSPC                   [1280, 640, 1]                
 60                -1  1    205440  models.common.Conv                      [640, 320, 1, 1]              
 61                -1  1         0  torch.nn.modules.upsampling.Upsample    [None, 2, 'nearest']          
 62                43  1    410240  models.common.Conv                      [1280, 320, 1, 1]             
 63          [-1, -2]  1         0  models.common.Concat                    [1]                           
 64                -1  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 65                -2  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 66                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 67                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 68                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 69                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 70                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 71                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 72[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 73                -1  1    410240  models.common.Conv                      [1280, 320, 1, 1]             
 74                -1  1     51520  models.common.Conv                      [320, 160, 1, 1]              
 75                -1  1         0  torch.nn.modules.upsampling.Upsample    [None, 2, 'nearest']          
 76                28  1    102720  models.common.Conv                      [640, 160, 1, 1]              
 77          [-1, -2]  1         0  models.common.Concat                    [1]                           
 78                -1  1     41216  models.common.Conv                      [320, 128, 1, 1]              
 79                -2  1     41216  models.common.Conv                      [320, 128, 1, 1]              
 80                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 81                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 82                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 83                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 84                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 85                -1  1    147712  models.common.Conv                      [128, 128, 3, 1]              
 86[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
 87                -1  1    102720  models.common.Conv                      [640, 160, 1, 1]              
 88                -1  1         0  models.common.MP                        []                            
 89                -1  1     25920  models.common.Conv                      [160, 160, 1, 1]              
 90                -3  1     25920  models.common.Conv                      [160, 160, 1, 1]              
 91                -1  1    230720  models.common.Conv                      [160, 160, 3, 2]              
 92      [-1, -3, 73]  1         0  models.common.Concat                    [1]                           
 93                -1  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 94                -2  1    164352  models.common.Conv                      [640, 256, 1, 1]              
 95                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 96                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 97                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 98                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
 99                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
100                -1  1    590336  models.common.Conv                      [256, 256, 3, 1]              
101[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
102                -1  1    410240  models.common.Conv                      [1280, 320, 1, 1]             
103                -1  1         0  models.common.MP                        []                            
104                -1  1    103040  models.common.Conv                      [320, 320, 1, 1]              
105                -3  1    103040  models.common.Conv                      [320, 320, 1, 1]              
106                -1  1    922240  models.common.Conv                      [320, 320, 3, 2]              
107      [-1, -3, 59]  1         0  models.common.Concat                    [1]                           
108                -1  1    656384  models.common.Conv                      [1280, 512, 1, 1]             
109                -2  1    656384  models.common.Conv                      [1280, 512, 1, 1]             
110                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
111                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
112                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
113                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
114                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
115                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]              
116[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]                           
117                -1  1   1639680  models.common.Conv                      [2560, 640, 1, 1]             
118                87  1    461440  models.common.Conv                      [160, 320, 3, 1]              
119               102  1   1844480  models.common.Conv                      [320, 640, 3, 1]              
120               117  1   7375360  models.common.Conv                      [640, 1280, 3, 1]             
121   [118, 119, 120]  1     56144  models.yolo.IDetect                     [3, [[12, 16, 19, 36, 40, 28], [36, 75, 76, 55, 72, 146], [142, 110, 192, 243, 459, 401]], [320, 640, 1280]]
Model Summary: 467 layers, 70828568 parameters, 70828568 gradients

Transferred 644/644 items from ./runs/train/yolov7x_custom/weights/last.pt
Scaled weight_decay = 0.0005
Optimizer groups: 108 .bias, 108 conv.weight, 111 other
train: Scanning 'data/janken_dataset/train/labels.cache' images and labels... 480 found, 0 missing, 0 empty, 0 corrupted: 100% 480/480 [00:00<?, ?it/s]
val: Scanning 'data/janken_dataset/valid/labels.cache' images and labels... 120 found, 0 missing, 0 empty, 0 corrupted: 100% 120/120 [00:00<?, ?it/s]
Image sizes 640 train, 640 test
Using 2 dataloader workers
Logging results to runs/train/yolov7x_custom
Starting training for 300 epochs...

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
   295/299     14.3G   0.01737  0.004938  0.003235   0.02555        39       640: 100% 30/30 [01:01<00:00,  2.06s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95:   0% 0/4 [00:00<?, ?it/s]/usr/local/lib/python3.10/dist-packages/torch/functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at ../aten/src/ATen/native/TensorShape.cpp:3483.)
  return _VF.meshgrid(tensors, **kwargs)  # type: ignore[attr-defined]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:15<00:00,  3.88s/it]
                 all         120         120       0.996       0.992       0.997       0.747

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
   296/299     14.3G   0.01841  0.004869  0.003004   0.02628        49       640: 100% 30/30 [00:43<00:00,  1.45s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:02<00:00,  1.35it/s]
                 all         120         120       0.996       0.992       0.997       0.748

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
   297/299     14.3G   0.01613     0.005  0.002463    0.0236        50       640: 100% 30/30 [00:44<00:00,  1.49s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:02<00:00,  1.43it/s]
                 all         120         120       0.998       0.995       0.997       0.747

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
   298/299     14.3G   0.01826  0.005085  0.002929   0.02628        65       640: 100% 30/30 [00:42<00:00,  1.43s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:03<00:00,  1.29it/s]
                 all         120         120       0.997       0.995       0.997       0.748

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
   299/299     14.3G   0.02072  0.004905  0.003531   0.02915        49       640: 100% 30/30 [00:43<00:00,  1.45s/it]
               Class      Images      Labels           P           R      mAP@.5  mAP@.5:.95: 100% 4/4 [00:04<00:00,  1.22s/it]
                 all         120         120       0.988       0.996       0.997        0.75
                 goo         120          40        0.99           1       0.997       0.733
               choki         120          40       0.976           1       0.997       0.726
                 par         120          40           1       0.988       0.997       0.791
5 epochs completed in 0.086 hours.

Optimizer stripped from runs/train/yolov7x_custom/weights/last.pt, 142.1MB
Optimizer stripped from runs/train/yolov7x_custom/weights/best.pt, 142.1MB

正常に終了すると、学習結果は「yolov7/runs/train/yolov7x_custom/」に保存される
・学習結果モデルは「yolov7x_custom/weights」評価指標は「yolov7x_custom」

F1 curve P curve PR curve R curve

↑

学習結果で推論を実行する画像編＜クラウド・サービス＞ †

学習結果モデル「runs/train/yolov7x_custom/weights/best.pt」を使用する
・テスト画像「janken.jpg」を「yolov7/」直下にアップしておく

検出コマンドを実行する

!python detect.py --weights runs/train/yolov7x_custom/weights/best.pt --conf 0.25 --img-size 640 --source janken.jpg

▼　- log -　GoogleColab Tesla T4

Namespace(weights=['runs/train/yolov7x_custom/weights/best.pt'], source='janken.jpg', img_size=640, conf_thres=0.25, iou_thres=0.45, device='', view_img=False, save_txt=False, save_conf=False, nosave=False, classes=None, agnostic_nms=False, augment=False, update=False, project='runs/detect', name='exp', exist_ok=False, no_trace=False)
YOLOR 🚀 v0.1-122-g3b41c2c torch 2.0.1+cu118 CPU

Fusing layers... 
IDetect.fuse
Model Summary: 362 layers, 70795920 parameters, 0 gradients
 Convert model to Traced-model... 
 traced_script_module saved! 
 model is traced! 

/usr/local/lib/python3.10/dist-packages/torch/functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at ../aten/src/ATen/native/TensorShape.cpp:3483.)
  return _VF.meshgrid(tensors, **kwargs)  # type: ignore[attr-defined]
4 goos, 3 chokis, 3 pars, Done. (2575.6ms) Inference, (23.1ms) NMS
 The image with the result is saved in: runs/detect/exp3/janken.jpg
Done. (3.236s)

・推論結果の保存先は「yolov7/runs/detect/exp*」（ * は自動的に振られる番号）

↑

学習結果で推論を実行する画像編＜ローカルマシン＞ †

学習結果モデル「runs/train/yolov7x_custom/weights/best.pt」を使用する
・テスト画像は「/Images/janken.jpg」を使用する
・ローカルマシン上の「yolov7/」フォルダの場所は「/work/yolov7-main/」

検出コマンドを実行する

(py38a) PS > cd /anaconda_win/work/yolov7-main
(py38a) PS > python detect.py --weights runs/train/yolov7x_custom/weights/best.pt --conf 0.25 --img-size 640 --source ../../Images/janken.jpg

▼　- log -　NVIDIA GeForce GTX 1050 Ti

Namespace(agnostic_nms=False, augment=False, classes=None, conf_thres=0.25, device='', exist_ok=False, img_size=640, iou_thres=0.45, name='exp', no_trace=False, nosave=False, project='runs/detect', save_conf=False, save_txt=False, source='../../Images/janken.jpg', update=False, view_img=False, weights=['runs/train/yolov7x_custom/weights/best.pt'])
YOLOR  2023-4-13 torch 2.0.0+cpu CPU

Fusing layers...
IDetect.fuse
Model Summary: 362 layers, 70795920 parameters, 0 gradients
 Convert model to Traced-model...
 traced_script_module saved!
 model is traced!

C:\Users\izuts\anaconda3\envs\py38a\lib\site-packages\torch\functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at C:\actions-runner\_work\pytorch\pytorch\builder\windows\pytorch\aten\src\ATen\native\TensorShape.cpp:3484.)
  return _VF.meshgrid(tensors, **kwargs)  # type: ignore[attr-defined]
4 goos, 3 chokis, 3 pars, Done. (1546.9ms) Inference, (13.0ms) NMS
 The image with the result is saved in: runs\detect\exp9\janken.jpg
Done. (1.683s)

・推論結果の保存先は「yolov7/runs/detect/exp*」（ * は自動的に振られる番号）

↑

学習結果で推論を実行する動画編＜クラウド・サービス＞ †

学習結果モデル「runs/train/yolov7x_custom/weights/best.pt」を使用する
・テスト画像「janken.mov」を「yolov7/」直下にアップしておく

検出コマンドを実行する

!python detect.py --weights runs/train/yolov7x_custom/weights/best.pt --conf 0.25 --img-size 640 --source janken.mov

▼　- log -　GoogleColab Tesla T4

Namespace(weights=['runs/train/yolov7x_custom/weights/best.pt'], source='janken.mov', img_size=640, conf_thres=0.25, iou_thres=0.45, device='', view_img=False, save_txt=False, save_conf=False, nosave=False, classes=None, agnostic_nms=False, augment=False, update=False, project='runs/detect', name='exp', exist_ok=False, no_trace=False)
YOLOR 🚀 v0.1-122-g3b41c2c torch 2.0.1+cu118 CPU

Fusing layers... 
IDetect.fuse
Model Summary: 362 layers, 70795920 parameters, 0 gradients
 Convert model to Traced-model... 
 traced_script_module saved! 
 model is traced! 

/usr/local/lib/python3.10/dist-packages/torch/functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at ../aten/src/ATen/native/TensorShape.cpp:3483.)
  return _VF.meshgrid(tensors, **kwargs)  # type: ignore[attr-defined]
video 1/1 (1/261) /content/drive/MyDrive/yolov7/janken.mov: 1 par, Done. (1960.0ms) Inference, (17.2ms) NMS
video 1/1 (2/261) /content/drive/MyDrive/yolov7/janken.mov: 1 goo, 1 par, Done. (1725.0ms) Inference, (0.6ms) NMS
video 1/1 (3/261) /content/drive/MyDrive/yolov7/janken.mov: 1 goo, 1 par, Done. (1482.1ms) Inference, (0.5ms) NMS
video 1/1 (4/261) /content/drive/MyDrive/yolov7/janken.mov: 1 par, Done. (1505.9ms) Inference, (0.6ms) NMS
video 1/1 (5/261) /content/drive/MyDrive/yolov7/janken.mov: 1 par, Done. (1483.8ms) Inference, (0.5ms) NMS
video 1/1 (6/261) /content/drive/MyDrive/yolov7/janken.mov: 1 par, Done. (1616.7ms) Inference, (0.8ms) NMS
video 1/1 (7/261) /content/drive/MyDrive/yolov7/janken.mov: 1 par, Done. (1673.4ms) Inference, (1.0ms) NMS
        :
        :
video 1/1 (259/261) /content/drive/MyDrive/yolov7/janken.mov: 1 goo, 1 par, Done. (2029.7ms) Inference, (1.5ms) NMS
video 1/1 (260/261) /content/drive/MyDrive/yolov7/janken.mov: 1 goo, 1 par, Done. (1766.9ms) Inference, (0.6ms) NMS
video 1/1 (261/261) /content/drive/MyDrive/yolov7/janken.mov: 1 goo, 1 par, Done. (1954.2ms) Inference, (1.7ms) NMS
Done. (415.786s)

・推論結果の保存先は「yolov7/runs/detect/exp*」（ * は自動的に振られる番号）
・うまく識別できていない ‼

↑

学習結果で推論を実行する動画編＜ローカルマシン＞ †

学習結果モデル「runs/train/yolov7x_custom/weights/best.pt」を使用する
・テスト画像は「/Videos/janken.mov」を使用する
・ローカルマシン上の「yolov7/」フォルダの場所は「/work/yolov7-main/」

検出コマンドを実行する

(py38a) PS > cd /anaconda_win/work/yolov7-main
(py38a) PS > python detect.py --weights runs/train/yolov7x_custom/weights/best.pt --conf 0.25 --img-size 640 --source ../../Videos/janken.mov

▼　- log -　NVIDIA GeForce GTX 1050 Ti

Namespace(agnostic_nms=False, augment=False, classes=None, conf_thres=0.25, device='', exist_ok=False, img_size=640, iou_thres=0.45, name='exp', no_trace=False, nosave=False, project='runs/detect', save_conf=False, save_txt=False, source='../../Videos/janken.mov', update=False, view_img=False, weights=['runs/train/yolov7x_custom/weights/best.pt'])
YOLOR  2023-4-13 torch 2.0.0+cpu CPU

Fusing layers...
IDetect.fuse
Model Summary: 362 layers, 70795920 parameters, 0 gradients
 Convert model to Traced-model...
 traced_script_module saved!
 model is traced!

video 1/1 (1/261) N:\anaconda_win\work\yolov7-main\..\..\Videos\janken.mov: C:\Users\izuts\anaconda3\envs\py38a\lib\site-packages\torch\functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at C:\actions-runner\_work\pytorch\pytorch\builder\windows\pytorch\aten\src\ATen\native\TensorShape.cpp:3484.)
  return _VF.meshgrid(tensors, **kwargs)  # type: ignore[attr-defined]
1 par, Done. (1264.6ms) Inference, (1.0ms) NMS
video 1/1 (2/261) N:\anaconda_win\work\yolov7-main\..\..\Videos\janken.mov: 1 goo, 1 par, Done. (1082.1ms) Inference, (1.0ms) NMS
video 1/1 (3/261) N:\anaconda_win\work\yolov7-main\..\..\Videos\janken.mov: 1 goo, 1 par, Done. (1141.9ms) Inference, (1.0ms) NMS
video 1/1 (4/261) N:\anaconda_win\work\yolov7-main\..\..\Videos\janken.mov: 1 par, Done. (1119.0ms) Inference, (1.0ms) NMS
video 1/1 (5/261) N:\anaconda_win\work\yolov7-main\..\..\Videos\janken.mov: 1 par, Done. (1136.9ms) Inference, (0.0ms) NMS
video 1/1 (6/261) N:\anaconda_win\work\yolov7-main\..\..\Videos\janken.mov: 1 par, Done. (1169.8ms) Inference, (1.0ms) NMS
video 1/1 (7/261) N:\anaconda_win\work\yolov7-main\..\..\Videos\janken.mov: 1 par, Done. (1118.0ms) Inference, (0.0ms) NMS
        :
        :
video 1/1 (259/261) N:\anaconda_win\work\yolov7-main\..\..\Videos\janken.mov: 1 goo, 1 par, Done. (1052.2ms) Inference, (1.0ms) NMS
video 1/1 (260/261) N:\anaconda_win\work\yolov7-main\..\..\Videos\janken.mov: 1 goo, 1 par, Done. (1071.1ms) Inference, (1.0ms) NMS
video 1/1 (261/261) N:\anaconda_win\work\yolov7-main\..\..\Videos\janken.mov: 1 goo, 1 par, Done. (1065.1ms) Inference, (1.0ms) NMS
Done. (298.257s)

・推論結果の保存先は「yolov7/runs/detect/exp*」（ * は自動的に振られる番号）
・うまく識別できていない ‼

↑

学習結果で推論を実行するカメラ編＜ローカルマシン＞（参考） †

学習結果モデル「runs/train/yolov7x_custom/weights/best.pt」を使用する
・ローカルマシン上の「yolov7/」フォルダの場所は「/work/yolov7-main/」

検出コマンドを実行する

(py38a) PS > cd /anaconda_win/work/yolov7-main
(py38a) PS > python detect.py --weights runs/train/yolov7x_custom/weights/best.pt --conf 0.25 --img-size 640 --source 0

▼　- log -　NVIDIA GeForce GTX 1050 Ti

(py38a) PS N:\anaconda_win\work\yolov7-main> python detect.py --weights runs/train/yolov7x_custom/weights/best.pt --conf 0.25 --img-size 640 --source 0         Namespace(agnostic_nms=False, augment=False, classes=None, conf_thres=0.25, device='', exist_ok=False, img_size=640, iou_thres=0.45, name='exp', no_trace=False, nosave=False, project='runs/detect', save_conf=False, save_txt=False, source='0', update=False, view_img=False, weights=['runs/train/yolov7x_custom/weights/best.pt'])
YOLOR  2023-4-13 torch 2.0.1+cu117 CUDA:0 (NVIDIA GeForce GTX 1050 Ti, 4095.75MB)

Fusing layers...
IDetect.fuse
Model Summary: 362 layers, 70795920 parameters, 0 gradients
 Convert model to Traced-model...
 traced_script_module saved!
 model is traced!

1/1: 0...  success (640x480 at 30.00 FPS).

C:\Users\izuts\anaconda3\envs\py38a\lib\site-packages\torch\functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at C:\actions-runner\_work\pytorch\pytorch\builder\windows\pytorch\aten\src\ATen\native\TensorShape.cpp:3484.)
  return _VF.meshgrid(tensors, **kwargs)  # type: ignore[attr-defined]
0: 7 pars, Done. (134.6ms) Inference, (6.0ms) NMS
0: 8 pars, Done. (131.7ms) Inference, (2.0ms) NMS
0: 8 pars, Done. (140.6ms) Inference, (2.0ms) NMS
0: 8 pars, Done. (133.6ms) Inference, (2.0ms) NMS
0: 8 pars, Done. (134.6ms) Inference, (2.0ms) NMS
0: 8 pars, Done. (134.6ms) Inference, (1.0ms) NMS
    :
    :
0: 9 pars, Done. (135.6ms) Inference, (2.0ms) NMS
Traceback (most recent call last):                  ← 終了できないので 'Ctrl + C' を押しながらウインドウを閉じる
  File "detect.py", line 196, in <module>
    detect()
  File "detect.py", line 70, in detect
    for path, img, im0s, vid_cap in dataset:
  File "N:\anaconda_win\work\yolov7-main\utils\datasets.py", line 327, in __next__

(py38a) PS > ^C

・推論結果の保存先は「yolov7/runs/detect/exp*」（ * は自動的に振られる番号）
・うまく識別できていない ‼
　強制終了なのでファイルは生成されているが読めない！

↑

学習済みモデルを ONNX 形式にコンバートする＜クラウド・サービス＞ †

onnx パッケージをインストールする

!pip install onnx

▼　- log -　GoogleColab Tesla T4

Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Collecting onnx
  Downloading onnx-1.14.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (14.6 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 14.6/14.6 MB 56.6 MB/s eta 0:00:00
Requirement already satisfied: numpy in /usr/local/lib/python3.10/dist-packages (from onnx) (1.22.4)
Requirement already satisfied: protobuf>=3.20.2 in /usr/local/lib/python3.10/dist-packages (from onnx) (3.20.3)
Requirement already satisfied: typing-extensions>=3.6.2.1 in /usr/local/lib/python3.10/dist-packages (from onnx) (4.5.0)
Installing collected packages: onnx
Successfully installed onnx-1.14.0

学習結果モデル「runs/train/yolov7x_custom/weights/best.pt」を onnx 形式に変換する

!python export.py --weights runs/train/yolov7x_custom/weights/best.pt

▼　- log -　GoogleColab Tesla T4

Import onnx_graphsurgeon failure: No module named 'onnx_graphsurgeon'
Namespace(weights='runs/train/yolov7x_custom/weights/best.pt', img_size=[640, 640], batch_size=1, dynamic=False, dynamic_batch=False, grid=False, end2end=False, max_wh=None, topk_all=100, iou_thres=0.45, conf_thres=0.25, device='cpu', simplify=False, include_nms=False, fp16=False, int8=False)
YOLOR 🚀 v0.1-122-g3b41c2c torch 2.0.1+cu118 CPU

Fusing layers... 
IDetect.fuse
Model Summary: 362 layers, 70795920 parameters, 0 gradients

Starting TorchScript export with torch 2.0.1+cu118...
TorchScript export success, saved as runs/train/yolov7x_custom/weights/best.torchscript.pt
CoreML export failure: No module named 'coremltools'

Starting TorchScript-Lite export with torch 2.0.1+cu118...
TorchScript-Lite export success, saved as runs/train/yolov7x_custom/weights/best.torchscript.ptl

Starting ONNX export with onnx 1.14.0...
/content/drive/MyDrive/yolov7/models/yolo.py:582: TracerWarning: Converting a tensor to a Python boolean might cause the trace to be incorrect. We can't record the data flow of Python values, so this value will be treated as a constant in the future. This means that the trace might not generalize to other inputs!
  if augment:
/content/drive/MyDrive/yolov7/models/yolo.py:614: TracerWarning: Converting a tensor to a Python boolean might cause the trace to be incorrect. We can't record the data flow of Python values, so this value will be treated as a constant in the future. This means that the trace might not generalize to other inputs!
  if profile:
/content/drive/MyDrive/yolov7/models/yolo.py:629: TracerWarning: Converting a tensor to a Python boolean might cause the trace to be incorrect. We can't record the data flow of Python values, so this value will be treated as a constant in the future. This means that the trace might not generalize to other inputs!
  if profile:
============= Diagnostic Run torch.onnx.export version 2.0.1+cu118 =============
verbose: False, log level: Level.ERROR
======================= 0 NONE 0 NOTE 0 WARNING 0 ERROR ========================

ONNX export success, saved as runs/train/yolov7x_custom/weights/best.onnx

Export complete (60.86s). Visualize with https://github.com/lutzroeder/netron.

変換結果「runs/train/yolov7x_custom/weights/best.onnx」をローカルマシンにダウンロードする

↑

OpenVINO™ で実行する＜ローカルマシン＞ †

学習済みモデル「best.onnx」を「/work/yolov7/」ディレクトリに「janken_best.onnx」の名前でコピーする
ラベルファイルを「/work/yolov7/」ディレクトリに用意する
・janken.nams
```
gook
choki
par
```
・janken.nams_jp
```
グー
チョキ
パー
```

ローカルマシンのターミナルで実行する

(py38a) PS > cd /anaconda_win/work/yolov7
(py38a) PS > python object_detect_yolo7.py -m janken_best.onnx -l janken.names_jp -i ../../Images/janken.jpg -o out_janken.jpg
Starting..
 - Program title  : Object detection YOLO V7
 - OpenCV version : 4.5.5
 - OpenVINO engine: 2022.1.0-7019-cdb9bec7210-releases/2022/1
 - Input image    : ../../Images/janken.jpg
 - Model          : janken_best.onnx
 - Device         : CPU
 - Label          : janken.names_jp
 - Log level      : 3
 - Title flag     : y
 - Speed flag     : y
 - Processed out  : out_janken.jpg
 - Preprocessing  : False
 - Batch size     : 1
 - number of inf  : 1
 - With grid      : False
FPS average:       0.90
Finished.

カメラ画像の入力（参考）

・うまく識別できていない ‼

(py38a) PS > python object_detect_yolo7.py -m janken_best.onnx -l janken.names_jp -i cam
Starting..
 - Program title  : Object detection YOLO V7
 - OpenCV version : 4.5.5
 - OpenVINO engine: 2022.1.0-7019-cdb9bec7210-releases/2022/1
 - Input image    : cam
 - Model          : janken_best.onnx
 - Device         : CPU
 - Label          : janken.names_jp
 - Log level      : 3
 - Title flag     : y
 - Speed flag     : y
 - Processed out  : non
 - Preprocessing  : False
 - Batch size     : 1
 - number of inf  : 1
 - With grid      : False
FPS average:       0.90
Finished.

↑

ここまでのまとめと今後の課題 †

↑

わかったこと †

手だけの静止画はともかく、背景や動きのある通常の場面での判定は難しい。
これまでのところ、カメラによるリアルタイムの判定は処理速度の点で困難。
いろいろな場面で必要となるカスタムデータの学習は、「YOLO v7」を使用すれば比較的容易に結果を得ることができる。
学習のための状況に即した多くの画像データの収集はかなり難しい。
教師ありのデータセットを作成するためのアノテーション（ラベリング）作業はソフトウェア・ツールがあるものの多くの人的リソースが必要。
ハードウエア環境については、クラウドサービス「Google Colaboratory」を利用できるが無償範囲では制限も多い。

↑

これからの課題 †

実際の運用環境で実用的な結果を得られる「学習済みモデル」を作成するためにはどうするか？
データセットに必要な画像データの収集方法。
学習精度を上げるための手法（評価指標の判定方法と学習時のパラメータ設定など）。
時間のかかる学習作業の有効なやり方。（ローカルマシンに環境を整備する／有料クラウドサービスを契約する … など）

↑

クラウドサービス「Google Colaboratory」について †

リソースの確認
ツールバーの「RAM/ディスク」をクリックすることで現在のリソースの状況を表示できる

割り当てられたGPUの確認

!nvidia-smi

Thu Jun 15 10:41:46 2023       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.85.12    Driver Version: 525.85.12    CUDA Version: 12.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla T4            Off  | 00000000:00:04.0 Off |                    0 |
| N/A   51C    P8    12W /  70W |      0MiB / 15360MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

90分ルールの対策
何も操作せずに90分経つとリセットされるが、Pythonスクリプトをローカルで実行して、1時間毎に「Google Colab」にアクセスすることで回避できる。

## colab.py (Windows 環境の場合)

import time
import datetime
import webbrowser

# 1時間毎に任意のノートブックを開く（90分ルールの対策）
for i in range(12):
    browse = webbrowser.get('windows-default')
    browse.open('<任意のノートブックのURL>')
    print(i, datetime.datetime.today())
    time.sleep(60*60)

「Google ドライブ」を使用する
「90分ルール」「12時間ルール」「GPUの使用制限」でインスタンスがリセットされると、それまで学習したデータも削除されてしまう。
Googleドライブに作業フォルダを作成することで、データを永続化でき、リセットされても再開できるようになる。

※「Google ドライブ」は無償で 15GB まで利用できるが、学習作業などを行うとすぐに容量オーバーする。
　容量アップが望ましい。支払い方法は「Google Play」カードも使用できる。

「Google Colaboratory」GPUの使用制限
無償版では状況によって動的に変化する使用制限があり、インスタンスの最大存続時間などの上限は公開されていない。
制限がかかると 12時間以上（時間は不定）利用できなくなる場合がある。

有償版「Colab Pro/Pro+」契約を行うと多少制限は緩和されるらしい。
契約（月々のサブスクリプト）の支払いはクレジットカード決済による。
手軽に GPU環境が利用できるメリットがあるが、実際の運用にあたっては利用制限に難がある。

↑

ローカルマシンで学習（Training） †

　クラウドサービスを使わずにローカルマシン上で学習できる環境を手持ちのハードウェア上に構築してみる

↑

NVIDIAのGPUアーキテクチャ †

アーキテクチャ (読み方)	プロセスルール	販売開始	採用シリーズ
Kepler (ケプラー)	28nm	2012年	GeForce GTX/GT 600シリーズ
		2012年	GeForce GTX/GT 700シリーズ
		2013年	GeForce GTX TITANシリーズ
Maxwell (マクスウェル)	28nm	2014年	GeForce GTX 700シリーズ
Maxwell (マクスウェル)	28nm	2015年	GeForce GTX 900シリーズ
Pascal (パスカル)	16nm/14nm	2016年	GeForce GTX 10シリーズ
Turing (チューリング)	12nm	2018年	GeForce RTX 20シリーズ
Turing (チューリング)	12nm	2019年	GeForce GTX 16シリーズ
Ampere (アンペア)	8nm	2020年	GeForce RTX 30シリーズ
Ada Lovelace (エイダ・ラブレス)	5nm	2022年	GeForce RTX 40シリーズ

↑

使用するハードウェア †

HP EliteDesk 800 G2 SFF
CPU Intel® Core™ i7-6700 CPU @ 3.40GHz
GPU NVIDIA GeForce GTX 1050 TI 4GB
OS Windows10 Pro

↑

現状確認 †

「Pytorch」のバージョンを調べる

(py38a) PS > python
Python 3.8.16 (default, Mar  2 2023, 03:18:16) [MSC v.1916 64 bit (AMD64)] :: Anaconda, Inc. on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import torch
>>> print(torch.__version__)
2.0.0+cpu
>>> import torchvision
>>> print(torchvision.__version__)
0.15.0+cpu
>>> quit
Use quit() or Ctrl-Z plus Return to exit
>>> ^Z

CPUバージョンなのでアンインストールしておく
```
(py38a) PS > pip uninstall torch torchvision torchaudio
```

↑

NVIDIA ドライバーのインストール †

Pytorch オフィシャルサイトからインストールコマンドを取得
公式サイトGEFORCE® ドライバーから最新バージョンのドライバーをダウンロードしてインストール
CUDA 公式サイトCUDA Zoneから「Download」→「Archive of Previous CUDA Releases」とたどり「Pytorch インストールコマンド」に対応した「CUDA Toolkit」のバージョンををインストール

インストール結果の確認

(py38a) > nvidia-smi
Fri Jun 16 12:32:27 2023
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 516.01       Driver Version: 516.01       CUDA Version: 11.7     |
|-------------------------------+----------------------+----------------------+
| GPU  Name            TCC/WDDM | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ... WDDM  | 00000000:01:00.0  On |                  N/A |
| 45%   44C    P8    N/A /  75W |    146MiB /  4096MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      9488    C+G   C:\Windows\explorer.exe         N/A      |
|    0   N/A  N/A      9736    C+G   ...\PowerToys.FancyZones.exe    N/A      |
|    0   N/A  N/A     11020    C+G   ...5n1h2txyewy\SearchApp.exe    N/A      |
|    0   N/A  N/A     12460    C+G   ...2txyewy\TextInputHost.exe    N/A      |
+-----------------------------------------------------------------------------+

↑

「Pytorch」インストール †

Pytorch オフィシャルサイトのインストールコマンドを実行

(py38a) PS > pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117

▼　- log -　NVIDIA GeForce GTX 1050 Ti

(py38a) > pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117
Looking in indexes: https://download.pytorch.org/whl/cu117
Collecting torch
  Downloading https://download.pytorch.org/whl/cu117/torch-2.0.1%2Bcu117-cp38-cp38-win_amd64.whl (2343.7 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.3/2.3 GB 370.4 kB/s eta 0:00:00
Collecting torchvision
  Downloading https://download.pytorch.org/whl/cu117/torchvision-0.15.2%2Bcu117-cp38-cp38-win_amd64.whl (4.9 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 4.9/4.9 MB 1.8 MB/s eta 0:00:00
Requirement already satisfied: torchaudio in c:\users\izuts\anaconda3\envs\py38a\lib\site-packages (2.0.1+cu117)
Requirement already satisfied: typing-extensions in c:\users\izuts\anaconda3\envs\py38a\lib\site-packages (from torch) (4.5.0)
        :
        :
Requirement already satisfied: mpmath>=0.19 in c:\users\izuts\anaconda3\envs\py38a\lib\site-packages (from sympy->torch) (1.3.0)
Installing collected packages: torch, torchvision, torchaudio
  Attempting uninstall: torchaudio
    Found existing installation: torchaudio 2.0.1+cu117
    Uninstalling torchaudio-2.0.1+cu117:
      Successfully uninstalled torchaudio-2.0.1+cu117
Successfully installed torch-2.0.1+cu117 torchaudio-2.0.2+cu117 torchvision-0.15.2+cu117

インストール結果の確認

(py38a) PS > pip list
Package                 Version
----------------------- ------------
    :
torch                   2.0.1+cu117
torchaudio              2.0.2+cu117
torchvision             0.15.2+cu117
    :

(py38a) > python
Python 3.8.16 (default, Mar  2 2023, 03:18:16) [MSC v.1916 64 bit (AMD64)] :: Anaconda, Inc. on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import torch
>>> print(torch.__version__)
2.0.1+cu117
>>> import torchvision
>>> print(torchvision.__version__)
0.15.2+cu117
>>>

GPU を認識しているかのテストプログラム「workspace_py37/cuda_test.py」

import torch

print(torch.__version__)
print(f"cuda, {torch.cuda.is_available()}")
print(f"compute_{''.join(map(str,(torch.cuda.get_device_capability())))}")
device_num:int = torch.cuda.device_count()
print(f"find gpu devices, {device_num}")
for idx in range(device_num):
    print(f"cuda:{idx}, {torch.cuda.get_device_name(idx)}")

print("end")

・実行結果

(py38a) > python .\cuda_test.py
2.0.1+cu117
cuda, True
compute_61
find gpu devices, 1
cuda:0, NVIDIA GeForce GTX 1050 Ti
end

↑

パッケージの追加 †

「tensorboard」をインストール

(py38a) PS > pip install tensorboard

▼　- log -　NVIDIA GeForce GTX 1050 Ti

Collecting tensorboard
  Downloading tensorboard-2.13.0-py3-none-any.whl (5.6 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 5.6/5.6 MB 11.1 MB/s eta 0:00:00
    :
    :
Collecting oauthlib>=3.0.0
  Downloading oauthlib-3.2.2-py3-none-any.whl (151 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 151.7/151.7 kB 8.8 MB/s eta 0:00:00
Installing collected packages: werkzeug, tensorboard-data-server, pyasn1, oauthlib, grpcio, absl-py, rsa, requests-oauthlib, pyasn1-modules, markdown, google-auth, google-auth-oauthlib, tensorboard
Successfully installed absl-py-1.4.0 google-auth-2.20.0 google-auth-oauthlib-1.0.0 grpcio-1.54.2 markdown-3.4.3 oauthlib-3.2.2 pyasn1-0.5.0 pyasn1-modules-0.3.0 requests-oauthlib-1.3.1 rsa-4.9 tensorboard-2.13.0 tensorboard-data-server-0.7.1 werkzeug-2.3.6

↑

学習 Training を実行 †

データ・セット「janken_dataset」をフォルダごと「yolov7-main/data」にアップする
学習するのに必要な情報を記述したyamlファイル「janken_dataset.yaml」を「yolov7-main」の直下にアップする

下記のコマンドで学習を実行する
GPU ボードのメモリーサイズが 4GB なのでバッチサイズを 8>4>2 と小さくしていきエラーの出ない 2 を採用する

(py38a) PS > python train.py --workers 8 --batch-size 2 --data janken_dataset.yaml --cfg cfg/training/yolov7x.yaml --weights 'yolov7x.pt' --name yolov7x_custom2 --hyp data/hyp.scratch.p5.yaml --epochs 300 --device 0

▼　- log -　NVIDIA GeForce GTX 1050 Ti

(py38a) PS > python train.py --workers 8 --batch-size 2 --data janken_dataset.yaml --cfg cfg/training/yolov7x.yaml --weights 'yolov7x.pt' --name yolov7x_custom2 --hyp data/hyp.scratch.p5.yaml --epochs 300 --device 0
YOLOR  2023-4-13 torch 2.0.1+cu117 CUDA:0 (NVIDIA GeForce GTX 1050 Ti, 4095.75MB)

Namespace(adam=False, artifact_alias='latest', batch_size=2, bbox_interval=-1, bucket='', cache_images=False, cfg='cfg/training/yolov7x.yaml', data='janken_dataset.yaml', device='0', entity=None, epochs=300, evolve=False, exist_ok=False, freeze=[0], global_rank=-1, hyp='data/hyp.scratch.p5.yaml', image_weights=False, img_size=[640, 640], label_smoothing=0.0, linear_lr=False, local_rank=-1, multi_scale=False, name='yolov7x_custom2', noautoanchor=False, nosave=False, notest=False, project='runs/train', quad=False, rect=False, resume=False, save_dir='runs\\train\\yolov7x_custom213', save_period=-1, single_cls=False, sync_bn=False, total_batch_size=2, upload_dataset=False, v5_metric=False, weights='yolov7x.pt', workers=8, world_size=1)
tensorboard: Start with 'tensorboard --logdir runs/train', view at http://localhost:6006/
hyperparameters: lr0=0.01, lrf=0.1, momentum=0.937, weight_decay=0.0005, warmup_epochs=3.0, warmup_momentum=0.8, warmup_bias_lr=0.1, box=0.05, cls=0.3, cls_pw=1.0, obj=0.7, obj_pw=1.0, iou_t=0.2, anchor_t=4.0, fl_gamma=0.0, hsv_h=0.015, hsv_s=0.7, hsv_v=0.4, degrees=0.0, translate=0.2, scale=0.9, shear=0.0, perspective=0.0, flipud=0.0, fliplr=0.5, mosaic=1.0, mixup=0.15, copy_paste=0.0, paste_in=0.15, loss_ota=1
wandb: Install Weights & Biases for YOLOR logging with 'pip install wandb' (recommended)
Overriding model.yaml nc=80 with nc=3

                 from  n    params  module                                  arguments
  0                -1  1      1160  models.common.Conv                      [3, 40, 3, 1]
  1                -1  1     28960  models.common.Conv                      [40, 80, 3, 2]
  2                -1  1     57760  models.common.Conv                      [80, 80, 3, 1]
  3                -1  1    115520  models.common.Conv                      [80, 160, 3, 2]
  4                -1  1     10368  models.common.Conv                      [160, 64, 1, 1]
  5                -2  1     10368  models.common.Conv                      [160, 64, 1, 1]
  6                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]
  7                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]
  8                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]
  9                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]
 10                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]
 11                -1  1     36992  models.common.Conv                      [64, 64, 3, 1]
 12[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]
 13                -1  1    103040  models.common.Conv                      [320, 320, 1, 1]
 14                -1  1         0  models.common.MP                        []  
        :

113                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]
114                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]
115                -1  1   2360320  models.common.Conv                      [512, 512, 3, 1]
116[-1, -3, -5, -7, -8]  1         0  models.common.Concat                    [1]
117                -1  1   1639680  models.common.Conv                      [2560, 640, 1, 1]
118                87  1    461440  models.common.Conv                      [160, 320, 3, 1]
119               102  1   1844480  models.common.Conv                      [320, 640, 3, 1]
120               117  1   7375360  models.common.Conv                      [640, 1280, 3, 1]
121   [118, 119, 120]  1     56144  models.yolo.IDetect                     [3, [[12, 16, 19, 36, 40, 28], [36, 75, 76, 55, 72, 146], [142, 110, 192, 243, 459, 401]], [320, 640, 1280]]
Model Summary: 467 layers, 70828568 parameters, 70828568 gradients

Transferred 630/644 items from yolov7x.pt
Scaled weight_decay = 0.0005
Optimizer groups: 108 .bias, 108 conv.weight, 111 other
train: Scanning 'data\janken_dataset\train\labels.cache' images and labels... 4
val: Scanning 'data\janken_dataset\valid\labels.cache' images and labels... 120

autoanchor: Analyzing anchors... anchors/target = 3.37, Best Possible Recall (BPR) = 1.0000
Image sizes 640 train, 640 test
Using 2 dataloader workers
Logging results to runs\train\yolov7x_custom213
Starting training for 300 epochs...

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
     0/299     3.42G   0.04388    0.0186   0.01564   0.07812         3       64
               Class      Images      Labels           P           R      mAP@.C:\Users\izuts\anaconda3\envs\py38a\lib\site-packages\torch\functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at C:\actions-runner\_work\pytorch\pytorch\builder\windows\pytorch\aten\src\ATen\native\TensorShape.cpp:3484.)
  return _VF.meshgrid(tensors, **kwargs)  # type: ignore[attr-defined]
               Class      Images      Labels           P           R      mAP@.
                 all         120         120       0.536       0.222       0.271      0.0602

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
     1/299      3.4G    0.0368   0.01275   0.01445     0.064         5       64
               Class      Images      Labels           P           R      mAP@.
                 all         120         120      0.0103       0.183     0.00705     0.00127

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
     2/299      3.4G   0.03572  0.009825    0.0137   0.05924         2       64
               Class      Images      Labels           P           R      mAP@.
                 all         120         120       0.183       0.467       0.214      0.0657
        :

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
   298/299      3.4G   0.01384  0.005825  0.003427    0.0231         2       64
               Class      Images      Labels           P           R      mAP@.
                 all         120         120       0.994       0.983       0.995       0.731

     Epoch   gpu_mem       box       obj       cls     total    labels  img_size
   299/299      3.4G   0.01432  0.006242  0.003859   0.02442         5       64
               Class      Images      Labels           P           R      mAP@.
                 all         120         120       0.979       0.995       0.995        0.73
                 goo         120          40       0.987           1       0.995       0.719
               choki         120          40       0.976           1       0.994       0.698
                 par         120          40       0.975       0.985       0.995       0.772
300 epochs completed in 21.154 hours.

Optimizer stripped from runs\train\yolov7x_custom213\weights\last.pt, 142.1MB
Optimizer stripped from runs\train\yolov7x_custom213\weights\best.pt, 142.1MB

正常に終了すると、学習結果は「yolov7-main/runs/train/yolov7x_custom2/」に保存される
・学習結果モデルは「yolov7x_custom2/weights」評価指標は「yolov7x_custom2」
・この学習にかかった時間 → 21時間10分

F1 curve P curve PR curve R curve

↑

発生したエラー・メッセージとその原因 †

ModuleNotFoundError: No module named 'tensorboard'
原因：「tensorboard」モジュールがない

    :
Traceback (most recent call last):
  File "train.py", line 21, in <module>
    from torch.utils.tensorboard import SummaryWriter
  File "C:\Users\XXXXX\anaconda3\envs\py38a\lib\site-packages\torch\utils\tensorboard\__init__.py", line 1, in <module>
    import tensorboard
ModuleNotFoundError: No module named 'tensorboard'

AssertionError: CUDA unavailable, invalid device 0 requested
原因：CUDA ドライバがない、Pythorch が CUDA対応版でない

    :
Traceback (most recent call last):
  File "train.py", line 595, in <module>
    device = select_device(opt.device, batch_size=opt.batch_size)
  File "N:\anaconda_win\work\yolov7-main\utils\torch_utils.py", line 71, in select_device
    assert torch.cuda.is_available(), f'CUDA unavailable, invalid device {device} requested'  # check availability
AssertionError: CUDA unavailable, invalid device 0 requested

Command 'git tag' returned non-zero exit status 128.
原因： --weights 'yolov7x.pt' パラメータで指定されたファイルがない

    :
Traceback (most recent call last):
  File "train.py", line 616, in <module>
    train(hyp, opt, device, tb_writer)
  File "train.py", line 86, in train
    attempt_download(weights)  # download if not found locally
  File "N:\anaconda_win\work\yolov7-main\utils\google_utils.py", line 31, in attempt_download
    tag = subprocess.check_output('git tag', shell=True).decode().split()[-1]
  File "C:\Users\XXXXX\anaconda3\envs\py38a\lib\subprocess.py", line 415, in check_output
    return run(*popenargs, stdout=PIPE, timeout=timeout, check=True,
  File "C:\Users\XXXXX\anaconda3\envs\py38a\lib\subprocess.py", line 516, in run
    raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command 'git tag' returned non-zero exit status 128.

↑