Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose (ROS support)

Overview
Setup
- Requirements
- Installation
Usage
Training the Model
Milestones
References

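Overview

This repository adds ROS 1 support to the training code for the paper Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose. To identify the pose of each person in an image, it detects a skeleton, which consists of keypoints and the connections between them. It can estimate up to 18 keypoints in real time: ears, eyes, nose, neck, shoulders, elbows, wrists, hips, knees, and ankles. Using the point cloud from an RGB-D sensor, it can also obtain 3D keypoint coordinates in addition to the 2D ones.

Detectable keypoints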

ID	Variable	Body Part
0	nose	nose
1	neck	neck
2	r_sho	right shoulder
3	r_elb	right elbow
4	r_wri	right wrist
5	l_sho	left shoulder
6	l_elb	left elbow
7	l_wri	left wrist
8	r_hip	right hip
9	r_knee	right knee
10	r_ank	right ankle
11	l_hip	left hip
12	l_knee	left knee
13	l_ank	left ankle
14	r_eye	right eye
15	l_eye	left eye
16	r_ear	right ear
17	l_ear	left ear
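
When consuming the detection results in your own code, it can help to have the table above available as a lookup. The following is a minimal sketch that only mirrors the table; `KEYPOINT_NAMES`, `KEYPOINT_IDS`, and `keypoint_name` are hypothetical names chosen for illustration and are not part of this package.

```python
# Keypoint variable names indexed by ID, copied from the table above.
# These Python names are illustrative and not defined by the package.
KEYPOINT_NAMES = [
    "nose", "neck",
    "r_sho", "r_elb", "r_wri",
    "l_sho", "l_elb", "l_wri",
    "r_hip", "r_knee", "r_ank",
    "l_hip", "l_knee", "l_ank",
    "r_eye", "l_eye", "r_ear", "l_ear",
]

# Reverse lookup: variable name -> ID.
KEYPOINT_IDS = {name: idx for idx, name in enumerate(KEYPOINT_NAMES)}


def keypoint_name(keypoint_id):
    """Return the variable name for a keypoint ID (0 to 17)."""
    if not 0 <= keypoint_id < len(KEYPOINT_NAMES):
        raise ValueError(f"unknown keypoint ID: {keypoint_id}")
    return KEYPOINT_NAMES[keypoint_id]
```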

(Back to top)

Setup

This section explains how to set up this repository.

(Back to top)

Requirements

Prepare the following environment before moving on to the installation steps.

System	Version
Ubuntu	20.04 (Focal Fossa)
ROS	Noetic Ninjemys
Python	3.8
OpenCV	4.9.0
PyTorch	>=0.4.1 (Tested on: 2.2.1)

(Back to top)

Installation

Move to the src folder of your ROS workspace.

$ roscd
# Or run "cd ~/catkin_ws/" instead.
$ cd src/

Clone this repository.

$ git clone https://github.com/TeamSOBITS/lightweight_human_pose_estimation_pytorch

Move into the repository.

$ cd lightweight_human_pose_estimation_pytorch/

Install the dependencies.
```
$ bash install.sh
```

Compile the package.

$ roscd
# Or run "cd ~/catkin_ws/" instead.
$ catkin_make

(Back to top)

Usage

Starting the camera

To use a USB camera (such as a built-in PC camera), run the following command.

roslaunch lightweight_human_pose_estimation camera.launch

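Troubleshooting USB camera errors

If an error like the following occurs:

[ERROR] [1663911409.917317256]: Permission denied opening /dev/bus/usb/001/002

run the following command.

$ sudo chmod o+w /dev/bus/usb/001/002

[!NOTE] The device path /dev/bus/usb/001/002 may differ on your machine. Adjust the command to match the path shown in the error.

Note

If you use an Azure Kinect or RealSense, complete the setup for that camera first and then start it.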

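(Back to top)

Inference parameters

The parameters for 2D inference can be changed in human_pose.launch.

Parameter	Type	Description
checkpoint_path	string	Path to the model weight file
height_size	int	Height used when resizing the input image
cpu	bool	Whether to run keypoint detection on the CPU only (set `false` to use CUDA)
smooth	int	Flag for smoothing keypoints against the previous frame
track	int	Flag for propagating the previous frame's results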
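
To illustrate what `height_size` might control, here is a small sketch that computes the size of an image after it is scaled to a target height. It assumes that `height_size` is the target height in pixels and that the aspect ratio is preserved; this is an assumption about the node's behaviour, not something confirmed here, and `scaled_size` is a hypothetical helper.

```python
def scaled_size(width, height, height_size):
    """Return (new_width, new_height) after scaling to the target height.

    Assumption: `height_size` is the target height and the aspect ratio
    is preserved. Check the node's source for the actual behaviour.
    """
    if width <= 0 or height <= 0 or height_size <= 0:
        raise ValueError("all sizes must be positive")
    scale = height_size / height
    return round(width * scale), height_size
```

Under this assumption, a smaller `height_size` gives the network fewer pixels to process, which trades accuracy for speed.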

Starting pose estimation

Edit human_pose.launch according to the pose estimation features you need.

<!-- Camera RGB Image Raw topic -->
<arg name ="input_image_topic"      default="/camera/rgb/image_raw"/>

<!-- Select the camera base frame -->
<arg name ="base_frame_name"        default="camera_link"/>
<!-- Select the pose_2d topic name -->
<arg name ="pose_2d_topic_name"     default="/human_pose_2d/pose_array"/>
<!-- Select the cloud topic name -->
<arg name ="cloud_topic_name"       default="/camera/depth/points"/>
<!-- Select the camera_info topic name -->
<arg name ="camera_info_topic_name" default="/camera/rgb/camera_info"/>

<!-- Enable 2D Pose detection (true) -->
<arg name ="pose_2d_detect"         default="true"/>
<!-- Show 2D Pose result image (true) -->
<arg name ="pose_2d_img_show"       default="false"/>
<!-- Publish 2D Pose result image (true) -->
<arg name ="pose_2d_img_pub"        default="true"/>
<!-- Show 2D Pose result as log in terminal (true) -->
<arg name ="pose_2d_log_show"       default="false"/>

<!-- Enable 3D Pose detection (true) -->
<arg name ="pose_3d_detect"         default="true"/>
<!-- Publish 3D Pose result as topic (true) -->
<arg name ="pose_3d_topic_pub"      default="false"/>
<!-- Broadcast 3D Pose result as TF (true) -->
<arg name ="pose_3d_tf_pub"         default="true"/>
<!-- Show 3D Pose result as log in terminal (true) -->
<arg name ="pose_3d_log_show"       default="false"/>

Run the launch file.

roslaunch lightweight_human_pose_estimation human_pose.launch
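
Instead of editing the file, roslaunch also accepts `name:=value` overrides for `<arg>` entries on the command line. The sketch below builds such a command from Python; it assumes the arguments listed above are declared at the top level of human_pose.launch, and `build_roslaunch_command` is a hypothetical helper. It only assembles the argument list and does not start anything.

```python
def build_roslaunch_command(package, launch_file, **launch_args):
    """Build a roslaunch argv list with `name:=value` argument overrides."""
    command = ["roslaunch", package, launch_file]
    for name, value in launch_args.items():
        # Launch files use lowercase true/false for boolean arguments.
        if isinstance(value, bool):
            value = "true" if value else "false"
        command.append(f"{name}:={value}")
    return command


command = build_roslaunch_command(
    "lightweight_human_pose_estimation",
    "human_pose.launch",
    input_image_topic="/camera/rgb/image_raw",
    pose_3d_detect=False,
)
print(" ".join(command))
```

The resulting list can be passed to `subprocess.run(command)` on a machine where ROS is sourced.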

Subscribers & Publishers

Subscribers:

Topic	Type	Description
/camera/rgb/image_raw	sensor_msgs/Image	Sensor image
/camera/depth/points	sensor_msgs/PointCloud2	Sensor point cloud
/human_pose_2d/pose_array	lightweight_human_pose_estimation/KeyPoint2DArray	2D keypoint information

Publishers:

Topic	Type	Description
/human_pose_2d/pose_array	lightweight_human_pose_estimation/KeyPoint2DArray	2D keypoint information
/human_pose_2d/pose_img	sensor_msgs/Image	Image with the 2D skeleton drawn
/human_pose_3d/pose_array	lightweight_human_pose_estimation/KeyPoints_3d	3D keypoint information
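
The topic lists suggest that 3D keypoints are derived from the 2D keypoints and the point cloud. One common way to do this, shown here only as a sketch, is to index an organized point cloud with the pixel coordinates of each keypoint. This assumes the cloud is organized (one point per pixel, row-major) and registered to the RGB image; the actual node may work differently, and `lookup_3d_point` is a hypothetical helper.

```python
import math


def lookup_3d_point(points, width, height, u, v):
    """Return the (x, y, z) point behind pixel (u, v), or None.

    `points` is a row-major list of (x, y, z) tuples for an organized
    cloud of `width` x `height` points, assumed to be aligned with the
    RGB image.
    """
    if not (0 <= u < width and 0 <= v < height):
        return None  # keypoint lies outside the image
    x, y, z = points[v * width + u]
    # Depth sensors commonly mark missing measurements with NaN.
    if math.isnan(x) or math.isnan(y) or math.isnan(z):
        return None
    return (x, y, z)
```

Returning `None` for invalid points lets the caller skip keypoints that have no depth measurement instead of publishing NaN coordinates.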

Services

Service	Type	Description
/human_pose_2d/run_ctr	sobits_msgs/RunCtrl	Toggle 2D detection (ON: `true`, OFF: `false`)
/human_pose_3d/run_ctr	sobits_msgs/RunCtrl	Toggle 3D detection (ON: `true`, OFF: `false`)

Training the Model

Details

Prerequisites

Download the COCO 2017 dataset (train, val, annotations) from http://cocodataset.org/#download and extract it into the <COCO_HOME> folder.
Install the required packages.

$ python3 -m pip install -r requirements.txt

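Training

Training consists of three steps (AP values are given for the full validation dataset):

1. Train from the MobileNet weights. Expected AP after this step is ~38%.
2. Train from the weights obtained in the previous step. Expected AP after this step is ~39%.
3. Train from the weights obtained in the previous step, with the number of refinement stages increased to 3. Expected AP after this step is ~40% (for a network with 1 refinement stage, the two following stages are discarded).

Download the pre-trained MobileNet v1 weights mobilenet_sgd_68.848.pth.tar from https://github.com/marvis/pytorch-mobilenet (sgd option). If the model is not available there, download it from GoogleDrive.

Convert the train annotations to the internal format. This produces prepared_train_annotation.pkl, which holds the annotations in the internal format.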

$ python3 scripts/prepare_train_labels.py --labels <COCO_HOME>/annotations/person_keypoints_train2017.json

[Optional] For faster validation, creating a subset of the validation dataset is recommended. This produces val_subset.json, which contains the annotations for 250 random images (out of 5000).

$ python3 scripts/make_val_subset.py --labels <COCO_HOME>/annotations/person_keypoints_val2017.json
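
To show the idea behind the subset, here is a minimal sketch that samples images from a COCO-style annotation dictionary and keeps only their annotations. It is not the code of scripts/make_val_subset.py; `make_val_subset` and its `seed` parameter are hypothetical, and only the standard COCO keys `images`, `annotations`, `id`, and `image_id` are assumed.

```python
import random


def make_val_subset(labels, num_images=250, seed=0):
    """Return a copy of a COCO-style dict restricted to random images.

    `labels` needs the standard COCO keys "images" (each with "id") and
    "annotations" (each with "image_id"). Other keys are copied as-is.
    """
    rng = random.Random(seed)
    images = labels["images"]
    chosen = rng.sample(images, min(num_images, len(images)))
    chosen_ids = {image["id"] for image in chosen}

    subset = dict(labels)  # shallow copy keeps e.g. "categories"
    subset["images"] = chosen
    subset["annotations"] = [
        ann for ann in labels["annotations"] if ann["image_id"] in chosen_ids
    ]
    return subset
```

Fixing the seed makes the subset reproducible, so AP values from different training runs are computed on the same images.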

Train from the MobileNet weights.

$ python3 train.py --train-images-folder <COCO_HOME>/train2017/ --prepared-train-labels prepared_train_annotation.pkl --val-labels val_subset.json --val-images-folder <COCO_HOME>/val2017/ --checkpoint-path <path_to>/mobilenet_sgd_68.848.pth.tar --from-mobilenet

Next, train from the checkpoint of the previous step.

$ python3 train.py --train-images-folder <COCO_HOME>/train2017/ --prepared-train-labels prepared_train_annotation.pkl --val-labels val_subset.json --val-images-folder <COCO_HOME>/val2017/ --checkpoint-path <path_to>/checkpoint_iter_420000.pth --weights-only

Finally, train from the checkpoint of the previous step with a network of three refinement stages. The checkpoint after 370000 iterations was taken as the final one.

$ python3 train.py --train-images-folder <COCO_HOME>/train2017/ --prepared-train-labels prepared_train_annotation.pkl --val-labels val_subset.json --val-images-folder <COCO_HOME>/val2017/ --checkpoint-path <path_to>/checkpoint_iter_280000.pth --weights-only --num-refinement-stages 3

The best checkpoint was not specifically selected, so similar results may be achievable with fewer iterations.

Validation

Run the following.

$ python3 val.py --labels <COCO_HOME>/annotations/person_keypoints_val2017.json --images-folder <COCO_HOME>/val2017 --checkpoint-path <CHECKPOINT>

Pre-trained model

The model expects a normalized image (mean=[128, 128, 128], scale=[1/256, 1/256, 1/256]) in planar BGR format. The model pre-trained on COCO is checkpoint_iter_370000.pth; it achieves 40% AP on the COCO validation set (38.6% on the val subset).
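
The input format described above can be sketched as follows: subtract the mean, multiply by the scale, and reorder the interleaved H x W x 3 image into three planes. Plain nested lists are used so the sketch has no dependencies; real code would normally use NumPy arrays. `normalize_to_planar` is a hypothetical helper, not a function of this package.

```python
def normalize_to_planar(image_hwc, mean=128.0, scale=1.0 / 256.0):
    """Convert an interleaved H x W x 3 BGR image to planar 3 x H x W floats.

    Each value becomes (value - mean) * scale, using the mean and scale
    quoted above. The channel order (B, G, R) is left unchanged.
    """
    height = len(image_hwc)
    width = len(image_hwc[0])
    return [
        [
            [(image_hwc[y][x][c] - mean) * scale for x in range(width)]
            for y in range(height)
        ]
        for c in range(3)
    ]
```

With these values, 8-bit pixel intensities in [0, 255] map to roughly [-0.5, 0.5).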

Python demo

The Python demo is provided for a quick preview of results. For the best performance, consider the C++ demo. To run the Python demo from a webcam:

$ cd lightweight-human-pose-estimation/script
$ python3 demo.py --checkpoint-path checkpoints/checkpoint_iter_370000.pth --video 0

Milestones

OSS
- Improve the documentation
- Unify the coding style

See the Issues page for current bugs and feature requests.

References

Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose

@inproceedings{osokin2018lightweight_openpose,
    author={Osokin, Daniil},
    title={Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose},
    booktitle = {arXiv preprint arXiv:1811.12004},
    year = {2018}
}
