
# Pose Classification Kit

[![PyPI][PyPI-shield]][PyPI-url] [![PyV][PyV-shield]][PyV-url] [![lint][lint-shield]][lint-url] [![linkedin][linkedin-shield]][linkedin-url]

Banner

This Python package focuses on the deployment of gesture control systems. It eases dataset creation, model evaluation, and processing-pipeline deployment. The critical element of the proposed processing architecture is the intermediate representation of human bodies as keypoints, which allows efficient classification. In addition to the main application, the package contains two datasets for body/hands pose classification, several classification models, and data augmentation tools that can be accessed through an API. Feel free to check out the [**drone-gesture-control repository**](https://github.com/ArthurFDLR/drone-gesture-control) for a deployment example on the Jetson Nano using this package.

- [Getting Started](#getting-started)
  - [Step 1 - Install the package](#step-1---install-the-package)
    - [Using PyPI](#using-pypi)
    - [From source](#from-source)
  - [Step 2 - Install OpenPose](#step-2---install-openpose)
  - [Step 3 - Launch application](#step-3---launch-application)
  - [Step 4 - Create new classification models](#step-4---create-new-classification-models)
- [Demonstrations](#demonstrations)
- [User guide](#user-guide)
  - [Real-time pose classification](#real-time-pose-classification)
  - [Create and manipulate datasets](#create-and-manipulate-datasets)
  - [Additional scripts](#additional-scripts)
- [Documentation](#documentation)
  - [Body datasets](#body-datasets)
  - [Data augmentation](#data-augmentation)
- [License](#license)

## Getting Started

### Step 1 - Install the package

#### Using PyPI

Run the following command to install the whole package in the desired Python environment:

```
pip install pose-classification-kit[app]
```

If you don't plan to use the application but just want access to the datasets and pre-trained models:

```
pip install pose-classification-kit
```

#### From source

Ensure that [`Poetry`](https://poetry.eustace.io/) is installed for Python 3.7 and above on your system.

1. Clone the repository

    ```
    git clone https://github.com/ArthurFDLR/pose-classification-kit.git
    cd pose-classification-kit
    ```

2. Create the virtual environment and install the dependencies

    ```
    python -m poetry install
    ```

### Step 2 - Install OpenPose

The dataset creation and real-time model evaluation application relies heavily on the pose estimation system [**OpenPose**](https://github.com/CMU-Perceptual-Computing-Lab/openpose), which must be installed on your system to allow real-time gesture classification. This step is not required if you don't plan to use the application.

1. Follow the [OpenPose installation instructions](https://github.com/CMU-Perceptual-Computing-Lab/openpose/tree/master/doc/installation).
2. Once the installation is completed, set the variable `OPENPOSE_PATH` in [`.\pose_classification_kit\config.py`](https://github.com/ArthurFDLR/pose-classification-kit/blob/master/pose_classification_kit/config.py) to the location of the OpenPose installation folder on your system.

### Step 3 - Launch application

If you installed all optional dependencies, you should now be able to run the application. See the [user guide](#user-guide) for details on how to use the app.

```
pose-classification-app
```

### Step 4 - Create new classification models

The [`.\examples`](https://github.com/ArthurFDLR/pose-classification-kit/blob/master/examples) folder contains Jupyter notebooks detailing the use of the API to create new classification models. Note that these notebooks can also be executed on Google Colab.

## Demonstrations

Demonstration video 1

Demonstration video 2

## User guide

### Real-time pose classification

The video stream of the selected camera is fed to OpenPose at all times, and the analysis results are displayed on the left side of the application. Choose one of the available models in the drop-down menu at the bottom of the analysis panel. The keypoints extracted from the video by OpenPose are automatically normalized and fed to the classifier.

### Create and manipulate datasets

First, either load an existing set of samples or create a new one for a specific label and hand side by choosing *Open (Ctrl+O)* or *Create new (Ctrl+N)* respectively in the *Dataset* menu. When creating a new class, a configuration window asks for the hand side, the label, and the accuracy threshold of the new sample set. The accuracy threshold defines the minimum accuracy of OpenPose's hand-keypoint detection for any sample in the set; this accuracy is displayed on top of the keypoints graph.

Once a set is loaded in the application, you can record new samples from your video feed, or inspect the set and delete inadequate samples. When you are done, save the set through *Dataset -> Save (Ctrl+S)*.

### Additional scripts

Some functionalities are currently unavailable through the GUI:

- You can export all dataset samples from [`.\pose_classification_kit\datasets\Body`](https://github.com/ArthurFDLR/pose-classification-kit/tree/master/pose_classification_kit/datasets/Body) and [`.\pose_classification_kit\datasets\Hands`](https://github.com/ArthurFDLR/pose-classification-kit/tree/master/pose_classification_kit/datasets/Hands) into two respective CSV files.

  ```
  export-datasets
  ```

- You can generate videos similar to [this one](https://youtu.be/FK-1G749cIo) ([`.\pose-classification-kit\scripts\video_creation.py`](https://github.com/ArthurFDLR/OpenHand-App/tree/master/pose-classification-kit/scripts/video_creation.py) might need some modification to fit your use case). 🚧 Currently not functional 🚧

  ```
  video-overlay
  ```

## Documentation

### Body datasets

The body dataset contains 20 classes of between 500 and 600 samples each, for a total of 10,680 entries. Even though the number of samples varies from one class to another in the raw dataset, the API yields a balanced dataset of 503 samples per class. By default, 20% of these samples are reserved for the final testing of the model.

Each entry in the dataset is an array of 25 2D keypoint coordinates whose mapping follows the BODY25 body model. The dataset was created with the BODY25 representation because it is one of the most comprehensive standard body models. However, some pose estimation models, such as the one used on the Jetson Nano, use an 18-keypoint representation (BODY18). The seven missing keypoints do not strongly influence classification: six of them describe the feet, and the last one is a central hip keypoint. Still, the dataset must be converted to the BODY18 representation, which is done by reindexing the samples based on a comparison of the mappings of the two body models. You can choose which body model to use when importing the dataset with the API, as illustrated in the sketch below.

Full body classes

Body models
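
To illustrate the API calls described above, here is a minimal sketch of loading the balanced body dataset in both representations. It reuses the `bodyDataset` call from the data augmentation example below; the `BODY25` import and the printed shapes are assumptions about the exact exports of `pose_classification_kit.datasets`, so check the module (or the notebooks in `./examples`) for the names available in your version.

```python
from pose_classification_kit.datasets import BODY18, BODY25, bodyDataset  # BODY25 export assumed

# Balanced dataset in the native 25-keypoint representation.
dataset_25 = bodyDataset(testSplit=.2, shuffle=True, bodyModel=BODY25)

# Same dataset reindexed to the 18-keypoint representation (e.g. for the Jetson Nano pipeline).
dataset_18 = bodyDataset(testSplit=.2, shuffle=True, bodyModel=BODY18)

x_train = dataset_18['x_train']          # training samples: 18 keypoints, 2D coordinates each
y_train = dataset_18['y_train_onehot']   # one-hot encoded labels for the 20 body classes
print(x_train.shape, y_train.shape)
```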

### Data augmentation

The data augmentation tool currently supports the following operations:

- **Scaling**: a random scaling factor, drawn from a normal distribution of mean 0 and standard deviation σₛ, is applied to all coordinates of the sample.
- **Rotation**: a rotation of an angle randomly drawn from a normal distribution of mean 0 and standard deviation σᵣ is applied to the sample.
- **Noise**: Gaussian noise of standard deviation σₙ is added to the coordinates of the sample.
- **Remove keypoints**: a pre-defined or random list of keypoints is removed (coordinates set to 0) from the sample.
The example below applies the following set of augmentations to the training split:

| Augmentation Ratio | σₛ   | σᵣ   | σₙ   | Remove keypoints |
|--------------------|------|------|------|------------------|
| 10%                | 0.08 | 0.0  | 0.0  | None             |
| 10%                | 0.0  | 10.0 | 0.0  | None             |
| 15%                | 0.0  | 0.0  | 0.03 | Legs             |
| 15%                | 0.0  | 0.0  | 0.03 | Legs & Hip       |
| 20%                | 0.0  | 0.0  | 0.03 | 2 random         |
```python
import numpy as np

from pose_classification_kit.datasets import BODY18, bodyDataset, dataAugmentation

dataset = bodyDataset(testSplit=.2, shuffle=True, bodyModel=BODY18)
x_train = dataset['x_train']
y_train = dataset['y_train_onehot']
x, y = [x_train], [y_train]

# Scaling augmentation
x[len(x):], y[len(y):] = tuple(zip(dataAugmentation(
    x_train, y_train, augmentation_ratio=.1,
    scaling_factor_standard_deviation=.08,
)))

# Rotation augmentation
x[len(x):], y[len(y):] = tuple(zip(dataAugmentation(
    x_train, y_train, augmentation_ratio=.1,
    rotation_angle_standard_deviation=10,
)))

# Upper-body augmentation
lowerBody_keypoints = np.where(np.isin(BODY18.mapping, [
    "left_knee", "right_knee", "left_ankle", "right_ankle"
]))[0]
x[len(x):], y[len(y):] = tuple(zip(dataAugmentation(
    x_train, y_train, augmentation_ratio=.15,
    remove_specific_keypoints=lowerBody_keypoints,
    random_noise_standard_deviation=.03
)))

lowerBody_keypoints = np.where(np.isin(BODY18.mapping, [
    "left_knee", "right_knee", "left_ankle", "right_ankle",
    "left_hip", "right_hip",
]))[0]
x[len(x):], y[len(y):] = tuple(zip(dataAugmentation(
    x_train, y_train, augmentation_ratio=.15,
    remove_specific_keypoints=lowerBody_keypoints,
    random_noise_standard_deviation=.03
)))

# Random partial input augmentation
x[len(x):], y[len(y):] = tuple(zip(dataAugmentation(
    x_train, y_train, augmentation_ratio=.2,
    remove_rand_keypoints_nbr=2,
    random_noise_standard_deviation=.03
)))

x_train_augmented = np.concatenate(x, axis=0)
y_train_augmented = np.concatenate(y, axis=0)
```
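
To show where the augmented arrays fit in a typical workflow, here is a minimal training sketch using TensorFlow/Keras, reusing `x_train_augmented` and `y_train_augmented` from the block above. The architecture, layer sizes, and hyper-parameters are illustrative choices and do not correspond to the pre-trained models shipped with the package; see the notebooks in [`.\examples`](https://github.com/ArthurFDLR/pose-classification-kit/blob/master/examples) for the actual model-creation workflow.

```python
import tensorflow as tf

num_classes = y_train_augmented.shape[-1]  # 20 classes in the bundled body dataset

# Illustrative fully-connected classifier operating on flattened keypoint coordinates.
model = tf.keras.Sequential([
    tf.keras.Input(shape=x_train_augmented.shape[1:]),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dropout(0.2),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(num_classes, activation="softmax"),
])

model.compile(
    optimizer="adam",
    loss="categorical_crossentropy",  # labels are one-hot encoded
    metrics=["accuracy"],
)

model.fit(
    x_train_augmented, y_train_augmented,
    epochs=30, batch_size=32, validation_split=0.1,
)
```

The 20% test split that `bodyDataset` holds out can then be used for the final evaluation of the trained model.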

## License

Distributed under the MIT License. See [`LICENSE`](https://github.com/ArthurFDLR/pose-classification-kit/blob/main/LICENSE) for more information.

[PyPI-shield]: https://img.shields.io/pypi/v/pose-classification-kit?style=for-the-badge
[PyPI-url]: https://pypi.org/project/pose-classification-kit/
[PyV-shield]: https://img.shields.io/badge/python-3.7%20%7C%203.8%20%7C%203.9-blue?style=for-the-badge
[PyV-url]: https://github.com/ArthurFDLR/pose-classification-kit/blob/master/pyproject.toml
[lint-shield]: https://img.shields.io/badge/code%20style-black-000000.svg?style=for-the-badge
[lint-url]: https://github.com/psf/black
[license-shield]: https://img.shields.io/github/license/ArthurFDLR/OpenHand-Classifier?style=for-the-badge
[license-url]: https://github.com/ArthurFDLR/OpenHand-Classifier/blob/master/LICENSE
[linkedin-shield]: https://img.shields.io/badge/-LinkedIn-black.svg?style=for-the-badge&logo=linkedin&colorB=555
[linkedin-url]: https://linkedin.com/in/arthurfdlr/