Action Quality Assessment via Hierarchical Pose-guided Multi-stage Contrastive Regression

This is the official implementation of our paper "Action Quality Assessment via Hierarchical Pose-guided Multi-stage Contrastive Regression".

Introduction

Action Quality Assessment (AQA), which aims at automatic and fair evaluation of athletic performance, has gained increasing attention in recent years. However, athletes often move rapidly and the corresponding variations in visual appearance are subtle, making it challenging to capture fine-grained pose differences and leading to poor estimation performance. Furthermore, most common AQA tasks, such as diving, consist of multiple sub-actions, each with a different duration. Existing methods nevertheless segment the video into clips of a fixed number of frames, which disrupts the temporal continuity of sub-actions and results in unavoidable prediction errors. To address these challenges, we propose a novel action quality assessment method based on hierarchical pose-guided multi-stage contrastive regression. First, we introduce a multi-scale dynamic visual-skeleton encoder to capture fine-grained spatio-temporal visual and skeletal features. Then, a procedure segmentation network separates the different sub-actions and produces segmented features. Next, the segmented visual and skeletal features are both fed into a multi-modal fusion module as physics structural priors, guiding the model to learn refined activity similarities and variances. Finally, a multi-stage contrastive learning regression approach is employed to learn discriminative representations and output the prediction results. In addition, we introduce a newly annotated FineDiving-Pose dataset to improve on the current low-quality human pose labels.
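Since the code is not yet released, the following is only a minimal illustrative sketch of two ideas from the paragraph above, not the official implementation. It shows (1) pooling per-frame features over variable-length sub-actions instead of fixed-length clips, and (2) contrastive regression, where a query video is scored relative to an exemplar video with a known score. All names (`split_by_boundaries`, `contrastive_regress`), the boundary indices, the feature sizes, and the linear `weights` standing in for a learned regression head are assumptions made for illustration.

```python
import numpy as np


def split_by_boundaries(frame_feats, boundaries):
    """Mean-pool per-frame features within each variable-length sub-action.

    frame_feats: (T, D) array of per-frame features.
    boundaries: sorted frame indices where a new sub-action starts
        (hypothetical output of a segmentation step; assumed to lie
        strictly between 0 and T so that no segment is empty).
    Returns an array of shape (len(boundaries) + 1, D).
    """
    segments = np.split(frame_feats, boundaries, axis=0)
    return np.stack([seg.mean(axis=0) for seg in segments])


def contrastive_regress(query_stages, exemplar_stages, exemplar_score, weights):
    """Estimate the query score relative to an exemplar with a known score.

    weights: (D,) hypothetical linear regressor standing in for a learned head.
    Returns the predicted score and the per-stage relative scores.
    """
    # One relative score per stage, from the difference of matching stages.
    stage_deltas = (query_stages - exemplar_stages) @ weights
    return exemplar_score + stage_deltas.mean(), stage_deltas


# Toy data: two videos of 96 frames with 8-dimensional features.
rng = np.random.default_rng(0)
query = rng.normal(size=(96, 8))
exemplar = rng.normal(size=(96, 8))

# The sub-action boundaries differ between the two videos.
q = split_by_boundaries(query, [30, 70])
e = split_by_boundaries(exemplar, [25, 60])

score, deltas = contrastive_regress(q, e, exemplar_score=80.0, weights=np.full(8, 0.5))
print(score, deltas)
```

Because each stage is pooled over its own span, the two videos can be compared stage by stage even though their sub-actions have different durations. Averaging the per-stage differences is just one simple aggregation choice for this sketch.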

News

[08 Jan, 2025] We have released the arXiv version of the paper. Code and models are coming soon. Please stay tuned!

Data Preparation

a. We extracted and processed data from the FineDiving dataset.

b. We expanded both the human-annotated and the automatically annotated pose data.

c. We provide some sample pose data in the examples folder for demonstration.

TODO: The extended dataset and code will be available once the paper is accepted. Stay tuned!

Train and Eval

TODO: Code/Models will be provided here.

Citation

If you find this project useful in your research, please consider citing:

@article{qi2025action,
  title={Action Quality Assessment via Hierarchical Pose-guided Multi-stage Contrastive Regression},
  author={Qi, Mengshi and Ye, Hao and Peng, Jiaxuan and Ma, Huadong},
  journal={arXiv preprint arXiv:2501.03674},
  year={2025}
}

Lumos0507/HP-MCoRe
