Interactive_Fusion_for_CAR

PyTorch implementation of Interactive Fusion of Multi-level Features for Compositional Activity Recognition.

Get started

Prerequisite

Our approach has been tested only on Ubuntu with a GPU, and it needs at least 16G of GPU memory. The necessary packages can be installed with the following commands:

conda create -n Interactive_Fusion python=3.6
conda activate Interactive_Fusion
pip install pyyaml matplotlib tensorboardx opencv-python
pip install torch torchvision
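
After installation, a quick sanity check of the environment can save a failed run later. The snippet below is a minimal sketch, not part of this repository: it checks that the packages installed above can be found under their import names (for example, `opencv-python` is imported as `cv2`) and prints the total memory of each visible GPU so it can be compared against the 16G requirement. The function names are hypothetical.

```python
import importlib.util

# Import names (not pip names) of the packages installed above,
# e.g. the pip package "opencv-python" is imported as "cv2".
REQUIRED_MODULES = ["yaml", "matplotlib", "tensorboardX", "cv2", "torch", "torchvision"]


def missing_modules(names):
    """Return the module names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def report_gpu_memory():
    """Print the name and total memory (in GiB) of each visible CUDA device."""
    import torch  # imported lazily so missing_modules() works without torch

    if not torch.cuda.is_available():
        print("No CUDA device is visible to PyTorch.")
        return
    for index in range(torch.cuda.device_count()):
        props = torch.cuda.get_device_properties(index)
        print(f"GPU {index}: {props.name}, {props.total_memory / 1024 ** 3:.1f} GiB")


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        report_gpu_memory()
```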

Preprocess datasets

Something-Else

Download Something-Something Dataset and Something-Else Annotations.
Extract (or softlink) videos under dataset/sth_else/videos, and then dump the frames into dataset/sth_else/frames by the following command:

bash tools/dump_frames_sth.sh

Charades

Download Charades Dataset (scaled to 480p) and Action Genome Annotations.
Extract (or softlink) videos under dataset/charades/videos, put the annotations into dataset/charades/annotations, and then dump the frames into dataset/charades/frames by the following command:

bash tools/dump_frames_char.sh
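
For readers who want to see what frame dumping involves, here is a minimal Python sketch using OpenCV (installed above as `opencv-python`). It is an illustration only and is not the content of `tools/dump_frames_sth.sh` or `tools/dump_frames_char.sh`. The function names and the frame naming scheme (`<video_id>/0001.jpg`, `0002.jpg`, ...) are assumptions; the data loaders in this repository may expect a different layout, so prefer the provided scripts for real runs.

```python
import os


def frame_path(frames_root, video_id, index):
    """Hypothetical naming scheme: <frames_root>/<video_id>/<4-digit index>.jpg"""
    return os.path.join(frames_root, video_id, f"{index:04d}.jpg")


def dump_frames(video_file, frames_root):
    """Decode one video and write every frame as a JPEG; return the frame count."""
    import cv2  # imported lazily; provided by the opencv-python package

    video_id = os.path.splitext(os.path.basename(video_file))[0]
    os.makedirs(os.path.join(frames_root, video_id), exist_ok=True)

    capture = cv2.VideoCapture(video_file)
    count = 0
    while True:
        ok, frame = capture.read()
        if not ok:  # end of the stream, or the video could not be decoded
            break
        count += 1
        cv2.imwrite(frame_path(frames_root, video_id, count), frame)
    capture.release()
    return count
```

Under these assumptions, calling `dump_frames` on a file under dataset/charades/videos with `frames_root="dataset/charades/frames"` would create one sub-directory per video; the same applies to dataset/sth_else.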

Get video information

Collect basic information about each video, such as its height, width, and number of frames. Alternatively, you can download video_info.json from here.

bash tools/get_video_info.sh 'sth_else'
bash tools/get_video_info.sh 'charades'
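
The kind of information gathered in this step can be sketched in Python as follows. This is a hedged illustration, not the implementation behind `tools/get_video_info.sh`: it assumes one sub-directory of `.jpg` frames per video, and the JSON keys (`height`, `width`, `num_frames`) are hypothetical, so the real video_info.json may use a different schema.

```python
import json
import os


def read_image_size(image_path):
    """Return (height, width) of an image, read with OpenCV."""
    import cv2  # imported lazily; provided by the opencv-python package

    image = cv2.imread(image_path)
    if image is None:
        raise ValueError(f"Could not read image: {image_path}")
    height, width = image.shape[:2]
    return height, width


def collect_video_info(frames_root, size_reader=read_image_size):
    """Map each video id to its frame size and frame count (assumed schema)."""
    info = {}
    for video_id in sorted(os.listdir(frames_root)):
        video_dir = os.path.join(frames_root, video_id)
        if not os.path.isdir(video_dir):
            continue
        frames = sorted(f for f in os.listdir(video_dir) if f.lower().endswith(".jpg"))
        if not frames:
            continue  # skip videos whose frames were not dumped
        # All frames of a video share one resolution, so the first is enough.
        height, width = size_reader(os.path.join(video_dir, frames[0]))
        info[video_id] = {"height": height, "width": width, "num_frames": len(frames)}
    return info


def save_video_info(info, output_file):
    """Write the collected information to a JSON file."""
    with open(output_file, "w") as handle:
        json.dump(info, handle, indent=2)
```

The `size_reader` parameter exists only so the directory walk can be exercised without OpenCV; in normal use the default is sufficient.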

Pre-generated files

Some necessary files are available on Baidu Cloud, such as the annotations and video_info.json (which go into dataset) and the data-split settings (which go into dataset_splits). Download them and put them in the same paths.

Train a Standard Model from Scratch

# Compositional setting for Something-Else
python main.py --cfg STHELSE/COM/GT/OURS
# Fewshot setting for Something-Else
python main.py --cfg STHELSE/FEWSHOT/GT/OURS/base
python main.py --cfg STHELSE/FEWSHOT/GT/OURS/5shot
python main.py --cfg STHELSE/FEWSHOT/GT/OURS/10shot
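
To launch several of these configurations one after another, a small wrapper can help. The sketch below is an optional convenience and not part of this repository: it only rebuilds the `python main.py --cfg ...` commands listed above and runs them in the listed order. The helper names and the `dry_run` switch are hypothetical.

```python
import subprocess
import sys

# Configuration names copied from the commands above.
CONFIGS = [
    "STHELSE/COM/GT/OURS",
    "STHELSE/FEWSHOT/GT/OURS/base",
    "STHELSE/FEWSHOT/GT/OURS/5shot",
    "STHELSE/FEWSHOT/GT/OURS/10shot",
]


def build_command(cfg):
    """Build the training command for one configuration."""
    return [sys.executable, "main.py", "--cfg", cfg]


def run_all(configs, dry_run=True):
    """Print each command and, unless dry_run is set, execute it."""
    for cfg in configs:
        command = build_command(cfg)
        print(" ".join(command))
        if not dry_run:
            # check=True stops the sequence as soon as one run fails
            subprocess.run(command, check=True)


if __name__ == "__main__":
    run_all(CONFIGS, dry_run=True)
```

With `dry_run=True` the commands are only printed; set it to `False` from the repository root to actually start training.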

Citation

If you wish to refer to the results of this work, please use the following BibTeX entry.

<!-- @article{yan2020interactive,
  title={Interactive Fusion of Multi-level Features for Compositional Activity Recognition},
  author={Yan, Rui and Xie, Lingxi and Shu, Xiangbo and Tang, Jinhui},
  journal={arXiv preprint arXiv:2012.05689},
  year={2020}
} -->
@article{yan2023progressive,
  title={Progressive instance-aware feature learning for compositional action recognition},
  author={Yan, Rui and Xie, Lingxi and Shu, Xiangbo and Zhang, Liyan and Tang, Jinhui},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  volume={45},
  number={8},
  pages={10317--10330},
  year={2023},
  publisher={IEEE}
}

Acknowledgments

Our code is built on the PyTorch implementation of STIN proposed by joaanna.

Contact Information

If you find any bugs, feel free to create a pull request or contact me by email: ["ruiyan", at, "njust", dot, "edu", dot, "cn"].
