An unofficial implementation of "RMT: Retentive Networks Meet Vision Transformers". I created this repo to exercise my paper-to-code translation skills while waiting for the official implementation to be published at https://github.com/qhfan/RMT.
RMT is an architecture that adopts the retention mechanism proposed by Sun et al. in the paper "Retentive Network: A Successor to Transformer for Large Language Models" and serves capably as a general-purpose backbone for computer vision. It extends the retention mechanism from unidirectional, one-dimensional data (sequential data such as text) to bidirectional, two-dimensional data (images). Moreover, unlike the original Retentive Network, RMT does not use different representations for training and inference, because the recurrent form greatly disrupts the parallelism of the model and results in very slow inference.
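To give a rough feel for what bidirectional, two-dimensional retention looks like, below is a minimal sketch of the parallel form over flattened image tokens, where the decay is driven by the Manhattan distance between spatial positions. The function names, the default gamma, and the exact placement of the softmax and decay mask are my own simplifications for illustration, not the implementation used in this repo; the real architecture adds further details that this sketch deliberately omits.

```python
import torch


def manhattan_decay_mask(height, width, gamma):
    """Bidirectional 2D decay mask: gamma ** Manhattan distance between token positions.

    Hypothetical sketch; the actual decay schedule in RMT may differ.
    """
    ys, xs = torch.meshgrid(torch.arange(height), torch.arange(width), indexing="ij")
    pos = torch.stack([ys.flatten(), xs.flatten()], dim=-1).float()   # (N, 2), N = H * W
    dist = (pos[:, None, :] - pos[None, :, :]).abs().sum(-1)          # (N, N) Manhattan distance
    return gamma ** dist


def retention_2d(q, k, v, height, width, gamma=0.9):
    """Parallel (non-recurrent) bidirectional retention over flattened image tokens.

    q, k, v: (batch, heads, N, dim) with N = height * width.
    """
    mask = manhattan_decay_mask(height, width, gamma).to(q.device, q.dtype)
    scores = (q @ k.transpose(-2, -1)) * q.shape[-1] ** -0.5          # (B, heads, N, N)
    scores = scores.softmax(dim=-1) * mask                            # weight by spatial decay
    return scores @ v


if __name__ == "__main__":
    # Tiny usage example on random tokens from a 7x7 feature map.
    B, heads, H, W, dim = 2, 4, 7, 7, 32
    q = torch.randn(B, heads, H * W, dim)
    k = torch.randn(B, heads, H * W, dim)
    v = torch.randn(B, heads, H * W, dim)
    out = retention_2d(q, k, v, H, W)
    print(out.shape)  # torch.Size([2, 4, 49, 32])
```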
RMT achieves strong performance on COCO object detection (51.6 box AP and 45.9 mask AP) and ADE20K semantic segmentation (52.0 mIoU), surpassing previous models by a large margin.
This repo was created by forking the mmpretrain repository (https://github.com/open-mmlab/mmpretrain).
Below are quick steps for installation:
conda create -n open-mmlab python=3.8 pytorch==1.10.1 torchvision==0.11.2 cudatoolkit=11.3 -c pytorch -y
conda activate open-mmlab
pip install openmim
git clone https://github.com/open-mmlab/mmpretrain.git
cd mmpretrain
mim install -e .
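As a quick sanity check that the editable install worked (assuming the environment created above is active), you can try importing the package and printing its version:

python -c "import mmpretrain; print(mmpretrain.__version__)"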
Please refer to the installation documentation for more detailed installation steps and dataset preparation.
To train the RMT model, you can use tools/train.py. Here is the full usage of the script:
python tools/train.py ${CONFIG_FILE} [ARGS]
where CONFIG_FILE is the path to the config file. Several predefined config files are available inside the configs/rmt folder. One example is rmt-tiny_b128_cifar10.py, which trains the tiny RMT configuration with a batch size of 128 on the CIFAR-10 dataset.
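For example, a run with that config could look like the following (the --work-dir argument is optional and only sets where logs and checkpoints are saved; the path here is just an illustrative choice):

python tools/train.py configs/rmt/rmt-tiny_b128_cifar10.py --work-dir work_dirs/rmt-tiny_b128_cifar10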
Please refer to these tutorials about the basic usage of MMPretrain for new users:
- Learn about Configs
- Prepare Dataset
- Inference with existing models
- Train
- Test
- Downstream tasks
- MMPretrain Documentation.
MMPreTrain is an open source project contributed to by researchers and engineers from various colleges and companies. Thanks to all the contributors who implemented their methods or added new features, as well as users who gave valuable feedback. I would also like to thank the authors for writing such a wonderful paper.
If you find this project useful in your research, please consider citing:
@misc{rmt-unofficial,
    title={RMT Unofficial Implementation},
    author={Farros Alferro},
    howpublished={\url{https://github.com/farrosalferro/RMT-unofficial}},
    year={2023}
}
This project is released under the Apache 2.0 license.