This is the official PyTorch implementation of our paper:
Improving Unsupervised Video Object Segmentation via Fake Flow Generation, arXiv 2024
Suhwan Cho, Minhyeok Lee, Jungho Lee, DongHyeong Kim, Seunghoon Lee, Sungmin Woo, Sangyoun Lee
Link: [arXiv]
You can also find other related papers at awesome-video-object-segmentation.
In unsupervised VOS, the scarcity of training data has been a significant bottleneck in achieving high segmentation accuracy. Inspired by observations on two-stream approaches, we introduce a novel data generation method based on the depth-to-flow conversion process. With our fake flow generation protocol, large-scale image-flow pairs can be leveraged during network training. To facilitate future research, we also prepare the DUTSv2 dataset, which is an extended version of DUTS, comprising pairs of the original images and the simulated flow maps.
1. Download DUTS, DAVIS, FBMS, YouTube-Objects, and Long-Videos from the official websites.
2. Estimate and save optical flow maps from the videos using RAFT.
3. For DUTS, simulate optical flow maps using DPT.
4. For convenience, I also provide the pre-processed DUTSv2, DAVIS, FBMS, YouTube-Objects, and Long-Videos.
5. Replace dataset paths in "run.py" file with your dataset paths.
1. Open the "run.py" file.
2. Specify the model version.
3. Verify the training settings.
4. Start FakeFlow training!
python run.py --train
1. Open the "run.py" file.
2. Specify the model version.
3. Choose a pre-trained model.
4. Start FakeFlow testing!
python run.py --test
pre-trained model (mitb0)
pre-trained model (mitb1)
pre-trained model (mitb2)
pre-computed results
Code and models are only available for non-commercial research purposes.
If you have any questions, please feel free to contact me :)
E-mail: [email protected]