Fork

This is a fork of "Hierarchical Deep Stereo Matching on High Resolution Images" to support newer Python, PyTorch and TorchVision.

The original implementation only works correctly with torchvision 0.2.0, and it is broken in 0.2.1. It can be found at https://github.com/gengshan-y/high-res-stereo.

This implementation has been tested to work correctly with:

python 2.7.x, 3.7.x
PyTorch 0.4.0, 0.4.1, 1.0.1 and 1.1.0
torchvision 0.2.0, 0.2.1 and 0.3.0

Hierarchical Deep Stereo Matching on High Resolution Images

Architecture:

Qualitative results on Middlebury (refer to project webpage for more results)

Performance on Middlebury benchmark (y-axis: the lower the better)

Weights

Download

Data

train/val

test

High-res-real-stereo (HR-RS): comming soon

Train

Download and extract training data in folder /d/. Training data include Middlebury train set, HR-VS, KITTI-12/15 and SceneFlow.
Run

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py --maxdisp 384 --batchsize 24 --database /d/ --logname log1 --savemodel /somewhere/  --epochs 10

Evalute on Middlebury additional images and KITTI validation set. After 10 epochs, average error on Middlebury additional images with half-res should be around 4.6 (excluding Shopvac).

Inference

Example:

CUDA_VISIBLE_DEVICES=3 python submission.py --datapath ./data-mbtest/   --outdir ./mboutput --loadmodel ./weights/final-768px.tar  --testres 1 --clean 0.8 --max_disp -1

Evaluation:

CUDA_VISIBLE_DEVICES=3 python submission.py --datapath ./data-HRRS/   --outdir ./output --loadmodel ./weights/final-768px.tar  --testres 0.5
python eval_disp.py --indir ./output --gtdir ./data-HRRS/

And use cvkit to visualize in 3D.

Example outputs

left image

3D projection

disparity map

uncertainty map (brighter->higher uncertainty)

Parameters

testres: 1 is full resolution, and 0.5 is half resolution, and so on
max_disp: maximum disparity range to search
clean: threshold of cleaning. clean=0 means removing all the pixels.

Citation

@InProceedings{yang2019hsm,
author = {Yang, Gengshan and Manela, Joshua and Happold, Michael and Ramanan, Deva},
title = {Hierarchical Deep Stereo Matching on High-Resolution Images},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2019}
}

Acknowledgement

Part of the code is borrowed from MiddEval-SDK, PSMNet, FlowNetPytorch and pytorch-semseg.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
data-mbtest/CrusadeP		data-mbtest/CrusadeP
dataloader		dataloader
mboutput		mboutput
models		models
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
architecture.png		architecture.png
eval_disp.py		eval_disp.py
middlebury-benchmark.png		middlebury-benchmark.png
submission.py		submission.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fork

Hierarchical Deep Stereo Matching on High Resolution Images

Weights

Data

train/val

test

Train

Inference

Example outputs

Parameters

Citation

Acknowledgement

About

Releases

Packages

Languages

License

SorcererX/high-res-stereo

Folders and files

Latest commit

History

Repository files navigation

Fork

Hierarchical Deep Stereo Matching on High Resolution Images

Weights

Data

train/val

test

Train

Inference

Example outputs

Parameters

Citation

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages