Commit 6d4b2a1 (0 parents)
Showing 37 changed files with 16,355 additions and 0 deletions.
.gitignore
@@ -0,0 +1,10 @@
/data
/logs
/models
/post
.idea
*.pyc
.autoenv*
/*.png
/*.svg
LICENSE
@@ -0,0 +1,21 @@
MIT License

Copyright (c) 2019 Yichao Zhou

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
README.md
@@ -0,0 +1,147 @@
# L-CNN — End-to-End Wireframe Parsing

## Introduction

This repository contains the official PyTorch implementation of L-CNN, a conceptually simple yet effective neural network-based algorithm for detecting the wireframe of a given image. It outperforms the previous state-of-the-art wireframe and line extraction algorithms by a large margin. We hope that this repository serves as an easily reproducible baseline for future research in this area.

## Main Results

### Qualitative Measures

| <img src="figs/000452_LSD.png" width="180"> | <img src="figs/000452_AFM.png" width="180"> | <img src="figs/000452_WF.png" width="180"> | <img src="figs/000452_LCNN.png" width="180"> | <img src="figs/000452_GT.png" width="180"> |
| :--------------------------------------------------: | :-----------------------------------------------: | :-------------------------------------------------: | :------------------------------------------: | :----------------------------------------: |
| [LSD](https://ieeexplore.ieee.org/document/4731268/) | [AFM](https://github.com/cherubicXN/afm_cvpr2019) | [Wireframe](https://github.com/huangkuns/wireframe) | **L-CNN** | Ground Truth |

### Quantitative Measures

The following table reports the performance of several wireframe and line detection algorithms on the [Wireframe dataset](https://github.com/huangkuns/wireframe).

|                                                      | Wireframe (sAP<sup>10</sup>) | Wireframe (AP<sup>H</sup>) | Wireframe (F<sup>H</sup>) | Wireframe (mAP<sup>J</sup>) |
| :--------------------------------------------------: | :--------------------------: | :------------------------: | :-----------------------: | :-------------------------: |
| [LSD](https://ieeexplore.ieee.org/document/4731268/) | /                            | 52.0                       | 61.0                      | /                           |
| [AFM](https://github.com/cherubicXN/afm_cvpr2019)    | 24.4                         | 69.5                       | 77.2                      | 23.3                        |
| [Wireframe](https://github.com/huangkuns/wireframe)  | 5.1                          | 67.8                       | 72.6                      | 40.9                        |
| **L-CNN**                                            | **62.9**                     | **81.0**                   | **83.8**                  | **59.3**                    |
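For intuition, the sketch below (not code from this repository) illustrates the idea behind structural AP (sAP): predictions are sorted by confidence, a predicted segment counts as a true positive when its endpoints are close enough to an unmatched ground-truth segment, and the area under the resulting precision-recall curve is reported; the superscript in sAP<sup>10</sup> is the matching threshold. The exact protocol (image rescaling, distance convention, averaging over images) is implemented in `lcnn/metrics.py` and `eval-sAP.py`, so treat the function below, including its name and the squared-distance matching rule, as an illustrative assumption only.

```python
import numpy as np

def structural_ap_sketch(pred_lines, pred_scores, gt_lines, threshold=10.0):
    """Illustrative sAP-style score for a single image.

    pred_lines: (N, 2, 2) predicted segments, two (x, y) endpoints each
    pred_scores: (N,) confidence scores
    gt_lines: (M, 2, 2) ground-truth segments
    threshold: maximum summed squared endpoint distance for a match
    """
    if len(gt_lines) == 0 or len(pred_lines) == 0:
        return 0.0
    order = np.argsort(-np.asarray(pred_scores))
    matched = np.zeros(len(gt_lines), dtype=bool)
    tp = np.zeros(len(pred_lines))
    fp = np.zeros(len(pred_lines))
    for rank, k in enumerate(order):
        seg = pred_lines[k]
        # Summed squared endpoint distances, trying both endpoint orderings.
        d1 = ((seg - gt_lines) ** 2).sum(axis=(1, 2))
        d2 = ((seg[::-1] - gt_lines) ** 2).sum(axis=(1, 2))
        dist = np.minimum(d1, d2)
        j = int(np.argmin(dist))
        if dist[j] <= threshold and not matched[j]:
            matched[j] = True
            tp[rank] = 1.0
        else:
            fp[rank] = 1.0
    recall = np.cumsum(tp) / len(gt_lines)
    precision = np.cumsum(tp) / np.maximum(np.cumsum(tp) + np.cumsum(fp), 1e-9)
    # Area under the precision-recall curve (trapezoidal approximation).
    return float(np.trapz(precision, recall))
```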
### Precision-Recall Curves

<p align="center">
<img src="figs/PR-APH.svg" width="420">
<img src="figs/PR-sAP10.svg" width="420">
</p>

## Code Structure

Below is a quick overview of the function of each file.

```bash
########################### Data ###########################
figs/
data/                          # default folder for placing the data
    wireframe/                 # folder for Wireframe dataset (Huang et al.)
logs/                          # default folder for storing the output during training
########################### Code ###########################
config/                        # neural network hyper-parameters and configurations
    wireframe.yaml             # default parameters for the Wireframe dataset
dataset/                       # all scripts related to data generation
    wireframe.py               # script for pre-processing the Wireframe dataset to npz
misc/                          # misc scripts that are not important
    draw-wireframe.py          # script for generating figure grids
    lsd.py                     # script for generating npz files for LSD
    plot-sAP.py                # script for plotting sAP10 for all algorithms
lcnn/                          # lcnn module so you can "import lcnn" in other scripts
    models/                    # neural network structures
        hourglass_pose.py      # backbone network (stacked hourglass)
        line_vectorizer.py     # sampler and line verification network
        multitask_learner.py   # network for multi-task learning
    datasets.py                # reading the training data
    metrics.py                 # functions for evaluation metrics
    trainer.py                 # trainer
    config.py                  # global variables for configuration
    utils.py                   # misc functions
eval-sAP.py                    # script for sAP evaluation
eval-APH.py                    # script for APH evaluation
eval-mAPJ.py                   # script for mAPJ evaluation
train.py                       # script for training the neural network
post.py                        # script for post-processing
```
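As a quick illustration (this snippet is not part of the repository), the hyper-parameters live in plain YAML and can be inspected with PyYAML, which is installed during the setup below; the keys used here come from `config/wireframe.yaml`, shown later in this commit.

```python
import yaml

# Load the default Wireframe configuration shipped with the repository.
with open("config/wireframe.yaml") as f:
    cfg = yaml.safe_load(f)

# A few of the settings referenced throughout this README.
print("data directory:", cfg["io"]["datadir"])    # data/wireframe/
print("log directory:", cfg["io"]["logdir"])      # logs/
print("batch size:", cfg["model"]["batch_size"])  # 6
print("learning rate:", cfg["optim"]["lr"])       # 4.0e-4
```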
## Reproducing Results

### Installation

For ease of reproducibility, we suggest installing [miniconda](https://docs.conda.io/en/latest/miniconda.html) (or [anaconda](https://www.anaconda.com/distribution/) if you prefer) before executing the following commands.

```bash
git clone https://github.com/zhou13/lcnn
cd lcnn
conda create -y -n lcnn
source activate lcnn
# Replace cudatoolkit=10.0 with your CUDA version: https://pytorch.org/get-started/
conda install -y pytorch cudatoolkit=10.0 -c pytorch
conda install -y tensorboardx -c conda-forge
conda install -y pyyaml docopt matplotlib scikit-image opencv
mkdir data logs post
```
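Optionally, you can sanity-check the environment before moving on; the snippet below is not part of the repository and only verifies that the installed PyTorch build can see your GPU.

```python
import torch

# Confirm that the conda environment has a CUDA-enabled PyTorch build.
print("PyTorch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU 0:", torch.cuda.get_device_name(0))
```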
### Downloading data

Make sure `curl` is installed on your system and execute

```bash
cd data
../misc/gdrive-download.sh 1T4_6Nb5r4yAXre3lf-zpmp3RbmyP1t9q wireframe.tar.xz
tar xf wireframe.tar.xz
rm *.xz
cd ..
```

If `gdrive-download.sh` does not work for you, you can download the data manually from [Google Drive](https://drive.google.com/drive/u/1/folders/1rXLAh5VIj8jwf8vLfuZncStihRO2chFr) and proceed accordingly.
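Either way, the dataset should end up under `data/wireframe/`, the `datadir` expected by `config/wireframe.yaml`. The quick check below is not part of the repository and assumes the archive contains the pre-processed `.npz` files described in the code-structure overview.

```python
from pathlib import Path

# config/wireframe.yaml points datadir at data/wireframe/.
root = Path("data/wireframe")
assert root.is_dir(), f"{root} is missing -- did the download and extraction succeed?"

npz_files = sorted(root.rglob("*.npz"))
print(f"found {len(npz_files)} .npz files under {root}")
```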
### Training

To train the neural network on GPU 0 (specified by `-d 0`) with the default parameters, execute

```bash
python ./train.py -d 0 --identifier baseline config/wireframe.yaml
```
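If your GPU runs out of memory with the default settings, one option is to train from a modified copy of the configuration. The snippet below is only a sketch (the file name `config/wireframe-small.yaml` is hypothetical) that lowers `batch_size` from its default of 6 before you pass the new file to `train.py`.

```python
import yaml

# Write a copy of the default configuration with a smaller batch size.
with open("config/wireframe.yaml") as f:
    cfg = yaml.safe_load(f)

cfg["model"]["batch_size"] = 2  # default is 6; pick what fits your GPU memory

# Hypothetical file name -- point train.py at it instead of config/wireframe.yaml.
with open("config/wireframe-small.yaml", "w") as f:
    yaml.safe_dump(cfg, f)
```

You would then run `python ./train.py -d 0 --identifier small-batch config/wireframe-small.yaml`.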
### Pre-trained Models

You can download our reference pre-trained model from [Google Drive](https://drive.google.com/file/d/1NvZkEqWNUBAfuhFPNGiCItjy4iU0UOy2). It was trained with `config/wireframe.yaml` for 312k iterations.
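If you want to peek inside the downloaded checkpoint before using it, the sketch below (not part of the repository) lists the keys of the saved dictionary; the file name is a placeholder, and the actual layout of the checkpoint is defined by the training code in `train.py` and `lcnn/trainer.py`.

```python
import torch

# Placeholder file name for the downloaded pre-trained model.
checkpoint = torch.load("pretrained-lcnn.pth.tar", map_location="cpu")

# Checkpoints are usually dictionaries; inspect them before restoring weights.
if isinstance(checkpoint, dict):
    for key in checkpoint:
        print(key, type(checkpoint[key]).__name__)
else:
    print("checkpoint object of type", type(checkpoint).__name__)
```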
### Post Processing

To post-process the output of the neural network (only necessary if you are going to evaluate AP<sup>H</sup>), execute

```bash
python ./post.py --plot logs/RUN/npz/ITERATION post/RUN-ITERATION
```

where `--plot` is an *optional* flag that controls whether the program also generates
images for visualization in addition to the npz files that contain the line information. Replace
`RUN` and `ITERATION` with the values from your training instance.
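The post-processing step writes its results as `.npz` files. The generic snippet below (not part of the repository) simply lists what a given output file contains without assuming any particular array names; the file path is a placeholder you would replace with a real file from `post/RUN-ITERATION`.

```python
import numpy as np

# Placeholder path -- substitute one of the .npz files written by post.py.
with np.load("post/RUN-ITERATION/some-image.npz") as data:
    for name in data.files:
        print(name, data[name].shape, data[name].dtype)
```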
### Evaluation

To evaluate the sAP (recommended) of all your checkpoints under `logs/`, execute

```bash
python eval-sAP.py logs/*/npz/*
```

To evaluate the mAP<sup>J</sup>, execute

```bash
python eval-mAPJ.py logs/*/npz/*
```

To evaluate AP<sup>H</sup>, you first need to post-process your results (see the previous section).
In addition, **MATLAB is required for AP<sup>H</sup> evaluation** and `matlab` should be on your
`$PATH`. The **Parallel Computing Toolbox** is highly recommended because the evaluation uses `parfor`.
After post-processing, execute

```bash
python eval-APH.py post/RUN-ITERATION post/RUN-ITERATION-APH
```

to get the plot. Here `post/RUN-ITERATION-APH` is a temporary directory for storing intermediate
files. Because of the pixel-wise matching, the evaluation of AP<sup>H</sup> **may take up to
an hour** depending on your CPUs.

See the source code of `eval-sAP.py`, `eval-mAPJ.py`, `eval-APH.py`, and `misc/*.py` for more
details on evaluation.
config/wireframe.yaml
@@ -0,0 +1,65 @@
io:
  logdir: logs/
  datadir: data/wireframe/
  resume_from:
  num_workers: 4
  tensorboard_port: 0
  validation_interval: 24000

model:
  image:
    mean: [109.730, 103.832, 98.681]
    stddev: [22.275, 22.124, 23.229]

  batch_size: 6

  # backbone multi-task parameters
  head_size: [[2], [1], [2]]
  loss_weight:
    jmap: 8.0
    lmap: 0.5
    joff: 0.25
    lpos: 1
    lneg: 1

  # backbone parameters
  backbone: stacked_hourglass
  depth: 4
  num_stacks: 2
  num_blocks: 1

  # sampler parameters
  ## static sampler
  n_stc_posl: 300
  n_stc_negl: 40

  ## dynamic sampler
  n_dyn_junc: 300
  n_dyn_posl: 300
  n_dyn_negl: 80
  n_dyn_othr: 600

  # LOIPool layer parameters
  n_pts0: 32
  n_pts1: 8

  # line verification network parameters
  dim_loi: 128
  dim_fc: 1024

  # maximum junction and line outputs
  n_out_junc: 250
  n_out_line: 2500

  # additional ablation study parameters
  use_cood: 0
  use_slop: 0
  use_conv: 0

optim:
  name: Adam
  lr: 4.0e-4
  amsgrad: True
  weight_decay: 1.0e-4
  max_epoch: 24
  lr_decay_epoch: 10
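For illustration only (this is not code from the repository), the `optim` block above maps directly onto a PyTorch Adam optimizer. The interpretation of `lr_decay_epoch` as a single step decay by a factor of 10 is an assumption; the authoritative schedule lives in `train.py` and `lcnn/trainer.py`.

```python
import torch
import yaml

with open("config/wireframe.yaml") as f:
    optim_cfg = yaml.safe_load(f)["optim"]

model = torch.nn.Linear(8, 8)  # stand-in for the real L-CNN network

# Adam with the hyper-parameters from the optim block above.
optimizer = torch.optim.Adam(
    model.parameters(),
    lr=optim_cfg["lr"],                      # 4.0e-4
    weight_decay=optim_cfg["weight_decay"],  # 1.0e-4
    amsgrad=optim_cfg["amsgrad"],            # True
)

# Assumed schedule: divide the learning rate by 10 once at lr_decay_epoch (10),
# then keep training until max_epoch (24).
scheduler = torch.optim.lr_scheduler.MultiStepLR(
    optimizer, milestones=[optim_cfg["lr_decay_epoch"]], gamma=0.1
)
```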