This module is an extension of Faster-RCNN to detect and classify German Traffic Signs. The following animation shows the output of this module.
For installation, I modified the original Faster-RCNN README.md file to adapt changes for run this module. Please check below for license and citation information.
- Requirements: software
- Requirements: hardware
- Basic installation
- Demo
- Beyond the demo: training and testing
- Usage
-
Requirements for
Caffe
andpycaffe
(see: Caffe installation instructions)Note: Caffe must be built with support for Python layers!
# In your Makefile.config, make sure to have this line uncommented WITH_PYTHON_LAYER := 1 # Unrelatedly, it's also recommended that you use CUDNN USE_CUDNN := 1
You can see the sample Makefile.config avialable with this repository. It uses conda with GPU support. You need to modify this file to suit your hardware configuration.
-
Python packages you might not have:
cython
,python-opencv
,easydict
-
[Optional] MATLAB is required for official PASCAL VOC evaluation only. The code now includes unofficial Python evaluation code.
- For training smaller networks (ZF, VGG_CNN_M_1024) a good GPU (e.g., Titan, K20, K40, ...) with at least 3G of memory suffices
- For training Fast R-CNN with VGG16, you'll need a K40 (~11G of memory)
- For training the end-to-end version of Faster R-CNN with VGG16, 3G of GPU memory is sufficient (using CUDNN)
-
Clone the Faster R-CNN repository
# Make sure to clone with --recursive git clone --recursive https://github.com/sridhar912/tsr-py-faster-rcnn.git
-
We'll call the directory that you cloned Faster R-CNN into
FRCN_ROOT
Ignore notes 1 and 2 if you followed step 1 above.
Note 1: If you didn't clone Faster R-CNN with the
--recursive
flag, then you'll need to manually clone thecaffe-fast-rcnn
submodule:git submodule update --init --recursive
Note 2: The
caffe-fast-rcnn
submodule needs to be on thefaster-rcnn
branch (or equivalent detached state). This will happen automatically if you followed step 1 instructions. -
Build the Cython modules
cd $FRCN_ROOT/lib make
-
Build Caffe and pycaffe
cd $FRCN_ROOT/caffe-fast-rcnn # Now follow the Caffe installation instructions here: # http://caffe.berkeleyvision.org/installation.html # If you're experienced with Caffe and have all of the requirements installed # and your Makefile.config in place, then simply do: make -j8 && make pycaffe
-
Download pre-trained detector from this link. This downloaded model need to be placed under the directory
$FRCN_ROOT/data/GTSDB/TrainedModel
After successfully completing basic installation, you'll be ready to run the demo.
To run the demo
cd $FRCN_ROOT
./tools/demo.py
The demo performs detection using a ZF network trained for detection on GTSDB. Few sample images from test dataset has been placed under folder
cd $FRCN_ROOT/data/demo
For the complete testing, test dataset has to be download and placed in the folder mentioned above
Before starting, you need to download the traffic sign datasets from German Traffic Signs Datasets. In this implementation, the training and test datasets that were used for the competition ( training data set (1.1 GB), test data set (500 MB) ) is used.
Here, the main goal is to enable Faster R-CNN to detect and classify traffic sign. So, model performance evaluation in test dataset was not carried out. The downloaded test dataset was only used for visual testing. After the dataset is downloaded, prepare the following directory structure. The training zip file contains the following files
- folders
- images (00000.ppm, 00001.ppm...., 00599.ppm)
- gt.txt
Copy all the images into Images directory as shown below. Rename gt.txt as train.txt and keep both gt.txt and train.txt as shown below.
At first, the dataset must be well organzied with the required format.
GTSDB
|-- Annotations
|-- gt.txt (Annotation files)
|-- Images
|-- *.ppm (Image files)
|-- ImageSets
|-- train.txt
Pre-trained ImageNet models can be downloaded for the three networks described in the paper: ZF and VGG16.
cd $FRCN_ROOT
./data/scripts/fetch_imagenet_models.sh
VGG16 comes from the Caffe Model Zoo, but is provided here for your convenience. ZF was trained at MSRA.
This implementation is tested only for approximate joint training.
To train and test a TSR Faster R-CNN detector using the approximate joint training method, use experiments/scripts/faster_rcnn_end2end.sh
.
Output is written underneath $FRCN_ROOT/output
.
cd $FRCN_ROOT
./experiments/scripts/faster_rcnn_end2end.sh [GPU_ID] [NET] [--set ...] [DATASET]
# GPU_ID is the GPU you want to train on
# NET in {ZF, VGG_CNN_M_1024, VGG16} is the network arch to use
# --set ... allows you to specify fast_rcnn.config options, e.g.
# --set EXP_DIR seed_rng1701 RNG_SEED 1701
# DATASET to be used for training
Example script to train ZF model:
cd $FRCN_ROOT
./experiments/scripts/faster_rcnn_end2end.sh 0 ZF gtsdb
Trained Fast R-CNN networks are saved under:
output/<experiment directory>/<dataset name>/
# Example: output/faster_rcnn_end2end/gtsdb_train
Test outputs are saved under:
output/<experiment directory>/<dataset name>/<network snapshot name>/
The official Faster R-CNN code (written in MATLAB) is available here. If your goal is to reproduce the results in our NIPS 2015 paper, please use the official code.
This repository contains a Python reimplementation of the MATLAB code. This Python implementation is built on a fork of Fast R-CNN. There are slight differences between the two implementations. In particular, this Python port
- is ~10% slower at test-time, because some operations execute on the CPU in Python layers (e.g., 220ms / image vs. 200ms / image for VGG16)
- gives similar, but not exactly the same, mAP as the MATLAB version
- is not compatible with models trained using the MATLAB code due to the minor implementation differences
- includes approximate joint training that is 1.5x faster than alternating optimization (for VGG16) -- see these slides for more information
By Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun (Microsoft Research)
This Python implementation contains contributions from Sean Bell (Cornell) written during an MSR internship.
Please see the official README.md for more details.
Faster R-CNN was initially described in an arXiv tech report and was subsequently published in NIPS 2015.
Faster R-CNN is released under the MIT License (refer to the LICENSE file for details).
If you find Faster R-CNN useful in your research, please consider citing:
@inproceedings{renNIPS15fasterrcnn,
Author = {Shaoqing Ren and Kaiming He and Ross Girshick and Jian Sun},
Title = {Faster {R-CNN}: Towards Real-Time Object Detection
with Region Proposal Networks},
Booktitle = {Advances in Neural Information Processing Systems ({NIPS})},
Year = {2015}
}