msracver / FCIS
- суббота, 13 мая 2017 г. в 03:12:55
Cuda
Fully Convolutional Instance-aware Semantic Segmentation
The major contributors of this repository include Haozhi Qi, Yi Li, Guodong Zhang, Haochen Zhang, Jifeng Dai, and Yichen Wei.
FCIS is a fully convolutional end-to-end solution for instance segmentation, which won the first place in COCO segmentation challenge 2016.
FCIS is initially described in a CVPR 2017 spotlight paper. It is worth noticing that:
This is an official implementation for Fully Convolutional Instance-aware Semantic Segmentation (FCIS) based on MXNet. It is worth noticing that:
© Microsoft, 2017. Licensed under an Apache-2.0 license.
If you find FCIS useful in your research, please consider citing:
@inproceedings{li2016fully,
Author = {Yi Li, Haozhi Qi, Jifeng Dai, Xiangyang Ji and Yichen Wei}
Title = {Fully Convolutional Instance-aware Semantic Segmentation},
Conference = {CVPR},
year = {2017}
}
training data | testing data | mAP^r@0.5 | mAP^r@0.7 | time | |
---|---|---|---|---|---|
FCIS, ResNet-v1-101 | VOC 2012 train | VOC 2012 val | 66.0 | 51.9 | 0.23s |
training data | testing data | mAP | mAP@0.5 | mAP@0.75 | mAP@S | mAP@M | mAP@L | |
---|---|---|---|---|---|---|---|---|
FCIS, ResNet-v1-101, OHEM | coco trainval35k | coco minival | 28.7 | 50.5 | 28.8 | 7.7 | 31.0 | 50.1 |
FCIS, ResNet-v1-101, OHEM | coco trainval35k | coco test-dev | 29.0 | 51.2 | 29.5 | 7.7 | 30.6 | 48.9 |
Running time is counted on a single Maxwell Titan X GPU (mini-batch size is 1 in inference).
MXNet from the offical repository. We tested our code on MXNet@(commit 62ecb60). Due to the rapid development of MXNet, it is recommended to checkout this version if you encounter any issues. We may maintain this repository periodically if MXNet adds important feature in future release.
Python packages might missing: cython, opencv-python >= 3.2.0, easydict. If pip
is set up on your system, those packages should be able to be fetched and installed by running
pip install Cython
pip install opencv-python==3.2.0.6
pip install easydict==1.6
pip install hickle
For Windows users, Visual Studio 2015 is needed to compile cython module.
Any NVIDIA GPUs with at least 5GB memory should be OK
git clone https://github.com/msracver/FCIS.git
cmd .\init.bat
. For Linux user, run sh ./init.sh
. The scripts will build cython module automatically and create some folders../fcis/operator_cxx
to $(YOUR_MXNET_FOLDER)/src/operator/contrib
and recompile MXNet../external/mxnet/$(YOUR_MXNET_PACKAGE)
, and modify MXNET_VERSION
in ./experiments/fcis/cfgs/*.yaml
to $(YOUR_MXNET_PACKAGE)
. Thus you can switch among different versions of MXNet quickly.To run the demo with our trained model (on COCO trainval35k), please download the model manually from OneDrive, and put it under folder model/
.
Make sure it looks like this:
./model/fcis_coco-0000.params
Run
python ./fcis/demo.py
Please download VOC 2012 dataset with additional annotations from SBD. Move inst, cls, img
folders to VOCdevit and make sure it looks like this:
Please use the train&val split in this repo, which follows the protocal of SDS.
.data/VOCdevkit/VOCSDS/img/
.data/VOCdevkit/VOCSDS/inst/
.data/VOCdevkit/VOCSDS/cls/
Please download COCO dataset and annotations for the 5k image minival subset and val2014 minus minival (val35k). Make sure it looks like this:
.data/coco/
.data/coco/annotations/instances_valminusminival2014.json
.data/coco/annotations/instances_minival2014.json
Please download ImageNet-pretrained ResNet-v1-101 model manually from OneDrive, and put it under folder ./model
. Make sure it looks like this:
./model/pretrained_model/resnet_v1_101-0000.params
./experiments/fcis/cfgs
.python experiments/fcis/fcis_end2end_train_test.py --cfg experiments/fcis/cfgs/resnet_v1_101_coco_fcis_end2end_ohem.yaml
A cache folder would be created automatically to save the model and the log under output/fcis/coco/
or output/fcis/voc/
.Code has been tested under: