GithubHelp home page GithubHelp logo

bityangke / mlod.tf Goto Github PK

View Code? Open in Web Editor NEW

This project forked from frostinassiky/mlod.tf

0.0 1.0 0.0 155 KB

Tensorflow code for "Missing Labels in Object Detection"

License: MIT License

Shell 3.11% Python 94.42% Makefile 0.03% MATLAB 0.63% C++ 0.05% Cuda 1.76%

mlod.tf's Introduction

mlod.tf

Tensorflow code for Missing Labels in Object Detection

This repository is currently not complete. Please open an issue if you want any script related to the paper.

This repository is based on the Tensorflow implementation of faster RCNN detection framework by Xinlei Chen ([email protected]). To set up the environment, please follow the instruction in this repo.

Installation

  1. Clone the repository
git clone https://https://github.com/Frostinassiky/mlod.tf.git
  1. Update your -arch in setup script to match your GPU
cd tf-faster-rcnn/lib
# Change the GPU architecture (-arch) if necessary
vim setup.py
GPU model Architecture
TitanX (Maxwell/Pascal) sm_52
GTX 960M sm_50
GTX 1080 (Ti) sm_61
Grid K520 (AWS g2.2xlarge) sm_30
Tesla K80 (AWS p2.xlarge) sm_37

Note: You are welcome to contribute the settings on your end if you have made the code work properly on other GPUs. Also even if you are only using CPU tensorflow, GPU based code (for NMS) will be used by default, so please set USE_GPU_NMS False to get the correct output.

  1. Build the Cython modules
make clean
make
cd ..
  1. Install the Python COCO API. The code requires the API to access COCO dataset.
cd data
git clone https://github.com/pdollar/coco.git
cd coco/PythonAPI
make
cd ../../..

Dataset

To convert PASCAL VOC or COCO dataset to a missing label dataset, please go to this repo

Train/Test your own model

Check the batch file.

Citation

If you find this implementation helpful, please consider citing:

@inproceedings{xu2019missing,
  title={Missing Labels in Object Detection},
  author={\textbf{Xu, Mengmeng} and Bai, Yancheng and Ghanem, Bernard},
  booktitle={The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
  year={2019}
}

For convenience, here is the faster RCNN citation:

@inproceedings{renNIPS15fasterrcnn,
    Author = {Shaoqing Ren and Kaiming He and Ross Girshick and Jian Sun},
    Title = {Faster {R-CNN}: Towards Real-Time Object Detection
             with Region Proposal Networks},
    Booktitle = {Advances in Neural Information Processing Systems ({NIPS})},
    Year = {2015}
}

mlod.tf's People

Contributors

frostinassiky avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.