GithubHelp home page GithubHelp logo

dreadlord1984 / r-net Goto Github PK

View Code? Open in Web Editor NEW

This project forked from wangyuxin87/r-net

0.0 2.0 0.0 90 KB

An implement of "R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection”

Python 100.00%

r-net's Introduction

R-Net

Description

This is an implement of "R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection". R-Net is a direct regression based method for multi-oriented scene text detection, which outperforms parallel methods by a large margin. The paper is avaliabel at Paperlink(TMM2020).

Result

Model recall precision F-measure TIoU-R TIoU-P TIoU-F
This implementation (IC15) 84.5 88.1 86.3 60.1 67.9 63.7

The released model is pre-trained on SynthText (1 epoch) and then finetune on ICDAR2015 (batchsize=8 on 1 gpu). Reasults on other datasets (e.g. MSRA-TD500, ICDAR2013, MLT, etc.) can be easily obtained by following our training setting.

Updates

2020/5/18 We have updated the code.

Requirement

  • Python3
  • PyTorch-0.4.1
  • torchvision-0.2.1
  • shapely-1.6.4.post2
  • lanms-1.0.2
  • opencv-python(4.1.0.25)

Installation

Data

Download data images and annotions from ICDAR 2015 Challenge 4. Prepare data as:

./dataset/train/imgs
./dataset/train/txt
./dataset/test/imgs
./dataset/test/txt

Pre-trained Model and Our trained Model.

Download pre-trained VGG16 (must rename as vgg16.pth) and our traind model (passward:oowz) Put both models into output/

Test

CUDA_VISIBLE_DEVICES=0 python test_rnet.py

Train

CUDA_VISIBLE_DEVICES=0,1 python train_rnet.py

Evaluation

We use online tool to evaluate our results.

Speed

Run

CUDA_VISIBLE_DEVICES=0 python speed_eval.py.

We add the model prediction time and the NMS time as the inference speed.

Citation

If you find our method useful for your reserach, please cite

@ARTICLE{rnet2020wang,
  author={Y. {Wang} and H. {Xie} and Z. {Zha} and Y. {Tian} and Z. {Fu} and Y. {Zhang}},
  journal={IEEE Transactions on Multimedia}, 
  title={R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection}, 
  year={2020},
  pages={1-1},}

Feedback

Suggestions and discussions are greatly welcome. Please contact the authors by sending email to [email protected]

r-net's People

Contributors

wangyuxin87 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.