GithubHelp home page GithubHelp logo

opconty / keras_std_plus_plus Goto Github PK

View Code? Open in Web Editor NEW
30.0 2.0 9.0 11 MB

This repository is the official implementation of `A Semantic-based Arbitrarily-Oriented Scene Text Detector`(named STD++ as it is the improved version of STD).

Home Page: https://mp.weixin.qq.com/s/S6z4KCcrycWF4QigBHNahw

Python 100.00%
ocr text-detection keras scene-text-detectors

keras_std_plus_plus's Introduction

STD++: A Semantic-based Arbitrarily-Oriented Scene Text Detector

This repository is the official implementation of A Semantic-based Arbitrarily-Oriented Scene Text Detector(named STD++ as it is the improved version of STD).due to lack of computing resources and time, we tested STD++ on MTWI2018 dataset, and we hope to perform more experiments on any other benchmark datasets, such as IC15,IC17,COCO-Text,MSRA-TD500 and so on.

bigpic

images come from icdar2017rctw

Introduction

STD++ is the improved version of STD, which solved STD's limitations and can be used to detect arbitrarily-oriented texts, yet still preserves its accuracy and efficiency:

  • no any further post-processings, like NMS.
  • anchor-free.
  • easy to generate training labels.
  • only one step process to get final bounding boxes.

Any questions or suggestions,please drop a comment or contact me,email: [email protected].

Training

Download RCTW17 dataset below, and configure your local directory path. refer to train.py

Inference

predict.py

Examples

examples

Dataset

We trained STD++ on MTWI2018 dataset, training and testing images can be downloaded from this site for Text Localization, and we make STD++ annotations available on baiduyun, code: nuti.

License

This project is released under the Apache 2.0 license.

Citation

If you use our codebase in your research, please cite this project. a paper or technical report will be released soon.

And besides, you are welcomed to join us to maintain this project.

@misc{std_plus_plus2019,
  author =       {Gao Lijun},
  title =        {STD++: A Semantic-based Arbitrarily-Oriented Scene Text Detector},
  howpublished = {\url{https://github.com/opconty/keras_std_plus_plus}},
  year =         {2019}
}

keras_std_plus_plus's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

keras_std_plus_plus's Issues

pretrained model

@opconty Thank you for your hard work,
Will you release a pretrained model so that we can test your code.

Also, how to use the model to detect an image?

paper

hi, it is has paper?

cannot import name 'Deconv2D'

Which version of tensorflow should i use?
When running train.py

Traceback (most recent call last):
  File "train.py", line 11, in <module>
    from models.losses import cls_center, regr_h, regr_offset, cl,angle
  File "/home/home/p13/keras_std_plus_plus/models/__init__.py", line 7, in <module>
    from .std_vgg16_skew import StdVGG16
  File "/home/home/p13/keras_std_plus_plus/models/std_vgg16_skew.py", line 10, in <module>
    from keras.layers import Input,Concatenate, Conv2D,UpSampling2D, BatchNormalization, Deconv2D
ImportError: cannot import name 'Deconv2D'

Training the mtwi dataset

@opconty @zhouyuangan

  • Which version of tensorflow should i use?
  • What changes did you make to config.py and train.py in-order to train the mtwi dataset?
  • For training mtwi dataset, what is your folder structure?
  • upload your trained model so i can test.

Inference Speed

@opconty As the title says, could you provide inference speed for me? I trained model on the mtwi dataset and ran predict.py to visualize result but cost 5 seconds per image, it's weird and confusing.

Generate XML annotations on own dataset

@opconty I have a dataset contains some gt files formated like x1,y1,x2,y2,...,x4,y4, label, I inspect your provided xml and confused me, could you provide script or code block for me? Thanks a lot.

Training icdar 2017

@opconty

How to train on icdar 2017?
I have noticed that you std++ code has it's own reconstructed annotations, so how can I prepare the icdar annotations and convert x1,y1,x2,y2,x3,y3 to std++ annotations?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.