husterrc / seq2seq-layout-analysis

This project forked from limengyang1992/seq2seq-layout-analysis

End-to-end layout analysis based on seq2seq

Python 100.00%

seq2seq-layout-analysis's Introduction

Layout analysis algorithm for receipts and invoices

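Today, layout analysis after OCR is mostly done with hand-written rules. I have suffered from those rules myself: the sight of a pile of OCR text and coordinates makes my scalp go numb. Recently, inspired by the template engine written by the author of chineseocr, I built an end-to-end layout analysis algorithm based on seq2seq, and I hope it helps fellow OCR practitioners.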

Idea: train sentence vectors in a supervised way, then use a seq2seq model to learn the class of each box.
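
The two stages can be pictured as a data flow: every OCR box text becomes a fixed-size vector, and the sequence of box vectors is mapped to a sequence of class labels of the same length. The sketch below only illustrates these shapes. `embed_text` and `tag_sequence` are hypothetical placeholders for the trained sentence-vector model and the seq2seq model; they are not the repository's code and learn nothing.

```python
from typing import List

HIDDEN = 64  # same value as rnn_hidden_size in the config; used here only as a vector size


def embed_text(text: str, dim: int = HIDDEN) -> List[float]:
    """Placeholder for the sentence-vector model: text -> fixed-size vector.

    Each character just increments one bucket, so this is NOT a learned
    embedding; it only has the right shape.
    """
    vec = [0.0] * dim
    for ch in text:
        vec[ord(ch) % dim] += 1.0
    return vec


def tag_sequence(box_vectors: List[List[float]]) -> List[int]:
    """Placeholder for the seq2seq model: N box vectors -> N class ids.

    A trained model would use the whole sequence as context; this stub
    returns class 0 for every box so the input/output alignment is visible.
    """
    return [0 for _ in box_vectors]


ocr_texts = ["Invoice No.", "12345678", "Total", "99.00"]
vectors = [embed_text(t) for t in ocr_texts]  # stage 1: one vector per box
labels = tag_sequence(vectors)                # stage 2: one label per box
```

The point of the second stage is that a box is classified in the context of the other boxes on the same document, which is what hand-written rules usually try to approximate.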

Link: https://blog.csdn.net/mochp/article/details/109491521

Usage

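  • Prepare the data

    • First, run your own OCR algorithm to recognize the text in the receipt images
    • Save the results as JSON that labelme can read
    • In labelme, use the groupid field to annotate the box classes you want to extract
    • Put the annotated data in the corresponding train and test folders under data
    • See data/train/1.json for a sample
  • Modify the following 5 parameters in config.py; the rest can be tuned as needed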

     self.class_char               # label list (matches groupid)
     self.max_text_len = 20        # maximum text length
     self.max_box_num = 50         # maximum number of boxes
     self.expend_box_times = 8     # box augmentation factor
     self.rnn_hidden_size = 64     # sentence vector dimension
    
  • Train the sentence vectors

     python processing.py   # data preprocessing
     python rnn_w2v.py      # train the word vectors
     python rnn_train.py    # train the sentence vectors
    
  • Train the layout analysis model

     python s2s_train.py
    
    
  • Inference

     python infer.py
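
As a rough illustration of the data preparation step, the sketch below reads a labelme-style JSON document and turns each shape into a (text, bounding box, class) triple, applying the two length limits from the config. It rests on several assumptions that should be checked against data/train/1.json: that the OCR text is stored in the shape's `label` field, that the class is the integer in labelme's `group_id` field, that unannotated boxes can be treated as class 0, and that boxes are ordered top to bottom, then left to right. `load_boxes` is a hypothetical helper, not a function from this repository.

```python
import json
from typing import List, Tuple

MAX_TEXT_LEN = 20  # max_text_len in config.py
MAX_BOX_NUM = 50   # max_box_num in config.py


def load_boxes(labelme_json: str) -> List[Tuple[str, List[float], int]]:
    """Return (text, [x_min, y_min, x_max, y_max], class_id) for each shape.

    Assumes the OCR text is in `label` and the class is in `group_id`;
    shapes whose group_id is null are given class 0.
    """
    data = json.loads(labelme_json)
    boxes = []
    for shape in data["shapes"]:
        xs = [p[0] for p in shape["points"]]
        ys = [p[1] for p in shape["points"]]
        bbox = [min(xs), min(ys), max(xs), max(ys)]
        class_id = shape.get("group_id") or 0
        boxes.append((shape["label"][:MAX_TEXT_LEN], bbox, class_id))
    # Reading order: top to bottom, then left to right (an assumption).
    boxes.sort(key=lambda b: (b[1][1], b[1][0]))
    return boxes[:MAX_BOX_NUM]


# A tiny in-memory sample in labelme's shape layout (made-up values).
sample = json.dumps({
    "shapes": [
        {"label": "99.00", "points": [[120, 40], [180, 60]],
         "group_id": 2, "shape_type": "rectangle"},
        {"label": "Total", "points": [[10, 40], [60, 60]],
         "group_id": None, "shape_type": "rectangle"},
        {"label": "Invoice No. 12345678 (duplicate copy)",
         "points": [[10, 5], [200, 25]],
         "group_id": 1, "shape_type": "rectangle"},
    ]
})
boxes = load_boxes(sample)
```

Truncating the text and capping the number of boxes keeps every document within fixed-size inputs, which is presumably why the config exposes both limits.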
    
    

seq2seq-layout-analysis's People

Contributors

limengyang1992, mochp
