GithubHelp home page GithubHelp logo

sun1638650145 / textextraction Goto Github PK

View Code? Open in Web Editor NEW
4.0 1.0 0.0 12 KB

An AutoML about text extraction, you can use three or four lines of code to extract text.

License: Apache License 2.0

Python 100.00%
transformer keras textextracting pipeline nlp

textextraction's Introduction

TextExtraction

一个关于文本提取的AutoML,你可以使用三四行代码就可以进行文本提取,

阅读其他语言版本:简体中文English

例子

pipeline

这是一个非常简短的例子(入门推荐)

from TextExtraction import TextExtractionPipeline
pipeline = TextExtractionPipeline(train_dataset_path='../Sentiment_Extraction103/train.csv')
pipeline.run()

例子中使用的数据集在这里点击

custom

如果你希望更高的准确率,可以使用自定义模式(针对有经验的开发者)

  1. 你可以使用model和tools下的API构建你自己的模型
  2. 有问题欢迎和作者交流,联系方式qq:1638650145,邮箱:[email protected]

性能

  1. 使用Jaccard系数评估,Jaccard系数在0.69-0.70之间
  2. 使用tf.data.Datasets读入数据集并使用默认参数,在Nvidia Tesla P100上的参考速度是370ms/step, 在Google TPU上参考速度是101ms/step
  3. 支持使用TPU运行

如果你想

  1. 如果你想改进代码,请使用PEP8标准,否则一定无法通过
  2. 如果你想使用其他的模型比如Bert、Albert,请与作者交流,联系方式在上面
  3. 如果你想star和fork,那就不用想了直接做就行了,你的想法非常明智,最后,非常感谢你的支持

textextraction's People

Contributors

sun1638650145 avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.