GithubHelp home page GithubHelp logo

we1l1n's Projects

spf icon spf

Cornell Semantic Parsing Framework

sqg icon sqg

Query Generation for Question Answering over Knowledge Bases

src icon src

tools for fast reading of docs

suanshu icon suanshu

Extension of original open-sourced math library, SuanShu.

superdesk icon superdesk

Superdesk is an end-to-end news creation, production, curation, distribution, and publishing platform.

synonym-extractor icon synonym-extractor

Extract synonyms, keywords from sentences using modified implementation of Aho Corasick algorithm

table-reading-understanding-in-documents-images icon table-reading-understanding-in-documents-images

Tables and forms are a very common way to organize information in structured documents. Their recognition is fundamental for the recognition of the documents. Indeed, the physical organization of a table or a form gives a lot of information concerning the logical meaning of the content. The requirement of detection and identification of tables from document images is crucial as tables contain important information, and also most of the layout analysis methods fail in the presence of tables in the document image. To build a solution that can detect tables and its layout in a given documents, then make sense of the information they present. Tables present in documents are often used to compactly communicate important information in rows and columns. To automatically extract this information by digitization of paper documents, the tabular structures need to be identified and the layout and inter-relationship between the table elements need to be reserved for subsequent analysis.

tablemaster-mmocr icon tablemaster-mmocr

2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.

tac-entity-linking icon tac-entity-linking

An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.

tanda icon tanda

Learning to Compose Domain-Specific Transformations for Data Augmentation

taskflow icon taskflow

A General-purpose Parallel and Heterogeneous Task Programming System

tc-bot icon tc-bot

User Simulation for Task-Completion Dialogues

teddy icon teddy

A system for interactive review analysis.

text-classifier icon text-classifier

text-classifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,BiLSTM_Attention,Transformer等模型实现,开箱即用。

text-detection-ctpn icon text-detection-ctpn

text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network

textanalyzer icon textanalyzer

A text analyzer which is based on machine learning,statistics and dictionaries that can analyze text. So far, it supports hot word extracting, text classification, part of speech tagging, named entity recognition, chinese word segment, extracting address, synonym, text clustering, word2vec model, edit distance, chinese word segment, sentence similarity,word sentiment tendency, name recognition, idiom recognition, placename recognition, organization recognition, traditional chinese recognition, pinyin transform.

textfeatureextraction icon textfeatureextraction

Self complemented text feature extraction using algorithms including CHI, DF, IG, MI for the experiment of text classification based on sogou online news, 基于卡方检验CHI,文档频率DF, 信息增益IG,互信息MI的文本特征提取与实现

textgrapher icon textgrapher

Text Content Grapher based on keyinfo extraction by NLP method。输入一篇文档,将文档进行关键信息提取,进行结构化,并最终组织成图谱组织形式,形成对文章语义信息的图谱化展示。

thingtalk icon thingtalk

The Programming Language of Virtual Assistants

thuocl icon thuocl

THUOCL(THU Open Chinese Lexicon)中文词库

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.