GithubHelp home page GithubHelp logo

test-mass-forker-org-1 / recognizers-text Goto Github PK

View Code? Open in Web Editor NEW

This project forked from microsoft/recognizers-text

0.0 0.0 0.0 47.46 MB

Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, and date/time expressed in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV). Contributions are greatly welcome! Packages are available at https://www.nuget.org/profiles/Recognizers.Text and https://www.npmjs.com/~recognizers.text

License: MIT License

Shell 0.03% JavaScript 10.13% Python 18.28% Java 19.67% C# 41.65% PowerShell 0.05% TypeScript 9.90% HTML 0.15% Batchfile 0.14% Dockerfile 0.01%

recognizers-text's Introduction

Microsoft Recognizers Text Overview

Build Status Build Status

Microsoft.Recognizers.Text provides robust recognition and resolution of entities like numbers, units, and date/time; expressed in multiple languages. Full support for Chinese, English, French, Spanish, Portuguese, German, Italian, Turkish, Hindi, and Dutch. Partial support for Japanese, Korean, Arabic, and Swedish. More on the way.

Utilizing the Project

Microsoft.Recognizers.Text powers pre-built entities in LUIS: Language Understanding Intelligent Service, Power Virtual Agents, and Microsoft Bot Framework; base entity types in Text Analytics Cognitive Service; and it is also available as standalone packages (for the base classes and the different entity recognizers).

The Microsoft.Recognizers.Text packages currently target four platforms:

Contributions are greatly welcome! Both for fixes and extensions in the currently supported languages and for expansion to new ones. Especially for Japanese, Korean, Arabic, Swedish, and others! More info below.

.NET is the primary package version and contributions propagate to the other platforms with time.

Help

If you have any questions, please go ahead and open an issue, even if it's not an actual bug. Issues are an acceptable discussion forum as well.

Contributing

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Good starting points for contribution are:

  • the list of open issues (especially those marked as help wanted);
  • the json spec cases temporarily marked as NotSupported (Specs); and
  • translating json test spec cases that work in English, but don't yet exist in a target language.

The links below describe the project structure and provide both an overview and tips on how to contribute (although some steps may have become a little out-of-date). Thank you!

Supported Entities across Cultures

The table below summarizes the currently supported entities. Support for English is usually more complete than others. The primary platform is .NET (shown in table) and support should propagate to the others.

Entity Type EN ZH-CN NL FR DE IT JA KO PT ES
Number (cardinal)
Ordinal
Percentage
Number Range PA/EO
Unit - Age PA/EO
Unit - Currency PA/EO
Unit - Dimensions PA/EO
Unit - Temperature
Choice - Boolean SO
Seq. - E-mail G G* G G G G G* G* G G
Seq. - GUID G G G G G G G G G G
Seq. - Social G G G G G G G G G G
Seq. - IP Address G G G G G G G G G G
Seq. - Phone Number G G G G G G G G G G
Seq. - URL G G* G G G G G* G* G G
DateTime (+subtypes) SO SO
Entity Type SV BG TR HI AR
Number (cardinal) PA/EO
Ordinal PA/EO
Percentage PA/EO
Number Range PA/EO
Unit - Age
Unit - Currency
Unit - Dimensions
Unit - Temperature
Choice - Boolean
Seq. - E-mail G G G G G
Seq. - GUID G G G G G
Seq. - Social G G G G G
Seq. - IP Address G G G G G
Seq. - Phone Number
Seq. - URL G G G G* G*
DateTime (+subtypes) SO
  • G: Generic entity, not language-specific (* unicode TLDs not-supported);
  • EO: Extraction-only (parsing/resolution/normalization pending);
  • PA: Partial support (type not fully supported);
  • SO: Specs-only (test specs coverage OK, but support pending);
  • SP: Partial specs;
  • SI: Very initial specs (typically language support start for a new language).

recognizers-text's People

Contributors

juanar avatar tellarin avatar aitelint avatar sanxing-chen avatar guom08 avatar grey0202 avatar aitelintii avatar songwenhao1 avatar enzocano avatar pcostantini avatar rubio41 avatar aliandi avatar imicknl avatar sothan avatar dependabot[bot] avatar chopperman33 avatar anichikage avatar neudurgeshp avatar gasper-az avatar pete1854 avatar paradoxarg avatar visionshao avatar amitstein avatar ejadib avatar haoyangms avatar wgx998877 avatar johnataylor avatar matthewshim-ms avatar acblacktea avatar sherryzyy avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.