GithubHelp home page GithubHelp logo

Detection accuracy about langid.py HOT 3 CLOSED

vamseekm avatar vamseekm commented on August 29, 2024
Detection accuracy

from langid.py.

Comments (3)

vamseekm avatar vamseekm commented on August 29, 2024

Sorry my mistake, it was encoding issue. Thanks for the great tool.

from langid.py.

saffsd avatar saffsd commented on August 29, 2024

No worries. Indeed, my own check verifies that zh is correctly detected

**功夫是一门博大精深的武学艺术 , **功夫app , 介绍**功夫的分类、特点、器材、门派等与**功夫有关的内容!让广大读者能够更完整的了解**功夫的精华!
('zh', -1414.5709274662972)

Could you provide me some detail on how you used the tool? It would be good if I can detect potential encoding issues beforehand and try to address them, to make the tool as simple as possible for the end user.

from langid.py.

vamseekm avatar vamseekm commented on August 29, 2024

I forgot to supply utf8 encoding option to mysql connection while trying to get some unicode text from db. So I was essentially passing garbled mess to langid.py and asking it to identify the language. Again langid.py is a great tool it was my stupid mistake. :)

from langid.py.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.