GithubHelp home page GithubHelp logo

For help how to encode and decode another languages which are different from Chinese and English such as Arabic! about synthtext_chinese_version HOT 1 OPEN

jarveelee avatar jarveelee commented on June 30, 2024
For help how to encode and decode another languages which are different from Chinese and English such as Arabic!

from synthtext_chinese_version.

Comments (1)

zhengwx11 avatar zhengwx11 commented on June 30, 2024 1

@yingning
I was trying to make from my own modification too (based on ankush-me/SynthText).
Here are some tips:

  • Prepair your own fonts, and put it in data/font/, adding them in the fontlist.txt
  • Modify the data/models/font_px2pt.cp. This is a pickle file of a dict object. You know what key to add once you got a error about key not found.
  • text_utils.py, line 518. This should be the __init__ of TextSource class. You should change with open(fn, 'r') as f: to with open(fc, 'r', encoding='utf8'). Maybe it won't be 'utf8' for Arabic, I don't know.
  • Add your own text in data/newsgroup/newsgroup.txt. It is lines of text where words are separated by space. There is a little problem with Chinese for that words are not separated by space in Chinsese. I don't know how Arabic works, but be careful with it.
  • text_utils.py, line 130, there should be something like line_bounds = font.get_rect.... I got an error glyph not found for id 3 here. The reason seems to be that additional space is add round the text in lines. So I add line = lines[np.argmax(lengths)].strip() to fix it. But I still don't understand why space matters. Maybe something is wrong with the fonts file.

That's all I do to make it work for Chinese. Just run it over and over again along with bug fixing.

from synthtext_chinese_version.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.