Comments (7)
@loretoparisi さん、こんにちは。
あなたがやりたいことは大体理解しました。
ただしそれは、単純な「kakasi の逆」ではありません。
かな漢字からローマ字への変換は、ほぼ一意に求めることができます。
日本が好きです => にっぽん が すき です => nippon ga suki desu
ところが、ローマ字からかな漢字への変換は、必ずしも一意ではありません。
"ga" の一文字だけに着目しても「が」「画」「賀」「我」「蛾」と多くの候補があり、その文脈において最適な候補が何であるかを求めるにはアルゴリズムと創意工夫が必要です。
また、「nippongasukidesu」を「nippon ga suki desu」と意味のある区切りにする形態素解析も必要となります。
ddskk は、上記の「最適な候補の選択」と「形態素解析」を人間が行うと割り切ったかな漢字変換システムであり、極端に言えばあなたが望むものと最も遠いプログラムでしょう。
I hope someone will translate it into English.
from ddskk.
skktools
is management tool for dictionary file merge/sort/convert.
Is your hope a kana to kanji with Javascript?
from ddskk.
@tkita Thank you, then it was the wrong package. I was looking for skk
main library sources... So not in javascript, I would prefer the C/C++ version to build as an executable or a library. Then I could take care of using node gyp
to wrap headers in node.jsjavascript
or I would use a process fork to execute a compiled binary in the same way I do with kakasi.js. I'm aware of some very old tools like skkfep
but I'm not sure how do they work or if I can use it as standalone executable.
from ddskk.
refer to `Anthy' https://ja.osdn.net/projects/anthy/releases/37536
BTW, API is no good? http://www.google.com/transliterate
see function skk-google-cgi-api-for-japanese-input().
(skk-google-cgi-api-for-japanese-input "かんじ")
=> ("感じ" "漢字" "幹事" "カンジ" "監事")
from ddskk.
@tkita thank you, in my case I would use offline. My aim is to invert romaji to kanji, that is why I was thinking to skk
, so basically the opposite of kakasi, should it be possible right?
In the case of kakasi I'm doing this like
echo "日本が好きです。" | kakasi -i euc -Ha -Ka -Ja -Ea -ka -s -iutf8 -outf8
nippon ga suki desu .
from ddskk.
@tkita thank you so much for the clarification. Assumed the google translated did it well, my reply below 👍
I'm aware of the fact that the transliteration romaji -> kanji
it is not unique, so you need some intelligence to get the best candidate among k
. For Indian languages I'm using a SHMM
model plus a Neural Network of the weights, then Viterbi
is used for decoding to get the best of K
candidates among the optimal ones.
Now I understand why there is a morphological analyzer in anthy-morphological-analyzer
and the anthy-dic-tool
, thank you.
So we can say that dkk
can be used as a intermediate step between kanji
and romaji
, starting from kana
but not as-it-is then.
What I'm working on right now it is a Tensorflow sequence 2 sequence neural network architecture that inverts romaji to kanji (a seq2seq machine learning task) using a parallel corpus of sentences.
Thanks again for your help, closing then!
from ddskk.
Sorry for my weird English.
I wish for the success of your project.
from ddskk.
Related Issues (20)
- skk-autoloadsのrequireについて HOT 7
- nicola/skk-nicola.elのnext-command-eventについて HOT 1
- skk-get-current-candidate: Symbol’s function definition is void: skk-henkan-count HOT 7
- skk-jisyo の standard value について HOT 1
- trailing space HOT 4
- 「う゛」から「ゔ」使用に切り替えるパッチ HOT 1
- Multiple Cursorsと一緒使えない HOT 1
- マーカー(▼と▽)の変更 HOT 17
- MELPAでddskkではなくskkとして配布する HOT 7
- Markdown-modeの表中でddskkを使うと▼マーカーが削除されない HOT 4
- skk-undo-kakutei-return-previous-point non-nil の時の挙動 HOT 2
- skk-azikの「tU」->「っ」定義について HOT 2
- (void-function facemenu-color-equal) in Emacs-28 HOT 1
- Cursor colour no longer changes to match hiragana/katakana mode. HOT 18
- void-function facemenu-color-equal in Emacs-28.0 HOT 2
- sticky-key使用時、まだ消費されていない文字が消える
- make info で error HOT 2
- Emacs 29.0.50: Wrong type argument: listp, 1000000000 HOT 1
- '(setq skk-use-azik t)' in ~/.skk causes error 'Symbols' definition is void: del-alist` HOT 1
- emacs-29とddskk masterでisearchを使えば「Lisp error: (void-function nil)」が出ます HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ddskk.