GithubHelp home page GithubHelp logo

warnikchow / 3i4k Goto Github PK

View Code? Open in Web Editor NEW
85.0 4.0 22.0 21.42 MB

Intonation-aided intention identification for Korean

License: Creative Commons Attribution Share Alike 4.0 International

Python 100.00%

3i4k's Introduction

3i4K

Intonation-aided intention identification for Korean

Requirements

fastText, Keras (TensorFlow), Numpy, Librosa
Currently available for python 3.5 and upper version is in implementation

Word Vector

Pretrained 100dim fastText vector

  • Download this and unzip THE .BIN FILE in the NEW FOLDER named 'vectors'
  • This can be replaced with whatever model the user employs, but it requires an additional training.

Dataset

(20.10.13) We updated our LICENSE to CC-BY-SA-4.0 and rearranged our guideline (in Korean) so as to be used officially. The international version is to be prepared with the publication.

(19.06.06) We provide train & validation, and test set separately here, for an easier Keras-based implementation.

The (train+validation) : test ratio is 9 : 1, and train : validation ratio is also 9 : 1 (thus, in total, 0.81 : 0.09 : 0.1).

The renewed version of the corpus is uploaded along with the models. May not be changed unless severe defect is observed.

(18.12.28) Dataset under modification!!

We've found a few misclassified utterances and undergoing modification, thus the true-final version will be disclosed before Fabrary. Pilot implemenation of the system (e.g., as tutorial) is less involved with this problem, but do not cite this dataset as a benchmark until Fabrary. The notice will be available as soon as possible.

(18.11.22) A final version of dataset and the new model is uploaded!

The next version will incoporate much more utterances and will be treated as a separate dataset.

FCI: A seven-class text corpus for the classification of conversation-style and non-canonical Korean utterances

  • F: Fragments (nouns, noun phrases, incomplete sentences etc.) (FRs)
  • C: Clear-cut cases (statements, questions, commands, rhetorical questions, rhetorical commands) (CCs)
  • I: Intonation-dependent utterances (IUs)

Corpus composition


  • IAA: kappa = 0.85 for Corpus 1
  • Data for FCI module is labeled in 0-6, split in train:test with ratio 9:1.
  • Available in data folder.

Block diagram


System Description

  • Easy start: Demonstration.exe
 python3 3i4k_demo.py 

Intention Identification

  • Given only a text input, the system classifies the input into one the aforementioned 7 categories. Available in demo.
  • Text classification is also available in demo; a corpus (input: filename without '.txt') can be categorized into 7 classes.
  • Available by importing module
 from classify import pred_only_text('sentence_you_choose') 
 from classify import classify_document('filename_you_choose') 

Speech Intention Understanding

  • Available by importing module
 from classify import pred_audio_text('speechfilename_you_choose', 'sentence_you_choose') 

Annotation Guideline and Acknowledgement

The annotation guideline (in Korean) (previous version is here) was elaborately constructed by Won Ik Cho, with the great help of Ha Eun Park and Dae Ho Kook. Also, the authors appreciate Jong In Kim, Kyu Hwan Lee, and Jio Chung from SNU Spoken Language Processing laboratory (SNU SLP) for providing the useful corpus for the analysis. We note that this work was supported by the Technology Innovation Program (10076583, Development of free-running speech recognition technologies for embedded robot system) funded By the Ministry of Trade, Industry & Energy (MOTIE, Korea).

Citation

For the utilization of the word vector dictionary, cite the following:

@article{cho2018real,
	title={Real-time Automatic Word Segmentation for User-generated Text},
	author={Cho, Won Ik and Cheon, Sung Jun and Kang, Woo Hyun and Kim, Ji Won and Kim, Nam Soo},
	journal={arXiv preprint arXiv:1810.13113},
	year={2018}
}

For the utilization of the annotation guideline or the dataset, cite the following:

@article{cho2018speech,
	title={Speech Intention Understanding in a Head-final Language: A Disambiguation Utilizing Intonation-dependency},
	author={Cho, Won Ik and Lee, Hyeon Seung and Yoon, Ji Won and Kim, Seok Min and Kim, Nam Soo},
	journal={arXiv preprint arXiv:1811.04231},
	year={2018}
}

YouTube demo (non-audio-input version; for past submission)

https://youtu.be/OlvLlH8JgmM

3i4k's People

Contributors

warnikchow avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

3i4k's Issues

word embedding파일이 열리지 않습니다

안녕하세요:)
맥환경과 우분투환경에서 load가 되지 않아 ValueError: model_drama.bin has wrong file format! 에러가 뜹니다.
파일에 문제가 있다면 새롭게 업로드해주실 수 있을까요?

https://fasttext.cc/docs/en/crawl-vectors.html 에서 한국어 pretrained model을 이용한 후 파이썬과 tf,keras 부분 버전, utf변환 등을 수정해준후에 데모는 이용할 수 있었습니다. 감사합니다!

안녕하세요 혹시 음성 DB는 비공개인가요?

안녕하세요
"Speech Intention Understanding in a Head-final Language: A Disambiguation Utilizing Intonation-dependency" 논문 재밌게 잘 보았습니다.
혹시 논문에서 언급된 음성 DB와 annotation은 비공개인가요??ㅠ

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.