I have a project in which I have to create a website with a voice recognition chatbot. User

I need to develop a generic voice recognition website about pocketsphinx.js (closed)

hashimawan commented on June 16, 2024

I need to develop a generic voice recognition website

from pocketsphinx.js.

Comments (4)

syl22-00 commented on June 16, 2024

I think download time is probably not a huge deal as it happens in a web worker if you use recognizer.js. And the file is cached, so it would only be downloaded once.

However, browsers might not handle very large JavaScript files well. I do not know, so you will have to try.

There is probably an alternative, which is to use HTML5 storage instead of compiling the files into the JavaScript. For that you'd need to look at Emscripten's documentation:
https://github.com/kripken/emscripten/wiki/Filesystem-Guide
https://github.com/kripken/emscripten/wiki/Filesystem-API
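
The idea of keeping model files in browser storage rather than inside the compiled JavaScript can be sketched roughly as follows. This only illustrates the download-once, reuse-later pattern: `loadModelFile`, the `store` object (anything with async `get`/`set`, for example a thin wrapper over IndexedDB) and the `download` callback are hypothetical names, not part of pocketsphinx.js or Emscripten. How the bytes then reach the recognizer's virtual filesystem is what the Emscripten documents linked above cover.

```javascript
// Return the bytes of a model file, downloading it only if it is not stored yet.
// `store` and `download` are injected so the sketch does not depend on a
// particular storage API; both are assumptions made for illustration.
async function loadModelFile(name, store, download) {
  const cached = await store.get(name);
  if (cached !== undefined) {
    return cached; // already stored locally, no network access needed
  }
  const bytes = await download(name); // e.g. fetch(url).then((r) => r.arrayBuffer())
  await store.set(name, bytes);
  return bytes;
}

// In-memory stand-in for persistent browser storage, handy for trying it out.
function createMemoryStore() {
  const entries = new Map();
  return {
    get: async (key) => entries.get(key),
    set: async (key, value) => {
      entries.set(key, value);
    },
  };
}
```

With this pattern everything stays in the browser: the model files are fetched once and kept locally, and no audio has to be sent to a server.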

About the basics of speech recognition, you'll find everything on http://cmusphinx.org.

Acoustic models: parameters that describe the phonemes (building blocks of words).
Statistical language model: a type of language model that defines the probabilities of sequences of words, as opposed to a grammar, which describes the language as a graph.
Pronunciation dictionary: gives the mapping between words and phonemes. You can see it as the bridge between acoustic and language models.

For a speech recognition system, you need an acoustic model, a pronunciation dictionary and a language model (either a grammar or a statistical language model).
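
To make these pieces a bit more concrete, here is a toy JavaScript sketch. The words, phoneme strings, grammar and probabilities are made-up illustration data and the function names are hypothetical; this is not the pocketsphinx.js API or its file formats, and the acoustic model itself (the part that scores audio against phonemes) is left out.

```javascript
// Pronunciation dictionary: word -> phonemes (toy entries in CMU-style notation).
const dictionary = new Map([
  ["HELLO", ["HH", "AH", "L", "OW"]],
  ["WORLD", ["W", "ER", "L", "D"]],
  ["GOODBYE", ["G", "UH", "D", "B", "AY"]],
]);

// Grammar: the language as a graph. Each state lists the words that leave it
// and the state each word leads to. "end" is the accepting state.
const grammar = {
  start: { HELLO: "greeted", GOODBYE: "greeted" },
  greeted: { WORLD: "end" },
  end: {},
};

// Statistical language model: probability of a word given the previous word
// (toy bigram values; "<s>" marks the start of the sentence).
const bigrams = {
  "<s>": { HELLO: 0.7, GOODBYE: 0.3 },
  HELLO: { WORLD: 0.9, HELLO: 0.1 },
  GOODBYE: { WORLD: 1.0 },
};

// Own-property lookup, so inherited keys such as "toString" never match.
function lookup(table, key) {
  return Object.prototype.hasOwnProperty.call(table, key) ? table[key] : undefined;
}

// Map a word sequence to the phoneme sequence an acoustic model would score.
function toPhonemes(words) {
  return words.flatMap((word) => {
    const phones = dictionary.get(word);
    if (!phones) throw new Error(`Word not in dictionary: ${word}`);
    return phones;
  });
}

// A grammar either accepts a word sequence or rejects it.
function grammarAccepts(words) {
  let state = "start";
  for (const word of words) {
    const next = lookup(grammar[state], word);
    if (next === undefined) return false;
    state = next;
  }
  return state === "end";
}

// A statistical model gives every sequence a probability (0 here for unseen pairs).
function bigramProbability(words) {
  let previous = "<s>";
  let probability = 1;
  for (const word of words) {
    const row = lookup(bigrams, previous) || {};
    probability *= lookup(row, word) || 0;
    previous = word;
  }
  return probability;
}
```

For example, `grammarAccepts(["HELLO", "WORLD"])` is true while `grammarAccepts(["WORLD", "HELLO"])` is false, whereas the bigram model ranks sequences by probability instead of giving a yes/no answer.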

I hope that helps.

hashimawan commented on June 16, 2024

Thanks for your quick response!

I built pocketsphinx with an en_us acoustic model (57 MB), without a language model or dictionary file. It produced a pocketsphinx.js file of 253 MB, which is obviously far too large for a web-based application.
Could you please share links where I can download a generic acoustic model, language model (lm) and dictionary (dic) files? I would like to build them with pocketsphinx into a small JS file that I can use on my website and that can understand general user conversation.

Regarding HTML5 storage, I think for that I need to send audio files to the server (HTML5 Storage) where it will transform them into text. Please correct me if I'm wrong.

Thanks,

syl22-00 commented on June 16, 2024

@hashimawan you can find many resources, documentation and help from http://cmusphinx.org, including acoustic and language models. You'll also find links to http://voxforge.org/ with resources, acoustic and language models.

For HTML5 storage, please follow the docs I sent you. I don't know more than that, but it would be great if you shared your experience in a wiki entry if you get anything working.

syl22-00 commented on June 16, 2024

@hashimawan you can now package your acoustic model in separate files, see README.md. That will solve your issue.
