Comments (4)
I think download time is probably not a huge deal as it happens in a web worker if you use recognizer.js
. And the file is cached, so it would only be downloaded once.
However, large JavaScript files might not be valid with browsers, you have to try, I do not know.
There is probably an alternative which is to use HTML5 storage instead of compiling the files inside the JavaScript. For that you'd need to look at Emscripten's documentation:
https://github.com/kripken/emscripten/wiki/Filesystem-Guide
https://github.com/kripken/emscripten/wiki/Filesystem-API
About the basics of speech recognition, you'll find everything on http://cmusphinx.org.
- Acoustic models: parameters that describe the phonemes (building blocks of words).
- Statistical language model: type of language model that defines probabilities of series of words. As opposite of grammar that describe language as a graph.
- Pronunciation dictionary: gives the mapping between words and phonemes. You can see it as the bridge between acoustic and language models.
For a speech recognition system, you need an acoustic model, a pronunciation dictionary and a language model (either a grammar or a statistical language model).
I hope that helps.
from pocketsphinx.js.
Thanks for your quick response!
I build pocketsphinx with an en_us acoustic model (57 MB) without language model and dictionary file, It created pocketsphinx.js file of size 253 MB, obviously its a huge size which is not affordable for a web base applications.
Can you please share with me links from where I can download generic acoustic model, lm, dic files and build them with pocketsphinx that create small size js file which I can use in my website that can easily understand all generic conversation of users.
Regarding HTML5 storage, I think for that I need to send audio files to the server (HTML5 Storage) where it will transform them into text. Please correct me if I'm wrong.
Thanks,
from pocketsphinx.js.
@hashimawan you can find many resources, documentation and help from http://cmusphinx.org, including acoustic and language models. You'll also find links to http://voxforge.org/ with resources, acoustic and language models.
For HTML5 storage, please follow the docs I sent you, I don't know more than that, but it'd be great that you share your experience in a wiki entry if you get anything working.
from pocketsphinx.js.
@hashimawan you can now package your acoustci model in separate files, see README.md. That will solve your issue.
from pocketsphinx.js.
Related Issues (20)
- lazyLoading, Module.FS_createPath not found (emscripten compilation without accoustic model) HOT 1
- some time when click on start i am getting Cannot change search while decoding, end utterance first HOT 1
- FATAL ERROR: CALL_AND_RETRY_LAST Allocation failed - JavaScript heap out of memory HOT 2
- Words.push_back accept 1 arguments only HOT 1
- Trying to build but submodules fail HOT 2
- BindingError with self-compiled pocketsphinx.js HOT 2
- Change threshold to increase accuracy detection HOT 2
- in web form asp.net? HOT 2
- Can't install with npm HOT 1
- Webapp.js file is missing HOT 1
- Buffer in recognizer uses obsolete data on multiple "process"-requests
- issue with make
- Web Worker Lazy Load Command - Outgoing Message is Empty HOT 1
- Compilation Failed In Custom Acoustic Model
- Grammar for numbers? HOT 5
- Pocketsphinx within web worker
- live.html demo no longer works in Chrome, only Firefox
- Compilation into WebAssembly fails HOT 1
- Sorry if I laugh HOT 1
- Local host not working with chrome
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pocketsphinx.js.