Comments (4)

syl22-00 commented on June 16, 2024
  1. PocketSphinx does support listening for multiple key phrases, but (currently) only if they are provided in a file. To do that, put your key phrases in a text file, package it into JavaScript as explained in the documentation, load that JavaScript, and initialize the decoder with the argument that loads the file ("-kws"). Since the file is loaded at initialization, you also need to pass a corresponding dictionary file that contains the words in your key phrases. All of this is documented in README.md. A drawback of adding key phrases this way is that you cannot change them at runtime, so it would be great if the pocketsphinx API supported that too. This was already discussed and documented in earlier tickets, such as #45.
  2. None of them is incorrect; it depends on which words you have in your key phrases. Every word needs a pronunciation in the dictionary. I'd say it is more straightforward to use individual words, but combining a short phrase into one word might increase accuracy; that would have to be tested. Note that you may get better performance from a grammar with well-chosen and tested probabilities instead.
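
As a rough sketch of the two files described in point 1, the snippet below builds the text of a key phrase file (one phrase per line, followed by a detection threshold between slashes, which is the usual PocketSphinx keyword-list layout) and a matching dictionary, and checks that every word in the phrases has a pronunciation. The helper names, the example phrases, the threshold values, and the phoneme strings are all illustrative assumptions, not part of pocketsphinx.js; thresholds in particular need tuning per phrase.

```javascript
// Hypothetical helpers (not part of pocketsphinx.js): they only build file contents.

// One phrase per line, followed by its detection threshold between slashes.
function buildKeyphraseFile(phrases) {
  return phrases.map(({ text, threshold }) => `${text} /${threshold}/`).join('\n') + '\n';
}

// One dictionary entry per line: the word, then its phonemes.
function buildDictionary(pronunciations) {
  return Object.entries(pronunciations)
    .map(([word, phones]) => `${word} ${phones}`)
    .join('\n') + '\n';
}

// Lists the words used in the key phrases that have no dictionary entry.
function missingWords(phrases, pronunciations) {
  const words = new Set(phrases.flatMap(({ text }) => text.trim().split(/\s+/)));
  return [...words].filter(
    (word) => !Object.prototype.hasOwnProperty.call(pronunciations, word)
  );
}

// Illustrative data: thresholds and phonemes are assumptions to be tuned and verified.
const phrases = [
  { text: 'HELLO WORLD', threshold: '1e-20' },
  { text: 'GOODBYE', threshold: '1e-10' },
];
const pronunciations = {
  HELLO: 'HH AH L OW',
  WORLD: 'W ER L D',
  GOODBYE: 'G UH D B AY',
};

console.log(buildKeyphraseFile(phrases));
console.log(buildDictionary(pronunciations));
console.log('Missing words:', missingWords(phrases, pronunciations));
```

The two generated strings would be saved as files, packaged into JavaScript as the README describes, and passed at initialization with "-kws" and the dictionary argument. Running the missing-word check first catches the case where a key phrase contains a word the dictionary does not cover.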

from pocketsphinx.js.

Villmer commented on June 16, 2024

I've created a "grammar" version that works well, but I am unable to determine when the recognizer has reached its "last" result after I've spoken. For example, as I speak, I see the recognizer go through several guesses before it finally stops. When it does, it is often correct, but I can't find (in the various js files) how to determine that it is "done" listening, so that I can fire an event using only the last result.

The reason I want only the LAST guess is that I want to fire a function. As it is, several unwanted functions fire before the last one.

So, here is my question: How do I determine if the processing session is done (reached a final result) after I've stopped speaking? I'm making a mobile application that continually listens. The user will speak one or two-word phrases (short commands under a second).

Villmer commented on June 16, 2024

I'm taking a look at the ticket (#45) now. I think if I can get multiple keywords working, it will be a better solution.

miguelmota commented on June 16, 2024

For future reference, here is a repo containing an example that uses multiple keywords: https://github.com/miguelmota/pocketsphinxjs-multiple-keywords
