Comments (12)

kenarsa commented on August 19, 2024

Can you please elaborate on the project and what you are trying to achieve? Then I can help better.

from leopard.

Bradams2212 commented on August 19, 2024

The project is a voice assistant that I've been working on. The assistant uses pvporcupine for the wake word and then takes the user's voice input. That input is passed through several processing steps, some of which require numerals rather than number words. It also runs on a Raspberry Pi, so every bit of efficiency counts. If I can skip the step of saving the audio to a file, I assume that will speed things up. I also need it to stop listening once the user stops speaking, not after a fixed time limit. If it could also adjust for background noise, that would be good.
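For illustration, here is a minimal sketch of the "stop when the user stops speaking" idea with a simple background-noise adjustment. It is a plain energy-based endpointer, not how any Picovoice engine works internally. The function names, the default thresholds, and the assumption that audio arrives as frames of 16-bit integer samples (the way a wake-word loop typically reads them) are all made up for the example. The utterance is kept in memory, so no file is written.

```python
import math
from typing import Iterable, List, Sequence


def rms(frame: Sequence[int]) -> float:
    """Root-mean-square level of one frame of 16-bit samples."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def capture_utterance(
    frames: Iterable[Sequence[int]],
    calibration_frames: int = 10,       # assumed: first frames are background noise
    threshold_ratio: float = 3.0,       # assumed: speech is 3x louder than the noise floor
    trailing_silence_frames: int = 15,  # assumed: this many quiet frames end the utterance
) -> List[int]:
    """Collect samples in memory until the speaker has gone quiet."""
    noise_levels: List[float] = []
    utterance: List[int] = []
    speech_started = False
    quiet_run = 0

    for frame in frames:
        level = rms(frame)

        # Use the first few frames to estimate the background noise level.
        if len(noise_levels) < calibration_frames:
            noise_levels.append(level)
            continue

        noise_floor = max(sum(noise_levels) / len(noise_levels), 1.0)
        if level > noise_floor * threshold_ratio:
            speech_started = True
            quiet_run = 0
        elif speech_started:
            quiet_run += 1

        # Keep audio only once speech has begun, including the trailing pause.
        if speech_started:
            utterance.extend(frame)
            if quiet_run >= trailing_silence_frames:
                break

    return utterance
```

The returned list can then be handed to whatever consumes raw samples. On a Raspberry Pi the per-frame cost is one pass over the samples, but the thresholds would need tuning against a real microphone.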

Bradams2212 commented on August 19, 2024

I live in Australia and I'm going to sleep now. Hopefully that information is enough.

kenarsa commented on August 19, 2024

Have you tried Rhino?

Bradams2212 commented on August 19, 2024

How does Rhino work?

Bradams2212 commented on August 19, 2024

If the action is something like checking the weather or getting my schedule for the day, can it do that? What about more specific commands, like raising a frame in tkinter?

kenarsa commented on August 19, 2024

Yup. Check the repo out.

Bradams2212 commented on August 19, 2024

What if I wanted to set the lights to a specific brightness or the temperature to a specific value? Is it also free of a fixed listening window? Can it pick out a town or suburb from someone's speech? In some places I still need speech-to-text conversion.

Bradams2212 commented on August 19, 2024

Also, the demo files only show saving the audio to a WAV file. Is there a way around that?

Bradams2212 commented on August 19, 2024

I didn't need to with Porcupine. If I use a similar method, will that work, or is Porcupine different in that respect?

Bradams2212 commented on August 19, 2024

I already have pre-existing code to take the user's voice input and interpret what they want. Would it be possible for Rhino to interpret what the user wants to do, convert what they said to a string, and then pass that into my pre-existing functions?
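To make the question concrete, here is a rough sketch of routing a speech-to-intent result into existing functions. It assumes the engine hands back an "understood" flag, an intent name, and a dictionary of slot values as strings, which is my understanding of Rhino's inference result but should be checked against the binding. The intent names, slot names, and handler functions below are hypothetical stand-ins.

```python
from typing import Callable, Dict


# Hypothetical stand-ins for functions the assistant already has.
def get_weather(location: str = "home") -> str:
    return f"weather for {location}"


def set_brightness(level: str = "50") -> str:
    # Slot values are assumed to arrive as strings, e.g. "70".
    return f"brightness set to {int(level)}%"


# Map each (hypothetical) intent name to the function that handles it.
HANDLERS: Dict[str, Callable[..., str]] = {
    "getWeather": get_weather,
    "setBrightness": set_brightness,
}


def dispatch(is_understood: bool, intent: str, slots: Dict[str, str]) -> str:
    """Call the handler registered for an intent, passing slots as keyword arguments."""
    if not is_understood or intent not in HANDLERS:
        return "sorry, I did not understand that"
    return HANDLERS[intent](**slots)
```

With this shape, `dispatch(True, "setBrightness", {"level": "70"})` calls the existing brightness function directly. The slot names have to match the handler's parameter names, which is a design choice of this sketch rather than anything the engine requires.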

kenarsa commented on August 19, 2024

Quick recap:

1- Leopard cannot turn written numbers into numeric form (e.g. seven to 7), and there is no immediate plan to add that.
2- Rhino has that functionality. You can also mix and match it with Leopard as needed.
3- Look into the header file (binding code) to learn how to use each engine. A demo is just a demo.
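On point 1, one workaround is to post-process the transcript yourself. The sketch below is a naive converter for English number words and is not part of any Picovoice SDK. The function names are invented for the example, it assumes a lowercase-friendly transcript without punctuation, and it does not handle "and", ordinals, or digit-by-digit sequences such as "one two three".

```python
from typing import Dict, List, Sequence

UNITS: Dict[str, int] = {
    word: value
    for value, word in enumerate(
        "zero one two three four five six seven eight nine ten eleven twelve "
        "thirteen fourteen fifteen sixteen seventeen eighteen nineteen".split()
    )
}
TENS: Dict[str, int] = {
    word: (index + 2) * 10
    for index, word in enumerate(
        "twenty thirty forty fifty sixty seventy eighty ninety".split()
    )
}
NUMBER_WORDS = set(UNITS) | set(TENS) | {"hundred", "thousand"}


def words_to_number(tokens: Sequence[str]) -> int:
    """Convert a run of number words, e.g. ['twenty', 'two'], to an int."""
    total, current = 0, 0
    for token in tokens:
        if token in UNITS:
            current += UNITS[token]
        elif token in TENS:
            current += TENS[token]
        elif token == "hundred":
            current = max(current, 1) * 100
        elif token == "thousand":
            total += max(current, 1) * 1000
            current = 0
    return total + current


def replace_number_words(text: str) -> str:
    """Replace each consecutive run of number words in text with digits."""
    out: List[str] = []
    run: List[str] = []

    def flush() -> None:
        # Emit the pending run of number words as a single numeral.
        if run:
            out.append(str(words_to_number(run)))
            run.clear()

    for word in text.split():
        key = word.lower()
        if key in NUMBER_WORDS:
            run.append(key)
        else:
            flush()
            out.append(word)
    flush()
    return " ".join(out)
```

For example, `replace_number_words("set the temperature to twenty two degrees")` gives `"set the temperature to 22 degrees"`. Because it replaces every number word it sees, a phrase like "no one" would also be rewritten, so in practice it is safer to apply it only where a number is expected.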
