GithubHelp home page GithubHelp logo

Get text from audio about pywit HOT 13 CLOSED

wit-ai avatar wit-ai commented on July 30, 2024
Get text from audio

from pywit.

Comments (13)

willywongi avatar willywongi commented on July 30, 2024 2

"Good news everyone!" I pushed the correction @blandinw was asking - I forgot to allow users to set the correct content-type header.

from pywit.

oplatek avatar oplatek commented on July 30, 2024

Speech API would be nice!
Any update on this?

I know I can hack it and submit a speech request to your or any other speech API and than submit the 1-best hypothesis to your converse API.
However, as your (speech) API is quite slow, the latency is not trivial and the user experience horrible
just because I need to submit two requests instead of one.
If you would provide a converse API through speech directly it would speed up things considerably.

from pywit.

jhoelzl avatar jhoelzl commented on July 30, 2024

+1

from pywit.

goose121 avatar goose121 commented on July 30, 2024

I also think that this would be great; after all, there's not much of a point to natural speech if you can't actually speak

from pywit.

lowdev avatar lowdev commented on July 30, 2024

+1

from pywit.

milindaj avatar milindaj commented on July 30, 2024

+1 converse API through speech directly is a great feature to have

from pywit.

andehr avatar andehr commented on July 30, 2024

+1

from pywit.

Accentrix avatar Accentrix commented on July 30, 2024

This feature would make the Pywit library perfect! still waiting.... :/

from pywit.

blandinw avatar blandinw commented on July 30, 2024

Hi everybody, apologies for the lack of responsiveness here and thanks for keeping this issue alive.
We used to have audio recording + streaming in the first versions of the library, but it was a constant source of pain, as it involved a lot of platform specific code.

Regarding audio recording (from a microphone device), I don't think it makes sense to add that to pywit, as it's highly platform specific and does not make sense for server-side use cases.

Regarding the network streaming part, we'd be open to add back a method .speech() to the client that takes a "stream of bytes" (what's the idiomatic way to reprensent that?), uploads it to Wit and returns the response object. We'd need to come up with a solution that works on both Python 2 and 3. We may come around to doing that, but we're working on some other awesome things at the moment. Contributions welcome!

from pywit.

walchko avatar walchko commented on July 30, 2024

You might want to actually read what I was asking for ... I never asked you to capture audio. Just make python as complete as your http api so I can send an audio file for you to interpret ... it is simple!

You also might want to check your pull requests ... Method added to upload voice commands #67 already already does this. I independently implemented a very similar solution long ago, but was far too lazy to submit a pull request. @willywongi however did, so please take a look at his work and consider committing it.

from pywit.

blandinw avatar blandinw commented on July 30, 2024

I commented on the PR, hopefully @willywongi can get around to implementing the last bit soon. We'll merge then.

from pywit.

blandinw avatar blandinw commented on July 30, 2024

Thank you @willywongi!
I merged your PR + bumped Wit to 4.2.0 on PyPI.

from pywit.

sergios-ferreira avatar sergios-ferreira commented on July 30, 2024

Can you please write a complete library? Please include a function for speech (link to your API) passed as an audio file. Basically it does this (per your docs):

  $ curl -XPOST 'https://api.wit.ai/speech?v=20141022' \
   -i -L \
   -H "Authorization: Bearer $TOKEN" \
   -H "Content-Type: audio/wav" \
   --data-binary "@sample.wav"

curl -XPOST "https://api.wit.ai/speech?v=20211113" \
-i -L \
-H "Authorization: Bearer [YOUR_TOKEN]" \
-H "Content-Type: audio/raw;encoding=signed-integer;bits=16;rate=44100;endian=little" \
--data-binary "@[YOUR_AUDIO].wav"

Remember: @ front of YOUR_AUDIO is important.

from pywit.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.