Voice input and output (about chatgpt, CLOSED)

wieslawsoltes commented on August 10, 2024
Voice input and output

from chatgpt.

Comments (3)

wieslawsoltes commented on August 10, 2024

If you provide a C# .NET audio input solution that works cross-platform (Windows, macOS, Linux, Android, iOS and wasm), and a speech-to-text solution that also has a good API for C# .NET and works on the same platforms, I might consider it. Otherwise it's a huge amount of work and would need to be heavily sponsored or submitted as a code contribution. Also note that someone has to maintain and update it; otherwise it's a huge maintenance burden.

damian-666 commented on August 10, 2024

Well, keep an eye out. Microsoft's voice input, Voice Access, is getting pretty good; maybe it's learning, and it's real time. But we'll see if they scrape APIs for context, fix Start, and do all this. They really should. The Julia one is very Linux-oriented; it only works on Android if you run Linux on it. But the voice input module in JustSayIt.jl is separate. The key is a $100 SM58 rockstar mic; an sv 1000, a cardioid, or a studio mic with a preamp helps a lot to filter out fans and noise. A near-field mic is super important. Bard is best at it because its prediction/typo fixing is aligned with its voice-to-text: typo-fix AI plus context. If they get completion right and mix it with the code/feature search, they go from the worst design-by-commit mess of a UI to the best and simplest assistant.

damian-666 commented on August 10, 2024

Here is a helpful generalized spell checker in one prompt, and also a way to feed the choices from Voice Access, at least on Windows, into a post-process that picks one. It still needs collaboration. I think @killian did some of the OCR part, which could be useful on Windows. I may try to combine their work and chain some agents, but someone will do it.

Picovoice said my idea was spam, and they have a useless, slow, laggy product; they don't know what a DSP or FPGA is, or how to do hotwords fast enough. But this helps, at least with universal spell check and menu context completion.

So, having tried them, they cause millions of hours of duplicate work.
But the key is a good dynamic mic.

https://twitter.com/PAF_Kontrol/status/1758368591911530924

You are a helpful assistant named TypoFixerDictionationOrMenuPicker. For a long input, just fix typos in my text; make no major grammar or wording changes or reinterpretations. On a second pass, if it appears that there is a wrong word or acronym that's out of context, you can switch the word.
Output the fixed text, then
give a numbered list of the changed typos, then the changed words on the second pass. For acronyms, if they don't fit in context, include those. If the input contains a list like [ Open File, File Close, Open Last Workspace], just choose the closest match for the word or words following the list, then output that as a "Change Context and Command is", then give the likely match.
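
The menu-picking step in the prompt above (choose the closest match from a list of commands) can also be approximated locally without a model. The following is only a rough sketch using Python's standard-library difflib; the function name pick_menu_command and the 0.5 cutoff are assumptions made for illustration and are not part of the original prompt or of any tool mentioned in this thread.

```python
import difflib


def pick_menu_command(spoken, commands, cutoff=0.5):
    """Return the command closest to the spoken words, or None if nothing is close enough.

    Hypothetical helper: the name and the default cutoff are illustrative assumptions.
    """
    # Compare case-insensitively, but return the command in its original casing.
    by_lower = {command.lower(): command for command in commands}
    matches = difflib.get_close_matches(
        spoken.lower(), list(by_lower), n=1, cutoff=cutoff
    )
    return by_lower[matches[0]] if matches else None


if __name__ == "__main__":
    commands = ["Open File", "File Close", "Open Last Workspace"]
    # A misrecognized dictation such as "open fire" should map to "Open File".
    print(pick_menu_command("open fire", commands))
```

Character-level similarity like this only catches near-spellings; it has no notion of context, which is the part the prompt delegates to the language model.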
