Comments (7)
any update?
from whisperx.
This submission will allow for segmented streaming using Whisperx, improving client response time, but not for underlying streaming implementations.
Pull request here
from whisperx.
This issue from whisper.cpp comes to mind Support for realtime audio input .
It highlights some issues with doing realtime transcription with whisper in general.
from whisperx.
Would be great to add support for streaming, because folks have been using Whisper.cpp successfully and implemented streaming - for example gladia.io.
from whisperx.
Whisper itself can't stream, so I dont think so, unfortunately
I assume it is possible, sort of, because whisperx splits the audio to chunks, it can proccess each chunk individually and stream it after finished, instead of waiting for all chunks.
would be nice feature.
from whisperx.
Whisper itself can't stream, so I dont think so, unfortunately
from whisperx.
So this can now be used to transcribe live from an audiostream like mic input? If yes, do you maybe know how?
from whisperx.
Related Issues (20)
- How to enable diarization in python code (not terminal)? HOT 2
- Version 3.1.5 is distributed on pypi but Github repo only has 3.1.1? HOT 2
- WhisperX just stops at Diarization
- Is there a way to transcribe multiple audio files asynchronously/parallel with whisperX?
- Could not load library libcudnn_ops_infer.so.8. Error: libcudnn_ops_infer.so.8: cannot open shared object file: No such file or directory HOT 3
- Transcribing error HOT 2
- KeyError 'en'
- Wav2vec doesn't align numerical characters
- Open PR to add latest version of faster-whisper HOT 1
- Parameter to enable verbose/Segment level printing for better debugging HOT 1
- Can Hard Coded Hyperparameters be moved to a config file? HOT 4
- Use whisperx diarization offline
- Split words separated by hyphens
- bump `pyannote.audio` version to 3.3.1
- ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation. HOT 1
- I’ve successfully installed WhisperX, is there anything I can uninstall to save some disk space?
- Using large-v3 returns some segments in all uppercase
- Question on the pseudo code of arxiv paper
- Just use this script to make the srt more readable for the end results. almost perfect, try it and share your thoughts.
- compute_type whisperX transcription - option to use float32? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from whisperx.