Comments (3)
are you passing in --language de
, that way it knows it is german?
from whisperx.
I am using the Python API (result = model.transcribe(audio_file)
) and was not aware of a parameter for the transcribe function, that allowed me to enforce a certain language.
I was able to improve the performance to a usable level by adding the extend_duration
parameter with 0.1 as value, but it still cuts of the beginning of the word from time to time
from whisperx.
new VAD filtering feature should fix this, feel free to re-open if not
from whisperx.
Related Issues (20)
- detect_language() doesn't work on a GPU? HOT 2
- Transcription/translation language change -- '50358' / '50359' is not a valid task (accepted tasks: transcribe, translate)
- Wrong cuda device (device_id > 0) on VAD in whisperx.load_model() method
- A tip: color coding the outputs (e.g. srt) from the JSON files HOT 1
- RuntimeError: Library libcublas.so.12 is not found or cannot be loaded HOT 3
- Word Level Transcripts Error HOT 2
- Transcription fails with diarization enabled
- Some transcriptions missing properties HOT 1
- RuntimeError: parallel_for failed: cudaErrorNoKernelImageForDevice: no kernel image is available for execution on the device HOT 5
- Getting no audio found error HOT 1
- whisperx.load_model & default_asr_options Error in Colab HOT 4
- Doesn't accept num_speakers as argument HOT 2
- whisperx.align has empty word intervals for numbers HOT 1
- Error While Using Machine With Only CPU (EC2 Instance) HOT 2
- No speaker labels in txt format with diarization enabled HOT 3
- Support for vulkan (intel arc gpu)
- IGNORE
- Diarization precision - is there way to improve it? HOT 2
- torchaudio._backend.set_audio_backend has been deprecated. HOT 1
- Probability or score coming from faster-whisper and not alignment model
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from whisperx.