Comments (10)
Seems like the model is loaded successfully, so it's weird why it runs forever!
Is the short_audio_file
var holds the path to your mp3 file as a str
? Have you tried other files and you always run into the same issue ?
from pywhispercpp.
Hi, thanks for your reply. Yes, it's a str that holds the path. I haven't tried another file, but honestly it's a pretty basic mp3 of just spoken text with no additional sounds.
You can try yourself, I attached the file (It's zipped, so I can upload it to GitHub. It's not from my own video. It's a random Short from YouTube. So enjoy some Dragonball content)
input_short.zip
from pywhispercpp.
Hi @03l54rd1n3,
Thanks for providing the file, it took less than 4s on my machine to generate the results :
{model.py:133} INFO - Inference time: 3.481 s
[t0=0, t1=242, text=Why does Vegeta always hold his left arm?, t0=242, t1=528, text=Vegeta has multiple poses that are very distinctive of him,, t0=528, t1=778, text=for instance the infamous self-pointing thumb., t0=778, t1=1194, text=A very different one however, is that in which he holds his left arm in pain., t0=1194, t1=1604, text=Vegeta has gone through a lot of different battles and has sustained a crazy amount of injuries., t0=1604, t1=2158, text=But for some reason most of the time he always ends up holding his left arm as if he had some sort of chronic pain., t0=2158, t1=2504, text=A lot of people thought back then that this was because of Andrew at 18,, t0=2504, t1=2690, text=who really did a number on his left arm., t0=2690, t1=2784, text=Nevertheless,, t0=2784, t1=3168, text=it is possible to see Vegeta holding his left arm already in the namics saga., t0=3168, t1=3578, text=This implies that if Vegeta really does have some sort of chronic injury in his left arm,, t0=3578, t1=3800, text=then it must be previous to the android saga., t0=3800, t1=4268, text=Also, this is an injury that no sends a beans or dragon ball resurrection has been able to heal,, t0=4268, t1=4564, text=so whatever it is, it must be deeply rooted within his body.]
So there is something wrong with your installation.
Do you have ffmpeg
installed ?
from pywhispercpp.
Hi, thank you for your reply. Yes it's installed through apt. And I installed your tool through pip. Wonder what it is then... I'll check the whisper.cpp requirements as well...
from pywhispercpp.
Yes, try to compile and run whisper.cpp
first and let me know if that works.
from pywhispercpp.
OK, your tool works fine from the CLI (pwcpp). Original whisper.cpp also works. Seems like the unexpected behavior is just in python (script file or notebook). Any idea why it only happens there?
from pywhispercpp.
Correction, it happens in python when using the n_threads argument. Without that it works. The tools seems to deadlock. I'm on linux if that is relevant for you.
from pywhispercpp.
I only use Linux as well and this never happened.
But how many threads does your CPU support ?
from pywhispercpp.
good question. I have 4 cores. It's some 7th gen Intel i7, not the best, but with 16GB of RAM, the laptop still manages most tasks pretty well.
I just tried a couple of times again. In the python script it actually now works with n_threads
set to 2 or 4. In the notebook with it set to 1 or 2, sometimes I get to transcribing
, but no results. Sometimes it gets locked before that at kv cross size
.
from pywhispercpp.
Yeah it's good, but obviously you cannot go above your resources, so n_threads
should not exceed 4 (which is the default by the way).
So as long as it's running in a script then everything is good, you have to check your Jupyter notebook environment, I have also re-checked now in colab notebooks and it's working without any problem.
from pywhispercpp.
Related Issues (20)
- Unknown language error HOT 4
- How to make transcription and speaker diarization using pywhispercpp HOT 1
- How to use coreML models in Mac M2? HOT 3
- "Cannot find source file: ggml.h" when trying to install on Ubuntu 22.04 on aarch64 HOT 3
- Integrating pywhispercpp as the first extension to lollms-webui HOT 2
- Nothing happens HOT 13
- pywhispercpp/whisper.cpp/ggml-opencl.c:4:10: fatal error: 'clblast_c.h' file not found #include <clblast_c.h> HOT 9
- ERROR - Invalid model name `./model.bin` HOT 1
- ERROR - unable to initialize from path HOT 5
- Unable to install on raspberry pi 4 HOT 5
- How to add space between subtitles? HOT 1
- word-level timestamps? HOT 5
- About GPU question HOT 1
- Using the agent for interacting with ollama models HOT 29
- Unable to load `quantized` models HOT 3
- _pywhispercpp module could not be found HOT 28
- "ggml-metal.metal" file couldn't be found when loading the large-v3 model for CoreML HOT 7
- failed to compute log mel spectrogram HOT 3
- Installation from source leads to non-functional installation HOT 14
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pywhispercpp.