Comments (5)
This is probably because whisper uses temperature fallback and beam search, which whisper-timestamped does not support yet (see issue #10, work in progress).
Can you please try whisper with the options
--beam_size None --temperature_increment None
and tell me whether you still see the duplicate fragments?
If you do, your problem will be solved once issue #10 is solved.
from whisper-timestamped.
@romanzoniit You can now use the option --accurate in whisper_timestamped to reproduce the default options of whisper (see README for more information).
Can you please check if that solves this issue for you?
I think this issue is solved. Don't hesitate to re-open it if the problem persists after my last comments.
@romanzoniit You can now use the option --accurate in whisper_timestamped to reproduce the default options of whisper (see README for more information). Can you please check if that solves this issue for you?
How can I use this in Python rather than from the console?
It's explained in the README:
By default, all options that require several steps of decoding are disabled, in favour of an efficient decoding strategy. Use
beam_size=5, best_of=5, temperature=(0.0, 0.2, 0.4, 0.6, 0.8, 1.0)
to reproduce the Whisper defaults.
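For anyone who wants this from Python, here is a minimal sketch of how those options could be passed to whisper_timestamped. It assumes the package is installed and follows the usage shown in its README (load_audio, load_model, transcribe). The helper name transcribe_accurate, the ACCURATE_OPTIONS dict, the "tiny" model and the AUDIO.wav path are only illustrative, not part of the library.

```python
# Decoding options quoted above as the Whisper defaults.
ACCURATE_OPTIONS = {
    "beam_size": 5,
    "best_of": 5,
    "temperature": (0.0, 0.2, 0.4, 0.6, 0.8, 1.0),
}


def transcribe_accurate(audio_path, model_name="tiny", device="cpu"):
    """Transcribe audio_path using the Whisper default decoding options.

    Hypothetical helper: the name and default arguments are illustrative.
    """
    # Imported lazily so the options can be inspected without the package.
    import whisper_timestamped as whisper

    audio = whisper.load_audio(audio_path)
    model = whisper.load_model(model_name, device=device)
    # Extra keyword arguments are forwarded as decoding options.
    return whisper.transcribe(model, audio, **ACCURATE_OPTIONS)


# Example call (placeholder file name):
# result = transcribe_accurate("AUDIO.wav")
# print(result["text"])
```

Expect this to be slower than the default settings, since beam search and temperature fallback both need several decoding passes.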