Comments (3)
Hi
Are you using the latest version of AutoSub? If yes, I switched the default inference to Coqui STT as it has better support for different languages. You can change this by setting --engine to "ds" while running main.py and checking again.
If you are sure that you're using DeepSpeech, you can play around with the default parameter values here.
> Are you using the latest version of AutoSub?
Yes, I am using the latest master branch.
> If yes, I switched the default inference to Coqui STT as it has better support for different languages. You can change this by setting --engine to "ds" while running main.py and checking again.
Yes, I also changed the engine to DeepSpeech:
(sub) derry@10700k:~/ws/AutoSub$ python3 autosub/main.py --engine ds --file ./qjbBeORPUA4-oo9mOmdonl.mp4
[INFO] ARGS: Namespace(dry_run=False, engine='ds', file='./qjbBeORPUA4-oo9mOmdonl.mp4', format=['srt', 'vtt', 'txt'], model=None, scorer=None, split_duration=5)
[INFO] Model: /home/derry/ws/AutoSub/deepspeech-0.9.3-models.pbmm
[INFO] Scorer: /home/derry/ws/AutoSub/deepspeech-0.9.3-models.scorer
[INFO] Input file: ./qjbBeORPUA4-oo9mOmdonl.mp4
[INFO] Extracted audio to audio/qjbBeORPUA4-oo9mOmdonl.wav
[INFO] Splitting on silent parts in audio file
[INFO] Running inference...
TensorFlow: v2.3.0-6-g23ad988
DeepSpeech: v0.9.3-0-gf2e9c85
...
> play around with the default parameter values here.
Thanks for the hints, I got 'better' results using these values: smoothing_window=0.5, weight=0.01.
However, I don't really understand how these parameters work, or the magic numbers for st_win and st_step. Can you explain a little bit?
I think a switch to disable silence removal is needed for non-movie videos (full conversations). :)
This is a better explanation.
Thanks for the suggestion about silence removal. I'll think about how to decouple it from splitting the file.
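For readers with the same question about st_win, st_step, smoothing_window and weight, here is a rough, simplified sketch of how energy-based silence detection with these four knobs can work. It is not AutoSub's code and not the code of the library it calls; the function names and default values below are hypothetical and chosen only for illustration. The assumption is that st_win and st_step are the short-term frame length and hop in seconds, smoothing_window is the span in seconds over which the per-frame score is averaged, and weight places the threshold between the quietest and loudest frames (closer to 0 keeps more audio).

```python
def frame_energies(samples, sample_rate, st_win, st_step):
    """Mean squared amplitude of each short-term frame."""
    win = max(1, int(round(st_win * sample_rate)))    # frame length in samples
    step = max(1, int(round(st_step * sample_rate)))  # hop between frames
    energies = []
    for start in range(0, len(samples) - win + 1, step):
        frame = samples[start:start + win]
        energies.append(sum(x * x for x in frame) / win)
    return energies


def smooth(values, half):
    """Moving average that looks `half` frames to each side."""
    out = []
    for i in range(len(values)):
        chunk = values[max(0, i - half):i + half + 1]
        out.append(sum(chunk) / len(chunk))
    return out


def find_speech_segments(samples, sample_rate, st_win=0.05, st_step=0.05,
                         smoothing_window=0.5, weight=0.5):
    """Return (start_sec, end_sec) pairs judged to be non-silent.

    Hypothetical helper: the defaults are illustrative, not AutoSub's.
    """
    energies = frame_energies(samples, sample_rate, st_win, st_step)
    if not energies:
        return []
    half = max(1, int(round(smoothing_window / st_step)) // 2)
    smoothed = smooth(energies, half)

    # Threshold sits between the mean of the quietest 10% of frames and
    # the mean of the loudest 10%; `weight` slides it from low to high.
    ordered = sorted(smoothed)
    tenth = max(1, len(ordered) // 10)
    low = sum(ordered[:tenth]) / tenth
    high = sum(ordered[-tenth:]) / tenth
    threshold = (1 - weight) * low + weight * high

    segments, start = [], None
    for i, value in enumerate(smoothed):
        if value > threshold and start is None:
            start = i                      # segment opens
        elif value <= threshold and start is not None:
            segments.append((start * st_step, i * st_step))
            start = None                   # segment closes
    if start is not None:
        segments.append((start * st_step, len(smoothed) * st_step))
    return segments


if __name__ == "__main__":
    rate = 1000
    tone = [1.0 if i % 2 else -1.0 for i in range(rate)]
    signal = [0.0] * rate + tone + [0.0] * rate   # silence, sound, silence
    print(find_speech_segments(signal, rate, smoothing_window=0.1, weight=0.5))
    print(find_speech_segments(signal, rate, smoothing_window=0.1, weight=0.01))
```

In this sketch a smaller weight lowers the threshold, so segment boundaries extend further into the quiet frames around speech. That is consistent with weight=0.01 dropping less audio, although the real implementation may score frames differently.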