Comments (10)
Hi everyone, I'm currently working on a branch that doesn't use whisperX alignment, which skips the pyannote requirement that we don't even use in our project. I'll let you know when it's done.
from whisper-diarization.
It uses PyTorch underneath so it will work
from whisper-diarization.
This is the entire error message I got in the end.
`/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/pyannote/audio/core/io.py:43: UserWarning: torchaudio._backend.set_audio_backend has been deprecated. With dispatcher enabled, this function is no-op. You can remove the function call.
torchaudio.set_audio_backend("soundfile")
Traceback (most recent call last):
File "/Users/myself/workspaces/whisper-diarization/diarize.py", line 3, in <module>
from helpers import *
File "/Users/myself/workspaces/whisper-diarization/helpers.py", line 7, in <module>
from whisperx.alignment import DEFAULT_ALIGN_MODELS_HF, DEFAULT_ALIGN_MODELS_TORCH
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/whisperx/__init__.py", line 1, in <module>
from .transcribe import load_model
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/whisperx/transcribe.py", line 10, in <module>
from .asr import load_model
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/whisperx/asr.py", line 13, in <module>
from .vad import load_vad_model, merge_chunks
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/whisperx/vad.py", line 11, in <module>
from pyannote.audio.pipelines import VoiceActivityDetection
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/pyannote/audio/pipelines/__init__.py", line 26, in <module>
from .speaker_diarization import SpeakerDiarization
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/pyannote/audio/pipelines/speaker_diarization.py", line 42, in <module>
from pyannote.audio.pipelines.speaker_verification import PretrainedSpeakerEmbedding
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/pyannote/audio/pipelines/speaker_verification.py", line 56, in <module>
from nemo.collections.asr.models import (
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/asr/__init__.py", line 15, in <module>
from nemo.collections.asr import data, losses, models, modules
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/asr/models/__init__.py", line 36, in <module>
from nemo.collections.asr.models.transformer_bpe_models import EncDecTransfModelBPE
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/asr/models/transformer_bpe_models.py", line 52, in <module>
from nemo.collections.nlp.modules.common import TokenClassifier
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/nlp/__init__.py", line 15, in <module>
from nemo.collections.nlp import data, losses, models, modules
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/nlp/models/__init__.py", line 31, in <module>
from nemo.collections.nlp.models.machine_translation import MTEncDecModel
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/nlp/models/machine_translation/__init__.py", line 15, in <module>
from nemo.collections.nlp.models.machine_translation.mt_enc_dec_bottleneck_model import MTBottleneckModel
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/nlp/models/machine_translation/mt_enc_dec_bottleneck_model.py", line 23, in <module>
from nemo.collections.nlp.models.machine_translation.mt_enc_dec_model import MTEncDecModel
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/nlp/models/machine_translation/mt_enc_dec_model.py", line 38, in <module>
from nemo.collections.common.tokenizers.chinese_tokenizers import ChineseProcessor
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/common/tokenizers/chinese_tokenizers.py", line 38, in <module>
import opencc
File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/opencc.py", line 24, in <module>
libopencc = CDLL('libopencc.so.1', use_errno=True)
File "/opt/homebrew/Cellar/python@3.10/3.10.14/Frameworks/Python.framework/Versions/3.10/lib/python3.10/ctypes/__init__.py", line 374, in __init__
self._handle = _dlopen(self._name, mode)
OSError: dlopen(libopencc.so.1, 0x0006): tried: 'libopencc.so.1' (no such file), '/System/Volumes/Preboot/Cryptexes/OSlibopencc.so.1' (no such file), '/opt/homebrew/lib/libopencc.so.1' (no such file), '/System/Volumes/Preboot/Cryptexes/OS/opt/homebrew/lib/libopencc.so.1' (no such file), '/usr/lib/libopencc.so.1' (no such file, not in dyld cache), 'libopencc.so.1' (no such file)`
from whisper-diarization.
Same
from whisper-diarization.
I fixed this with:
brew install opencc
ln -s /opt/homebrew/lib/libopencc.dylib /opt/homebrew/lib/libopencc.so.1
(You just need to symlink it into one of the paths that the loader searches.)
Unfortunately, I now get a KeyError when I pass --device mps, so I'm stuck using the CPU for this :(
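For anyone who would rather script the workaround above than type it by hand, here is a minimal Python sketch of the same idea: look for the Homebrew dylib and link it under the name the loader is asking for. The helper names `find_library` and `ensure_symlink` are made up for this example, and the paths are simply the ones from the comment above, so adjust them if your Homebrew prefix differs.

```python
from pathlib import Path
from typing import Iterable, Optional


def find_library(search_dirs: Iterable[Path], names: Iterable[str]) -> Optional[Path]:
    """Return the first existing file matching any name in any search dir."""
    names = list(names)
    for directory in search_dirs:
        for name in names:
            candidate = Path(directory) / name
            if candidate.exists():
                return candidate
    return None


def ensure_symlink(target: Path, link: Path) -> bool:
    """Create link -> target unless something already exists at link.

    Returns True if a new symlink was created, False otherwise.
    """
    if link.exists() or link.is_symlink():
        return False
    link.symlink_to(target)
    return True


if __name__ == "__main__":
    # Paths taken from the comment above (assumption: default Apple Silicon prefix).
    lib_dir = Path("/opt/homebrew/lib")
    dylib = find_library([lib_dir], ["libopencc.dylib"])
    if dylib is None:
        print("libopencc.dylib not found; run `brew install opencc` first")
    else:
        created = ensure_symlink(dylib, lib_dir / "libopencc.so.1")
        print("symlink created" if created else "libopencc.so.1 already present")
```

The script never overwrites an existing file or link, so it is safe to run more than once.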
from whisper-diarization.
@keeth Unfortunately, I don't have a device to test MPS support on. Can you contact me by email or on LinkedIn? I'll gladly add support for it.
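Until MPS is supported, a fallback along these lines might help. This is only a sketch, not a diagnosis of the KeyError and not how diarize.py actually handles `--device`: it asks PyTorch which backends are usable and drops back to the CPU when the requested one is not. `pick_device` and `probe_backends` are hypothetical names for this example; `torch.cuda.is_available()` and `torch.backends.mps.is_available()` are the real PyTorch calls.

```python
from typing import Tuple


def pick_device(requested: str, cuda_ok: bool, mps_ok: bool) -> str:
    """Return the requested device if it is usable, otherwise fall back to "cpu"."""
    available = {"cpu": True, "cuda": cuda_ok, "mps": mps_ok}
    # Strip an index such as "cuda:0" before looking the backend up.
    kind = requested.split(":")[0]
    # Unknown device names are treated as unavailable rather than raising.
    return requested if available.get(kind, False) else "cpu"


def probe_backends() -> Tuple[bool, bool]:
    """Ask PyTorch which backends are usable; report none if torch is missing."""
    try:
        import torch
    except ImportError:
        return False, False
    cuda_ok = torch.cuda.is_available()
    # torch.backends.mps is only present in PyTorch 1.12 and later.
    mps_backend = getattr(torch.backends, "mps", None)
    mps_ok = bool(mps_backend and mps_backend.is_available())
    return cuda_ok, mps_ok


if __name__ == "__main__":
    cuda_ok, mps_ok = probe_backends()
    print(pick_device("mps", cuda_ok, mps_ok))
```

Keeping the decision in a pure function makes it easy to check on a machine that has neither CUDA nor MPS.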
from whisper-diarization.
Will nemo work without an Nvidia graphics card?
from whisper-diarization.
Idk, I'm struggling like crazy as well. I tried to get this installed on an Ubuntu server, gave up, and then followed your steps here to install it on my MacBook with Apple Silicon.
The last thing I did was install opencc using brew and put the .so in the same folder I'm running diarize.py from. The error I get when running it is:
ModuleNotFoundError: No module named 'ctc_forced_aligner'
That makes sense, because I ran through requirements.txt manually, one package at a time, and skipped ctc_forced_aligner since it .. doesn't work 😄
Running pip install git+https://github.com/MahmoudAshraf97/ctc-forced-aligner.git yields:
/Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/__algorithm/max.h:31:1: note: candidate function template not viable: requires 3 arguments, but 2 were provided
max(const _Tp& __a, const _Tp& __b, _Compare __comp)
^
ctc_forced_aligner/forced_align_impl.cpp:131:33: error: no matching function for call to 'max'
ltrIdx - backPtr_offset[std::max(t - 1, static_cast<long int>(0))];
^~~~~~~~
/Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/__algorithm/max.h:40:1: note: candidate template ignored: deduced conflicting types for parameter '_Tp' ('int64_t' (aka 'long long') vs. 'long')
max(const _Tp& __a, const _Tp& __b)
^
/Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/__algorithm/max.h:51:1: note: candidate template ignored: could not match 'initializer_list<_Tp>' against 'int64_t' (aka 'long long')
max(initializer_list<_Tp> __t, _Compare __comp)
^
/Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/__algorithm/max.h:60:1: note: candidate function template not viable: requires single argument '__t', but 2 arguments were provided
max(initializer_list<_Tp> __t)
^
/Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/__algorithm/max.h:31:1: note: candidate function template not viable: requires 3 arguments, but 2 were provided
max(const _Tp& __a, const _Tp& __b, _Compare __comp)
^
2 errors generated.
error: command '/usr/bin/clang' failed with exit code 1
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
ERROR: Failed building wheel for ctc-forced-aligner
Running setup.py clean for ctc-forced-aligner
Failed to build ctc-forced-aligner
ERROR: Could not build wheels for ctc-forced-aligner, which is required to install pyproject.toml-based projects
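Since the ModuleNotFoundError above came from a requirement that was skipped during a one-by-one install, a quick check like the following can list which modules Python cannot find before diarize.py is run. It is a sketch using only the standard library. The helper name `missing_modules` is made up for this example, and the module list is just the import names that show up in the tracebacks in this thread, not the project's full requirements.

```python
from importlib.util import find_spec
from typing import Iterable, List


def missing_modules(module_names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be found for import."""
    return [name for name in module_names if find_spec(name) is None]


if __name__ == "__main__":
    # Import names seen in the tracebacks in this thread (not an exhaustive list).
    wanted = ["ctc_forced_aligner", "whisperx", "nemo", "opencc"]
    for name in missing_modules(wanted):
        print(f"missing: {name}")
```

Note that this only tells you whether a module can be located. A module that is found can still fail at import time, as opencc did above when its shared library was missing.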
from whisper-diarization.
@pakerfeldt Hi, this is the new alignment package that was merged recently. Can you send me an email or a message on LinkedIn so we can debug this?
from whisper-diarization.
@MahmoudAshraf97 Sent you a message on LinkedIn!
from whisper-diarization.