GithubHelp home page GithubHelp logo

Comments (10)

MahmoudAshraf97 avatar MahmoudAshraf97 commented on May 29, 2024 1

Hi everyone, I'm currently working on a branch that doesn't use whisperX alignment thus skipping the pyannote requirement that we don't even use in our project, I'll let you know when it's done

from whisper-diarization.

MahmoudAshraf97 avatar MahmoudAshraf97 commented on May 29, 2024 1

It uses PyTorch underneath so it will work

from whisper-diarization.

fkostadinov avatar fkostadinov commented on May 29, 2024

This is the entire error message I got in the end.

`/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/pyannote/audio/core/io.py:43: UserWarning: torchaudio._backend.set_audio_backend has been deprecated. With dispatcher enabled, this function is no-op. You can remove the function call.
  torchaudio.set_audio_backend("soundfile")
Traceback (most recent call last):

  File "/Users/myself/workspaces/whisper-diarization/diarize.py", line 3, in <module>
    from helpers import *
  File "/Users/myself/workspaces/whisper-diarization/helpers.py", line 7, in <module>
    from whisperx.alignment import DEFAULT_ALIGN_MODELS_HF, DEFAULT_ALIGN_MODELS_TORCH
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/whisperx/__init__.py", line 1, in <module>
    from .transcribe import load_model
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/whisperx/transcribe.py", line 10, in <module>
    from .asr import load_model
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/whisperx/asr.py", line 13, in <module>
    from .vad import load_vad_model, merge_chunks
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/whisperx/vad.py", line 11, in <module>
    from pyannote.audio.pipelines import VoiceActivityDetection
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/pyannote/audio/pipelines/__init__.py", line 26, in <module>
    from .speaker_diarization import SpeakerDiarization
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/pyannote/audio/pipelines/speaker_diarization.py", line 42, in <module>
    from pyannote.audio.pipelines.speaker_verification import PretrainedSpeakerEmbedding
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/pyannote/audio/pipelines/speaker_verification.py", line 56, in <module>
    from nemo.collections.asr.models import (
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/asr/__init__.py", line 15, in <module>
    from nemo.collections.asr import data, losses, models, modules
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/asr/models/__init__.py", line 36, in <module>
    from nemo.collections.asr.models.transformer_bpe_models import EncDecTransfModelBPE
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/asr/models/transformer_bpe_models.py", line 52, in <module>
    from nemo.collections.nlp.modules.common import TokenClassifier
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/nlp/__init__.py", line 15, in <module>
    from nemo.collections.nlp import data, losses, models, modules
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/nlp/models/__init__.py", line 31, in <module>
    from nemo.collections.nlp.models.machine_translation import MTEncDecModel
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/nlp/models/machine_translation/__init__.py", line 15, in <module>
    from nemo.collections.nlp.models.machine_translation.mt_enc_dec_bottleneck_model import MTBottleneckModel
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/nlp/models/machine_translation/mt_enc_dec_bottleneck_model.py", line 23, in <module>
    from nemo.collections.nlp.models.machine_translation.mt_enc_dec_model import MTEncDecModel
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/nlp/models/machine_translation/mt_enc_dec_model.py", line 38, in <module>
    from nemo.collections.common.tokenizers.chinese_tokenizers import ChineseProcessor
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/nemo/collections/common/tokenizers/chinese_tokenizers.py", line 38, in <module>
    import opencc
  File "/Users/myself/workspaces/whisper-diarization/venv/lib/python3.10/site-packages/opencc.py", line 24, in <module>
    libopencc = CDLL('libopencc.so.1', use_errno=True)
  File "/opt/homebrew/Cellar/[email protected]/3.10.14/Frameworks/Python.framework/Versions/3.10/lib/python3.10/ctypes/__init__.py", line 374, in __init__
    self._handle = _dlopen(self._name, mode)

OSError: dlopen(libopencc.so.1, 0x0006): tried: 'libopencc.so.1' (no such file), '/System/Volumes/Preboot/Cryptexes/OSlibopencc.so.1' (no such file), '/opt/homebrew/lib/libopencc.so.1' (no such file), '/System/Volumes/Preboot/Cryptexes/OS/opt/homebrew/lib/libopencc.so.1' (no such file), '/usr/lib/libopencc.so.1' (no such file, not in dyld cache), 'libopencc.so.1' (no such file)`

from whisper-diarization.

rosstex avatar rosstex commented on May 29, 2024

Same

from whisper-diarization.

keeth avatar keeth commented on May 29, 2024

i fixed this with:

brew install opencc
ln -s /opt/homebrew/lib/libopencc.dylib /opt/homebrew/lib/libopencc.so.1

(you just need to symlink it to one of the paths that it is searching for)

now unfortunately i get a KeyError when i pass --device mps so i am stuck using the CPU for this :(

from whisper-diarization.

MahmoudAshraf97 avatar MahmoudAshraf97 commented on May 29, 2024

@keeth Unfortunately I don't have any device to test MPS support, can you contact me through mail or linkedin and I'll gladly add support for it.

from whisper-diarization.

Sing303 avatar Sing303 commented on May 29, 2024

Will nemo work without an Nvidia graphics card?

from whisper-diarization.

pakerfeldt avatar pakerfeldt commented on May 29, 2024

Idk, I'm struggling like crazy as well. Tried to get this installed on an Ubuntu server. Gave up and tried to follow your steps here to install on my Macbook with Apple Silicon.

Last thing I did was installing opencc using brew and putting the so in the same folder as I'm running diarize.py. The error I get when running is:

ModuleNotFoundError: No module named 'ctc_forced_aligner'

Makes sense because I did run the requirements.txt manually one by one and skipped ctc_forced_aligner since it .. doesn't work 😄

Running pip install git+https://github.com/MahmoudAshraf97/ctc-forced-aligner.git yields

      /Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/__algorithm/max.h:31:1: note: candidate function template not viable: requires 3 arguments, but 2 were provided
      max(const _Tp& __a, const _Tp& __b, _Compare __comp)
      ^
      ctc_forced_aligner/forced_align_impl.cpp:131:33: error: no matching function for call to 'max'
              ltrIdx - backPtr_offset[std::max(t - 1, static_cast<long int>(0))];
                                      ^~~~~~~~
      /Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/__algorithm/max.h:40:1: note: candidate template ignored: deduced conflicting types for parameter '_Tp' ('int64_t' (aka 'long long') vs. 'long')
      max(const _Tp& __a, const _Tp& __b)
      ^
      /Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/__algorithm/max.h:51:1: note: candidate template ignored: could not match 'initializer_list<_Tp>' against 'int64_t' (aka 'long long')
      max(initializer_list<_Tp> __t, _Compare __comp)
      ^
      /Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/__algorithm/max.h:60:1: note: candidate function template not viable: requires single argument '__t', but 2 arguments were provided
      max(initializer_list<_Tp> __t)
      ^
      /Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/__algorithm/max.h:31:1: note: candidate function template not viable: requires 3 arguments, but 2 were provided
      max(const _Tp& __a, const _Tp& __b, _Compare __comp)
      ^
      2 errors generated.
      error: command '/usr/bin/clang' failed with exit code 1
      [end of output]

  note: This error originates from a subprocess, and is likely not a problem with pip.
  ERROR: Failed building wheel for ctc-forced-aligner
  Running setup.py clean for ctc-forced-aligner
Failed to build ctc-forced-aligner
ERROR: Could not build wheels for ctc-forced-aligner, which is required to install pyproject.toml-based projects

from whisper-diarization.

MahmoudAshraf97 avatar MahmoudAshraf97 commented on May 29, 2024

@pakerfeldt Hi, this is the new alignment package that was merged recently, can you send me an email or a message on linkedin so we can debug this?

from whisper-diarization.

pakerfeldt avatar pakerfeldt commented on May 29, 2024

@MahmoudAshraf97 Sent you a message on LinkedIn!

from whisper-diarization.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.