GithubHelp home page GithubHelp logo

simplediarization's Issues

Does it support direct Python calls?

I need to apply it to the code now. Do you have any encapsulated methods that can be directly called? If so, I haven't found any relevant documents, please give me some tips.

Inquiries regarding your intriguing research

Hello, as a student interested in the field of speech, first of all, thank you for your wonderful research.
I was also surprised by the very good diarization performance for arbitrary data on YouTube.

After using it with interest, I have a question as below and leave it on the issue.

  • Diarization was not successful for audio files whose sample rate is not 16000.
    Is it because most of the audio files used for the learning dataset have a sample rate of 16000?
    (I compared the same audio file with different sample rates)

  • I wonder if you have any further research plans on diarization in the section where the utterances overlap.

Thank you for your wonderful research. Have a good day๐Ÿ˜Š

ValueError: Found array with dim 3. AgglomerativeClustering expected <= 2.

Hello,

I have a file with three speakers:

yt-dlp -f bestaudio --extract-audio --audio-format wav --postprocessor-args "-ar 16000" --audio-quality 0 "https://www.youtube.com/watch?v=qHrN5Mf5sgo"

I believe that makes the clustering fail:

โ”‚ /home/emoman/.local/lib/python3.11/site-packages/sklearn/utils/validation.py:915 in check_array  โ”‚
โ”‚                                                                                                  โ”‚
โ”‚    912 โ”‚   โ”‚   โ”‚   โ”‚   "Convert your data to numeric values explicitly instead."                 โ”‚
โ”‚    913 โ”‚   โ”‚   โ”‚   )                                                                             โ”‚
โ”‚    914 โ”‚   โ”‚   if not allow_nd and array.ndim >= 3:                                              โ”‚
โ”‚ โฑ  915 โ”‚   โ”‚   โ”‚   raise ValueError(                                                             โ”‚
โ”‚    916 โ”‚   โ”‚   โ”‚   โ”‚   "Found array with dim %d. %s expected <= 2."                              โ”‚
โ”‚    917 โ”‚   โ”‚   โ”‚   โ”‚   % (array.ndim, estimator_name)                                            โ”‚
โ”‚    918 โ”‚   โ”‚   โ”‚   )                                                                             โ”‚
โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ
ValueError: Found array with dim 3. AgglomerativeClustering expected <= 2.

Any ideas?

Best,

Ed

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.