Hi, I am using your function to recognize speaker identification. I

Thank you for your interest <a class="user-mention notranslate" data-hovercard-type="u

speaker identification input WAV file about 3d-convolutional-speaker-recognition HOT 3 CLOSED

astorfi commented on July 19, 2024

speaker identification input WAV file

from 3d-convolutional-speaker-recognition.

Comments (3)

astorfi commented on July 19, 2024

Thank you for your interest @GawainLee. The overall process is described in the associated paper. If you are new to the topic, please read the paper at first.

Bests

from 3d-convolutional-speaker-recognition.

ovninosa commented on July 19, 2024

Astorfi,

I just read the paper and the repo, thanks for your time and effort.

I see the hdf5 file, this is the dataset features from all of the speakers, right?

I think the main idea of this question is about doing the training and verification with wav's dataset.

There is a simple way to do that??
Or I need to extract all the feature with a third party soft like SIDEKIT?

Thanks,
Jave

from 3d-convolutional-speaker-recognition.

astorfi commented on July 19, 2024

@javenosa Thanks for your kind words.

No ... The hdf5 is just a sample for showcasing how to represent the data to the network and give the demo the ability to run promptly ... You should generate your own custom data and the details are available in the associated paper.

HDF5 generation is not necessary for the general architecture design and model training. However, the code needs to be modified a little bit to incorporate the features in case you wanna you some other file format! Storing files in HDF5, TFrecords and etc formats is suggested though since it makes the whole process much simpler and faster although data file generation itself add a little bit of complexity.

The features must be extracted anyway. I used SpeechPy which is my own developed package for speech feature extracted but you use any package that you are more comfortable using.

from 3d-convolutional-speaker-recognition.

Recommend Projects

speaker identification input WAV file about 3d-convolutional-speaker-recognition HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs