This repository contains the dataset and detection code from our paper "AI-Synthesized Voice Detection Using Neural Vocoder Artifacts", accepted at the CVPR Workshop on Media Forensics 2023.
I have some questions regarding the evaluation metrics and results presented in Sections 4.4 and 4.5.
Intra-dataset Evaluation (Section 4.4)
The paper reports a very low EER of 0.19% on the WaveFake dataset using the RawNet2 model.
To confirm my understanding, was this evaluation performed with the model being trained and tested on the same WaveFake dataset?
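For context on how I am interpreting the reported EER: the Equal Error Rate is the operating point where the false acceptance rate equals the false rejection rate. Below is a minimal, illustrative threshold-sweep sketch of that computation (my own simplification, not the evaluation script used in the paper; the score convention of "higher = more likely fake" is an assumption).

```python
def compute_eer(scores, labels):
    """Approximate Equal Error Rate via a threshold sweep.

    scores: detector outputs, assumed higher = more likely fake.
    labels: 1 for fake (synthesized), 0 for real (bona fide).
    Returns the mean of FAR and FRR at the threshold where they are closest.
    """
    n_real = labels.count(0)
    n_fake = labels.count(1)
    best = None
    for t in sorted(set(scores)):
        # False acceptance: a real sample scored at/above the threshold.
        fa = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= t)
        # False rejection: a fake sample scored below the threshold.
        fr = sum(1 for s, y in zip(scores, labels) if y == 1 and s < t)
        far = fa / max(1, n_real)
        frr = fr / max(1, n_fake)
        if best is None or abs(far - frr) < abs(best[0] - best[1]):
            best = (far, frr)
    return (best[0] + best[1]) / 2
```

With perfectly separable scores this returns 0.0, matching the intuition that an EER of 0.19% means the real and fake score distributions barely overlap.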
Cross-dataset Evaluation (Section 4.5)
In contrast, the EER increased sharply to 26.95% when the model trained on the LibriSeVoc dataset was tested on the WaveFake dataset, which suggests poor generalization to unseen vocoders and recording conditions.
Are there any ongoing efforts to improve this aspect of the model, perhaps through domain adaptation techniques or exposure to a more diverse set of vocoders during training?