
act's People

Contributors

xinhaomei


act's Issues

Using the pretrained model on test data

Hi, thanks for the code. I want to use the pretrained model to run inference on my own data, which consists only of audio files (no csv files for training). I set up the environment, put the audio files in the data folder (waveforms), and updated the settings.yaml file as instructed (eval mode, pretrained model, etc.). Could you please guide me on how to run inference now? I tried running train.py, but it gives errors about a missing train.h5 file, among others. Thanks.

Can you upload ACT_m_scratch and ACT_m_DeiT as well?

Hello, Dr. Mei! Thank you for open-sourcing this awesome work.

While the procedure to reproduce ACT_m_DeiT and ACT_m_scratch is sufficiently explained, I was wondering whether you could upload them, or otherwise share the models you used to produce the results table, as you did with the ACT_*_AudioSet_DeiT series.

Thank you!

Downloading data

Hi,
Can you provide another way to download the AudioCaps data? I can't use Baidu.

Evaluation problem

Hello, nice work!
I replicated your work and trained the model from scratch myself (the encoder used the pretrained model, audioset_deit).
To evaluate the model, I set config.path.eval_model to the path of my model, but something goes wrong when I load the state_dict.

[Screenshot of the error raised when loading the state_dict]

I do not know why there is nothing about the decoder. Could you help me with this problem? Thank you in advance for your reply!
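
One way to narrow down a report like this is to list which top-level modules actually appear among the saved parameter names. The following is only a minimal sketch, not code from the repository: the function name `count_key_prefixes` and the example parameter names are hypothetical, and the real names depend on how the model and checkpoint were saved.

```python
from collections import Counter


def count_key_prefixes(keys, depth=1):
    """Count parameter names by their first `depth` dot-separated components."""
    counts = Counter()
    for key in keys:
        prefix = ".".join(key.split(".")[:depth])
        counts[prefix] += 1
    return dict(counts)


# Hypothetical parameter names, for illustration only.
example_keys = [
    "encoder.patch_embed.weight",
    "encoder.blocks.0.attn.qkv.weight",
    "decoder.layers.0.self_attn.in_proj_weight",
    "word_emb.weight",
]
print(count_key_prefixes(example_keys))
```

With PyTorch, the keys could come from the object returned by `torch.load(path, map_location="cpu")`. If that object wraps the weights inside another dict (an assumption; the layout of the checkpoint is not shown in the issue), the keys of the inner dict are the ones to pass in. A missing `decoder` prefix in the counts would suggest that the file holds only encoder weights.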

Issues with training files

Hi, thank you for this valuable resource.
Unzipping the train.zip provided on Google Drive gives the following error message:
file #9024: bad zipfile offset (local header sig): 4819319277
error: invalid zip file with overlapped components (possible zip bomb)

How shall I resolve this issue?
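
The issue does not record a confirmed fix. One thing that may be worth trying is to check and extract the archive with Python's standard `zipfile` module instead of the `unzip` command, since the reported offset is beyond 4 GB and some `unzip` builds are known to struggle with archives of that size. The sketch below assumes the download itself is complete; the helper name `check_and_extract` and the output directory are hypothetical.

```python
import zipfile


def check_and_extract(zip_path, out_dir):
    """Verify every member, then extract.

    Returns the name of the first corrupt member, or None if all members
    passed the check and were extracted.
    """
    with zipfile.ZipFile(zip_path) as archive:
        bad_member = archive.testzip()  # None when every CRC and header is valid
        if bad_member is None:
            archive.extractall(out_dir)
        return bad_member
```

A call such as `check_and_extract("train.zip", "data")` would return `None` on success. If it returns a file name or raises `zipfile.BadZipFile`, the archive is likely damaged and downloading it again would be the next step.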
