Comments (3)
Turns out this was a good-old "too high a learn-rate" problem...
I was using default optimizer settings for lr
, betas
, eps
and weight_decay
Using the base configuration on audio-diffusion-pytorch-trainer solved this issue:
self.optimizer = torch.optim.AdamW(
params = list(self.model.parameters()),
lr = 1e-4,
betas= (0.95, 0.999),
eps= 1e-6,
weight_decay= 1e-3)
from audio-diffusion-pytorch.
I didn't thoroughly test vk diffusion (usually I go with v) but also never had exploding problems. Check that your dataset is distributed equally, i.e. if there are suddenly in a batch multiple silent samples that can mess up the model. For that, you might want to use the WAVDataset in https://github.com/archinetai/audio-data-pytorch with the check_silence
set to true
, or do some similar checks.
from audio-diffusion-pytorch.
I am actually, using WAVDataset with check_silence
set to True
- which is the default, also, the datasets I'm using are taken from one-shot sound packs and another was a set of wavetables, so that's not the problem apparently...
Thanks for the help btw! :)
from audio-diffusion-pytorch.
Related Issues (20)
- Add support to clip predicted samples to the desired range. HOT 2
- Alternative Noises: Offset, Pyramid, Pink HOT 2
- Spectrogram-based diffusion model HOT 2
- New Try
- Questions about conditional generation HOT 2
- Trained models
- Languages HOT 1
- Model architectures from the paper HOT 4
- Future Work - Models
- Can the repo be used to process MIDI data๏ผ
- AssertionError: ClassiferFreeGuidancePlugin requires embedding
- Class-conditional generation HOT 1
- I have a few questions about 1D-UNet HOT 4
- What is the structure of encoder in diffusionAE? HOT 1
- CUDA OF Memory for 80GB A100 : follow the mousai paper setting of text condition
- Unconditional model generates okay quality of fake human voice but failed on music. HOT 4
- RuntimeError: The size of tensor a (37) must match the size of tensor b (36) at non-singleton dimension 2 HOT 4
- Unconditional Generation generates noise
- RuntimeError: The size of tensor a (91) must match the size of tensor b (90) at non-singleton dimension 2
- Weird spikes in the loss
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from audio-diffusion-pytorch.