<a target="_blank" rel="noopener noreferrer nofollow" href="https://user-images.github

Memory leaks in AdamTrainer with RegressionLayer about convnetsharp HOT 5 CLOSED

raBpywa commented on August 18, 2024

Memory leaks in AdamTrainer with RegressionLayer

from convnetsharp.

Comments (5)

bjornbergq commented on August 18, 2024

No It's the list of Volumes for gmax and smax that is not disposed(?) correctly. I don't see a reason why it should dispose them but during training of my model I found that the GPU-class has ALOT of memory leaks. I fixed some of them, but this one is a mystery to me.

It creates a list of volumes, references the layers gradients and updates it, never destroying the list. BUT somewhere along the line it uses more and more GPU Memory. I think it has to do with how the GPU-class adopts the volume, maybe it gets a new pointer in memory after all(?).

from convnetsharp.

bjornbergq commented on August 18, 2024

OH, after a quick look again I think i found it. It restores the Storage on the device, making duplicates of the volume, instead of having a pointer to the same location. So do it even update the same values then?

from convnetsharp.

bjornbergq commented on August 18, 2024

Correction!

The other stuff was correct. But the adam trainer clones preivous volumes, hence creates more memory. Perahps the .Clone part should not be there?

from convnetsharp.

cbovar commented on August 18, 2024

Do you have a small sample that reproduces the leak ? I will look if that Clone is necessary.

I have merged a pull request just now that solves some GPU memory problem => Could you test with latest code?

Also, in Debug, you will always see lots of transfer between host and device because of here and here

from convnetsharp.

cbovar commented on August 18, 2024

I haven't found the leak yet. However there is a lot of volume instantiations in the trainers. I will introduce a pool of Volumes in VolumeBuilder to recycle them rather than disposing them and allocating them again.

from convnetsharp.

Recommend Projects

Memory leaks in AdamTrainer with RegressionLayer about convnetsharp HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs