Hi could you please share the memory required to pretrain on Electricity and Traffic d

Memory required to pretrain on Electricity and Traffic about patchtst HOT 3 CLOSED

linfeng-du commented on August 23, 2024

Memory required to pretrain on Electricity and Traffic

from patchtst.

Comments (3)

namctin commented on August 23, 2024 1

Hi Linfeng, for these large datasets, we trained on A100 gpus with 80Gb memory. These datasets contain large number of variates, so 32GB seems not sufficient unless you reduce the number of input token.

from patchtst.

yuqinie98 commented on August 23, 2024

Hi, it depends on the prediction length, batch size, look back window.... But generally speaking for those two large datasets we often use 4 (could be up to 8) 3090 or A5000 GPUs. Also we decrease the batch size. For the other datasets, one 3090 would be sufficient.

from patchtst.

linfeng-du commented on August 23, 2024

Thank you for your quick reply! However, I am specifically referring to the pre-training phase which does not have prediction length since we're reconstructing masked patches.

Also I noticed that the default context length for pre-training is 512, which is different from the look-back window length used in downstream forecasting tasks. Just would like to confirm if this is intended.

from patchtst.

Memory required to pretrain on Electricity and Traffic about patchtst HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs