GithubHelp home page GithubHelp logo

Long run time? about patchtst HOT 11 CLOSED

yuqinie98 avatar yuqinie98 commented on August 23, 2024
Long run time?

from patchtst.

Comments (11)

ikvision avatar ikvision commented on August 23, 2024 1

To make it clear, I didn't write this code/paper, I am like you - using it.
In the open source community it is not always easy to understand each other.
I would suggest to be kinder in order to get assistance

from patchtst.

ikvision avatar ikvision commented on August 23, 2024

For a single step into the future, would that help
parser.add_argument('--target_points', type=int, default=1, help='forecast horizon')

The current masking in the code is random

noise = torch.rand(bs, L, nvars,device=xb.device) # noise in [0, 1], bs x L x nvars

Do you have a casual mask pytorch implementation you are considering?

from patchtst.

Eliav2479 avatar Eliav2479 commented on August 23, 2024

This does not address my question.
I was talking about run time issues

from patchtst.

ikvision avatar ikvision commented on August 23, 2024

Training time can be solved in many different ways - multi-gpu, larger batch size, faster data-loader...
Why do you think that causal mask is your main bottle neck?

from patchtst.

Eliav2479 avatar Eliav2479 commented on August 23, 2024

Please read the question

from patchtst.

Eliav2479 avatar Eliav2479 commented on August 23, 2024

When you have a window size of H and a causal mask you can predict H tokens in a single pass.

from patchtst.

ikvision avatar ikvision commented on August 23, 2024

Indeed the methods is patch based, it might to be the best fit for predicting a single data point
You might want to to use only the pre-training with patch to create embedding.
For the second stage (fine-tunning) you can have a very simple regression from embedding predicting a single time step (1 layer NN without patches)

from patchtst.

Eliav2479 avatar Eliav2479 commented on August 23, 2024

I would suggest to wait for the authors for a response.
Thank you for replying.

from patchtst.

yuqinie98 avatar yuqinie98 commented on August 23, 2024

Thanks for asking @Eliav2479 and sorry for the late reply. Unfortunately we do not understand your question very well so we would appreciate if you could explain more of your concern. We basically agree with the solution that @ikvision proposed if you want to apply it to multiple-step prediction. Or you would just directly do multiple-step forecasting (DMS rather than IMS in this paper https://arxiv.org/pdf/2205.13504.pdf). The input is X1,...,Xt and output is Xt+1,..,Xt+T, which is done in one pass.

from patchtst.

DIKSHAAGARWAL2015 avatar DIKSHAAGARWAL2015 commented on August 23, 2024

any estimate on how long it will take to run supervised and self-supervised learning based on default model and params.

from patchtst.

yuqinie98 avatar yuqinie98 commented on August 23, 2024

It varies on different datasets, epochs, GPU... thus it would be hard to answer. The fastest one may take half an hour while the largest model takes a day. @DIKSHAAGARWAL2015

from patchtst.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.