GithubHelp home page GithubHelp logo

window_sizw about tsformer-vo HOT 8 CLOSED

aofrancani avatar aofrancani commented on July 30, 2024
window_sizw

from tsformer-vo.

Comments (8)

110sha avatar 110sha commented on July 30, 2024

QQ截图20240418201511

from tsformer-vo.

aofrancani avatar aofrancani commented on July 30, 2024

Yes, they are the same and you got it correctly.

In "kitti.py" you will read the data with "window_size" frames in each iteration. (https://github.com/aofrancani/TSformer-VO/blob/main/train.py#L223), and in the model's output, you will have "window_size -1" estimations because you have one pose estimation for each 2 consecutive frames.

The value "window_size=3" you see in "kitti.py" is just a default value if you don't mention it when reading the data...

from tsformer-vo.

110sha avatar 110sha commented on July 30, 2024

Thank you very much for your reply. If it is not mentioned when reading data, the value "window_size=3" seen in "kitti. py" is only a default value, and we do not need to worry about it.
So just modify the window size=2, 3, and 4 in train. py to represent VO-1, 2, and 3, respectively.
But I only changed (window_size: 2) to (window_size: 3) in train. py, and the resulting error was quite large.
So I would like to ask if there are any other parameters that need to be modified accordingly when modifying the window_size value in train.py?

from tsformer-vo.

aofrancani avatar aofrancani commented on July 30, 2024

No, the window_size parameter is independent of the others. What I used to do was set the overlap to "window_size - 1", so that the larger the window, the more data I got to train (with redundancy in the batches, because from one video clip to the next only one frame has changed). So, the overlap between the windowed data might be the other parameter you are looking for...

from tsformer-vo.

110sha avatar 110sha commented on July 30, 2024

I'm very sorry, I was so foolish that I reread the article and still don't quite understand how to implement it in the code.
Initially, train. py: window_size=2; Kitti. py: window_size=3, which means the overlapping frame rate is 2, representing VO1?
Then I conducted two experiments according to my understanding:
1: In train, window_size=3, and in kitti, window_size=4, indicating an overlapping frame rate of 3, i.e. VO2;
2: In train, window_size=4, and in kitti, window_size=5, indicating an overlapping frame rate of 4, i.e. VO3;
But the result is still not right.
Has my understanding gone wrong again? I hope to receive your guidance again! Thank you.

from tsformer-vo.

aofrancani avatar aofrancani commented on July 30, 2024

I'm sorry I didn't get it... What do you mean the result is not right? the expected size of your windowed data or the final evaluation metrics after/during your training?

from tsformer-vo.

110sha avatar 110sha commented on July 30, 2024

I will reproduce your code, and if no changes are made, the final error result will be similar to that in your paper.
But if I want to reproduce TSformer VO-2 and TSformer VO-3, how should I change it?
I made the changes according to this idea, and the final error was significant.

1: In train, window_size=3, and in kitti, window_size=4, indicating an overlapping frame rate of 3, i.e. VO2;
2: In train, window_size=4, and in kitti, window_size=5, indicating an overlapping frame rate of 4, i.e. VO3;

Simply put, I don't understand how to modify code. Where to train VO2 and VO3?谢谢

from tsformer-vo.

aofrancani avatar aofrancani commented on July 30, 2024

Ok, so you mean the final error after training everything...

So, the only thing you should edit is the "train.py", you don't need to worry about "kitty.py" because when we read the data we pass the parameter "args["window_size"]" as input to the dataloader.

  • For VO-2: set "window_size=3 and overlap=2"
  • For VO-3: set "window_size=4 and overlap=3"

I hope this helps!

from tsformer-vo.

Related Issues (17)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.