GithubHelp home page GithubHelp logo

goutamyg / smat Goto Github PK

View Code? Open in Web Editor NEW
26.0 3.0 4.0 1.85 MB

[WACV 2024] Separable Self and Mixed Attention Transformers for Efficient Object Tracking

License: Apache License 2.0

Python 100.00%
single-object-tracking mixed-attention self-attention wacv2024 visual-object-tracking visual-tracking wacv

smat's People

Contributors

goutamyg avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

smat's Issues

What are the model parameters and MACs ?

Hello, thanks for great work!
I run the code from [OSTrack/tracking/profile_model.py] to measure the model parameters and MACs .(https://github.com/botaoye/OSTrack)
Howerer, overall params is 739.205K, which is different from 3.8M in the paper.
The experiments.yaml and lib/config/* are the same as Main branch(8b99155).
Could you please tell me how to measure the model size?

Training error

Hello, I downloaded the corresponding pre-trained model "mobilevitv2-1.0.pt" from [ml-cvnets] and configured the relevant parameters. However, during the training process, there are the following issues:
"SerWarning: An output with one or more elements was resized since it had shape [8388608], which does not match the required output shape [1, 128, 256, 256]."“
This warning covers almost every layer. How should I correct it? Thank you for your support and work.

About the figure5

Hello, how did you draw a heatmap? The effect I drew using the Gradcam method is much worse than the one in your paper. Would you like to make this part of the code public?

While training dimension mismatch error is coming.

Hii @goutamyg
While training, On the first epoch only, I am getting the same type of error as mentioned here. Also after commenting the lines told by you, i am still getting the same error, can you help me with it. Thank you
RuntimeError: Given groups=1, weight of size [256, 128, 1, 1], expected input[128, 3, 128, 128] to have 128 channels, but got 3 channels instead

When I train to epoch 10, I will encounter an error message

When I train to epoch 10, I will encounter an error message, which is “RuntimeError: Given groups=1, weight of size [32, 3, 3, 3], expected input[1, 128, 32, 32] to have 3 channels, but got 128 channels instead”. Can you help me solve this problem? Thank you.

Training on Custom Dataset

Hey, your work looks impressive!
I wanted to fine-tune SMAT on a custom dataset. However, changing all the configurations and files seems a little complex. Is there any chance that training on a custom dataset feature is also added in the codebase? Thanks!

confidence score

Hi, how can I find the confidence score or how can I calculate it?
Thank you.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.