GithubHelp home page GithubHelp logo

Comments (10)

tangyuhao avatar tangyuhao commented on August 16, 2024

Bounding box is in the form of <x1, y1, x2, y2>. In your case, it means if the second frame has size
(h, w) #h is height and w is width
the target should be in the bounding box with top left corner (y1 * h, x1 * w) and bottom right corner (y2 * h, x2 * w)

And I suppose all first frames have bounding box of <0.25,0.25,0.75,0.75> so I do not need to input this to the network. The only label is the position of the bonding box in the next frame.

from goturn-tensorflow.

tangyuhao avatar tangyuhao commented on August 16, 2024

However the network output is 10x of the original bbx. Thus it always outputs values ranging from 0 to 10. Sometimes exceeding it though.

from goturn-tensorflow.

liyang-7 avatar liyang-7 commented on August 16, 2024

Thank you very much! The CaffeNet resize the crop image into 227227, the input of the network is 227227, no matter the size(w, h) of the crop image. So the output of the network need to multiply w and h in order to regain the size of the crop image.
Because of your code, I understand the detail of GOTURN.
Extremely grateful!

from goturn-tensorflow.

tangyuhao avatar tangyuhao commented on August 16, 2024

You are welcome~

from goturn-tensorflow.

yihongXU avatar yihongXU commented on August 16, 2024

Hi,
May I ask you why sometimes they have negative values ?
Evan in your test sest examples, I found some negative values
test_example/target/000020.jpg,test_example/searching/000020.jpg,-0.13129449838187718,0.29418666666666665,0.5079611650485438,0.95328

thanks

from goturn-tensorflow.

tangyuhao avatar tangyuhao commented on August 16, 2024

because the actual object is outside the search area. I did this to test whether is have some ability to guess the actual position of an object if some part of it is not inside the searching area.

from goturn-tensorflow.

liyang-7 avatar liyang-7 commented on August 16, 2024

Hi, may I ask you the tensorflow version in this code?
I download the pretrained model from: GOTURN_MODEL.
Uncompress the checkpoints folder, then I got two items: checkpoint and checkpoint.ckpt-1.data-00000-of-00001
I got the error: Unsuccessful TensorSliceReader constructor: Failed to find any matching files.
I used the tf (1.1.0)
It seems that lack of the model.ckpt.index in the checkpoints folder.
Did you used the earlier tf version?

from goturn-tensorflow.

tangyuhao avatar tangyuhao commented on August 16, 2024

I used 1.0.0, but 1.1.0 should also work fine. I have added the checkpoint index file. It should fix the bug.

from goturn-tensorflow.

mrgloom avatar mrgloom commented on August 16, 2024

How Bounding box is in the form of <x1, y1, x2, y2>, usually from 0 to 1, but exceeding this range is also fine. bbox encoding related to encoding proposed in paper?

Is encoding in section 4.2 Learning Motion Smoothness is only used at train time for augmentation?

from goturn-tensorflow.

linhandev avatar linhandev commented on August 16, 2024

so does this mean when i use the output of the network to draw a bb, fc4/10 should be my x1,y1,x2,y2 and then use the above x1w , y1h , x2w , y2h as a pixel wise output

from goturn-tensorflow.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.