Hello, First of all, congrats for this amazing work, and thank you f

Error on tensors size when inputing actual image size about deep-exemplar-based-video-colorization HOT 6 OPEN

pasalvetti commented on June 1, 2024

Error on tensors size when inputing actual image size

from deep-exemplar-based-video-colorization.

Comments (6)

RachelBlin commented on June 1, 2024

Hello,

As the previous comment says, thank you very much for sharing your work.

I also tested your work on my own frames (500x500) and got a similar error:
RuntimeError: Sizes of tensors must match except in dimension 2. Got 62 and 63 (The offending index is 0)

After checking the code, I think it comes from __init__(self, batch_size) of class WarpNet(nn.Module) in models/NonlocalNet.py, where the features to be concatenated are computed. The comment above each layer definition shows that a 44*44 output is expected from each of the four layers (probably the feature dimensions for the default frame size), which is reached with one downsampling and two upsamplings. The problem is probably that the downsampling has to handle features with odd dimensions at some point and truncates and/or rounds them, causing a dimension mismatch between the four returned features.

As an illustration you can see that if the inputs to the layer functions are of shape:

torch.Size([1, 128, 125, 125])
torch.Size([1, 256, 62, 62])
torch.Size([1, 512, 31, 31])
torch.Size([1, 512, 15, 15])

The feature will be of shape:

torch.Size([1, 64, 63, 63]) # downsampling 125*125 by 2 returning 63*63
torch.Size([1, 64, 62, 62]) # keeping 62*62
torch.Size([1, 64, 62, 62]) # upsampling 31*31 by two returning 62*62
torch.Size([1, 64, 60, 60]) # upsampling 15*15 by 4 returning 60*60
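
The size arithmetic above can be reproduced without running the model. This is only a sketch: it assumes the first branch is downsampled by a convolution with kernel 3, stride 2 and padding 1, the second is kept as is, and the third and fourth are upsampled by 2 and 4. The function names are hypothetical and are not taken from models/NonlocalNet.py.

```python
def conv_out(n, kernel=3, stride=2, padding=1):
    """Spatial size after a convolution (standard floor formula).

    The kernel/stride/padding defaults are assumptions, not read from the repo.
    """
    return (n + 2 * padding - kernel) // stride + 1


def warp_feature_sizes(s2, s3, s4, s5):
    """Sizes of the four feature maps just before they are concatenated.

    Assumed layout: branch 1 downsampled by a stride-2 conv, branch 2 kept,
    branch 3 upsampled x2, branch 4 upsampled x4.
    """
    return (conv_out(s2), s3, 2 * s4, 4 * s5)


# Feature sizes reported above for 500x500 frames.
sizes = warp_feature_sizes(125, 62, 31, 15)
print(sizes)                 # (63, 62, 62, 60)
print(len(set(sizes)) == 1)  # False: the maps cannot be concatenated

# Feature sizes that are powers of two line up under the same assumptions.
print(warp_feature_sizes(128, 64, 32, 16))  # (64, 64, 64, 64)
```

Under these assumptions the odd sizes (125, 31, 15) are where the branches drift apart, which matches the explanation given above.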

However, I don't know how to fix this yet. I was wondering whether @pasalvetti has any updates since opening the issue?

Thanks a lot.

hrdunn commented on June 1, 2024

@RachelBlin @pasalvetti I have run into this issue as well. Has either of you managed to overcome this?

RachelBlin commented on June 1, 2024

Hi @hrdunn, unfortunately no, I gave up on the code and used another method. The only workaround I found was resizing the input images so that their height and width are divisible by 2^4 (16).
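
A minimal sketch of that workaround, for anyone who wants to try it: compute the nearest size at or below (or above) the frame size that is a multiple of 16, then resize the frames to it before running the model. The helper names are hypothetical, and the multiple is left as a parameter because a network with more downsampling stages may need a larger power of two.

```python
def round_to_multiple(n, multiple=16, up=False):
    """Round n down (default) or up to the nearest multiple of `multiple`."""
    if up:
        return -(-n // multiple) * multiple  # ceiling division, then scale back
    return (n // multiple) * multiple


def safe_size(height, width, multiple=16, up=False):
    """Return a (height, width) pair where both values are multiples of `multiple`."""
    return (
        round_to_multiple(height, multiple, up),
        round_to_multiple(width, multiple, up),
    )


print(safe_size(500, 500))           # (496, 496)
print(safe_size(500, 500, up=True))  # (512, 512)
```

Rounding down loses a few pixels of detail, while rounding up slightly upscales the frame; either way the aspect ratio may change a little if height and width are rounded by different amounts.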

hrdunn commented on June 1, 2024

@RachelBlin Interesting. I wonder if it has to do with the model being trained on a specific image size. @zhangmozhe is this the case? Would we need to retrain the model to get output at higher resolutions? Also, do you know if I could run inference on a TPU with the current code?

semel1 commented on June 1, 2024

Can't manage to get "image_size" to work. I tried --image_size 216,384 as stated in the help (-h), "--image_size IMAGE_SIZE the image size, eg. [216,384]", and it throws an error: "test.py: error: argument --image_size: invalid int value: '216,384'". Can anybody please explain the meaning of that option and how to use it properly? Thanks in advance for any help you are able to provide.

krishnacck commented on June 1, 2024

parser.add_argument("--image_size", type=int, default=[216 * 6, 384 * 6], help="the image size, eg. [216,384]")

The above code worked for me: I changed the default in the parser, multiplying the image size by an even number.
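
For context, the "invalid int value: '216,384'" error reported earlier in the thread is what argparse raises when an argument declared with type=int receives a string that is not a single integer. The sketch below is a hypothetical variant, not the repository's code: it uses a small custom type, size_pair, so that a value such as 216,384 can be passed on the command line instead of editing the default.

```python
import argparse


def size_pair(text):
    """Parse 'HEIGHT,WIDTH' (e.g. '216,384') into [HEIGHT, WIDTH].

    Hypothetical helper; it is not part of the original test.py.
    """
    parts = text.split(",")
    if len(parts) != 2:
        raise argparse.ArgumentTypeError("expected HEIGHT,WIDTH, e.g. 216,384")
    try:
        return [int(p) for p in parts]
    except ValueError:
        raise argparse.ArgumentTypeError(f"non-integer size in {text!r}")


def build_parser():
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--image_size",
        type=size_pair,       # replaces type=int, which rejects '216,384'
        default=[216, 384],   # list defaults are used as-is by argparse
        help="the image size as HEIGHT,WIDTH, e.g. 216,384",
    )
    return parser


args = build_parser().parse_args(["--image_size", "216,384"])
print(args.image_size)  # [216, 384]
```

With a parser like this, the size would be passed as --image_size 216,384 with no space after the comma.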
