jiwei0921 / dmra Goto Github PK

Code for ICCV 2019 paper. "Depth-induced Multi-scale Recurrent Attention Network for Saliency Detection". [RGB-D Salient Object Detection]

License: MIT License

Python 100.00%

salient-object-detection saliency-detection rgbd

dmra's People

Contributors

Stargazers

Watchers

Forkers

dut-iiau-oip-lab pchank ucrscholar minzhangm frequencyxxq wangyue7777 bella9771 berther xiaolongcheng zhouleisjtu saliencydetection xinxin-zhu starboy-at-earth ljtsss rsdljm mfkiwl abandonsea

dmra's Issues

inconsistent depth size in DUT-RGBD-400*600

I found the depth size is 256x256 in DUT-RGBD-400*600, can you provide the original depth file (400x600)?

how to make dataset?

i want to train on my own dataset, and i found the format of images in masks is 1-bit .png, i am wondering how to make 1-bit .png?

In your code you use "depth = np.array(depth, dtype=np.uint8)" to load your depth image, I was wondering maybe this is not okay. Because of the depth value is much bigger than 255. If you use 'np.uint8', the depth value is not correct when loaded.
Thank you very much. @jiwei0921

about ConvLSTM

Thanks for your code and paper!
I notice that in your implementation, the ConvLSTMCELL returns the o as the output of the cell.
x, new_c, new_o = getattr(self, name)(x, h, c) # ConvLSTMCell forward
(line 316 in fusion.py)
However, i notice that in the pytorch implementation, the convlstm use the h as the output the ConvLSTMCELL. Could you please tell me the difference and the reason?

RuntimeError: The size of tensor a (150) must match the size of tensor b (152) at non-singleton dimension 3

RuntimeError: The size of tensor a (150) must match the size of tensor b (152) at non-singleton dimension 3
how to solve it?

About learning rate

Hi Jiwei!

Why you try a little bit more bigger learning rate in your training phase?

You try some bigger lr like lr = 3e-4 or lr = 1e-6, can you suggest some useful experiential value?

def forward() drb5.shape=(1,64,100,152) but others's shape =(1,64,100,150)

when I run demo.py, I find that drb5's shape = (1,64,100,152), but drb1,drb2... their shape = (1,64,100,150)

PiCANet experiment in the paper

Hello,

Thank you for your nice work.
I have some questions with the Table 1 in your paper.
Did you train/test RGB methods with the RGB images in the RGBD datasets from scratch? or finetune their pretrained model?
For PiCANet, which backbone did you use for your table? Is it VGG16 or ResNet?

Thank you.
Best regards,
Ahyun

RuntimeError: The size of tensor a (150) must match the size of tensor b (152) at non-singleton dimension 3

Please answer this question during the test. Thank you

jiwei0921 / dmra Goto Github PK

dmra's People

Contributors

Stargazers

Watchers

Forkers

dmra's Issues

inconsistent depth size in DUT-RGBD-400*600

how to make dataset?

depth load in dataloader file

about ConvLSTM

RuntimeError: The size of tensor a (150) must match the size of tensor b (152) at non-singleton dimension 3

About learning rate

def forward() drb5.shape=(1,64,100,152) but others's shape =(1,64,100,150)

PiCANet experiment in the paper

RuntimeError: The size of tensor a (150) must match the size of tensor b (152) at non-singleton dimension 3

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs