GithubHelp home page GithubHelp logo

jiwei0921 / dmra Goto Github PK

View Code? Open in Web Editor NEW
101.0 5.0 21.0 1.21 MB

Code for ICCV 2019 paper. "Depth-induced Multi-scale Recurrent Attention Network for Saliency Detection". [RGB-D Salient Object Detection]

License: MIT License

Python 100.00%
salient-object-detection saliency-detection rgbd

dmra's People

Contributors

dut-iiau-oip-lab avatar jiwei0921 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

dmra's Issues

how to make dataset?

i want to train on my own dataset, and i found the format of images in masks is 1-bit .png, i am wondering how to make 1-bit .png?

depth load in dataloader file

In your code you use "depth = np.array(depth, dtype=np.uint8)" to load your depth image, I was wondering maybe this is not okay. Because of the depth value is much bigger than 255. If you use 'np.uint8', the depth value is not correct when loaded.
Thank you very much. @jiwei0921

about ConvLSTM

Thanks for your code and paper!
I notice that in your implementation, the ConvLSTMCELL returns the o as the output of the cell.
x, new_c, new_o = getattr(self, name)(x, h, c) # ConvLSTMCell forward
(line 316 in fusion.py)
However, i notice that in the pytorch implementation, the convlstm use the h as the output the ConvLSTMCELL. Could you please tell me the difference and the reason?

About learning rate

Hi Jiwei!

Why you try a little bit more bigger learning rate in your training phase?

Or

You try some bigger lr like lr = 3e-4 or lr = 1e-6, can you suggest some useful experiential value?

PiCANet experiment in the paper

Hello,

Thank you for your nice work.
I have some questions with the Table 1 in your paper.
Did you train/test RGB methods with the RGB images in the RGBD datasets from scratch? or finetune their pretrained model?
For PiCANet, which backbone did you use for your table? Is it VGG16 or ResNet?

Thank you.
Best regards,
Ahyun

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.