GithubHelp home page GithubHelp logo

seokjulee / insta-dm Goto Github PK

View Code? Open in Web Editor NEW
219.0 15.0 27.0 11.04 MB

Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency (AAAI 2021)

License: Other

Python 98.22% Shell 1.78%
monocular-depth-estimation motion-estimation video-instance-segmentation depth-estimation visual-odometry pytorch

insta-dm's Issues

backward and forward flows

Dear authors

I was wondering how you obtain the backward flow of the first frame and the forward flow of the last frame in a scene.
Ex: there are 000.jpg, 001.jpg, 002.jpg in a scene
000.jpg, 001.jpg => forward_000.flo
(no -001.jpg to compute backward_000.flo)
001.jpg, 002.jpg => forward_001.flo
001.jpg, 000.jpg => backward_001.flo
002.jpg, 001.jpg => backward_002.flo
(no 003.jpg to compute backward_002.flo)

Thank you very much.

Best,
Bolian

pre-training weights

How are the pre-training weights of each sub-network obtained? How are the sub-networks supervised learning?

realtime

Is it possible to use this method in realtime on a video stram from a webcam?

custom dataset

Dear authors:
Thank you for your great work in this paper ! If I want to reproduce it on my custom dataset, how should I preprocess my image data to obtain the instance segmentation results.

CityScape Ground Truth

Hi @SeokjuLee

I noticed that the Cityscape-VIS dataset does not include the GT depth, could you share a link to download the GT data as used in your paper? Thank you!

Visualize in breakpoint 1

Hi @SeokjuLee :

Thank you for sharing this amazing work, the visualization in this code helps me a lot in understanding the logic. However, I have a confusion, the line below:

fig.add_subplot(ea1,ea2,ii); ii += 1; plt.imshow(r2t_inst); plt.colorbar(); plt.text(10, -14, "r2t_inst", fontsize=7, bbox=dict(facecolor='None', edgecolor='None'));

will trigger the error
"TypeError: Invalid shape (256, 832, 2) for image data"

There are two instance masks in r2t_inst, are you supposed to visualize one of them like r2t_inst[:, :, 0] or do you have other implementations to visualize both of them in once?

Thank you!

Error: Undefined symbol

By running sh scripts/run_demo.sh, I get the following error:

/torch/lib/python3.6/site-packages/torch_sparse/_version_cuda.so: undefined symbol: ZN3c106detail12infer_schema20make_function_schemaENS_8ArrayRefINS1_11ArgumentDefEEES4

i install exactly
pytorch=1.7.0 torchvision torchaudio cudatoolkit=11.0 -c pytorch
and
pip3 install torch-scatter torch-sparse -f https://pytorch-geometric.com/whl/torch-1.7.0+cu110.html

ImageNet pretrain

Hi, thank you for sharing this amazing work, I just have a little bit confusion. Does the AbsRel error of 0.112 on kitti eigen split with or without imagenet pretraining? If not, do you have results with pretraining?

Thank you

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.