GithubHelp home page GithubHelp logo

compvis / geometry-free-view-synthesis Goto Github PK

View Code? Open in Web Editor NEW
365.0 365.0 34.0 165.41 MB

Is a geometric model required to synthesize novel views from a single image?

Home Page: https://arxiv.org/abs/2104.07652

License: MIT License

Python 90.70% Jupyter Notebook 9.00% Shell 0.30%
novel-view-synthesis transformers

geometry-free-view-synthesis's People

Contributors

pesser avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

geometry-free-view-synthesis's Issues

No module named 'timm'

During the collab demo running, I stuck with this problem.
Can I do something with it?

Downloading: "https://github.com/intel-isl/MiDaS/archive/master.zip" to /root/.cache/torch/hub/master.zip

---------------------------------------------------------------------------

ModuleNotFoundError                       Traceback (most recent call last)

<ipython-input-8-377ddaf3164f> in <module>()
      9     print("Warning: Running on CPU---sampling might take a while...")
     10     device = torch.device("cpu")
---> 11 midas = Midas().eval().to(device)
     12 renderer = Renderer(model=model, device=device)

9 frames

/root/.cache/torch/hub/intel-isl_MiDaS_master/midas/vit.py in <module>()
      1 import torch
      2 import torch.nn as nn
----> 3 import timm
      4 import types
      5 import math

ModuleNotFoundError: No module named 'timm'

training scripts

Hello,

Thanks for open-sourcing! Do you mind adding training/evaluation scripts for RealEstate & ACID?

Why use inverse of intrinsics matrix?

I notice that in the multiembedder, the inverse intrinsic matrix is also included. Since this information should be already determined by K, why is this also used? Did you notice some improvement?

Thanks

Colab?

Is this possible to run in colab with the demo? Infinite Nature was able to create a demo in Colab :)

Label assignment after source and destination frame ids are selected?

I am trying to understand the code and had one query about the dataloading in acid and real estate 10k dataset. I will try explaining it using the acid dataset case. The label assignment for large/small, forward/backward movement is done before actually sampling the source and destination frame ids. According to my understanding, this should be done after the frame id selection is done. Can you please let me know if I am missing something?

there is no "points"

when using colmap to generate the new data.

create empty data. But in the training code, there are many parts need to process this empty data. Why?

And could I do not use colmap to generate new data and train this model? I do not get the point to use it because we already have intrinsic params and camera pose. And I do not find any part in code using colmap data like database data.

Thank you!

Training on objects

Hello,

I am curious if the implementation would be capable of novel view synthesis of objects if trained on multiview object data, as opposed to scene data? Do you see any potential blockers?

Thanks!

[Question] Multiple Image Input

How can I include multiple input images into the synth? When starting braindance.py path-to-my-imagefolder, a filemanager opens that just let's me select a single image.

I'd like to re-create a whole room by including the info's of multiple images.

Rel10K training images for the first stage

Hi,

Could you share the Rel10K training images for the first stage training?
We would like to train a different first stage model with the identical training data for fair comparison.

Best,

downloading RealEstate10K

Hi, I downloaded the RealEstate10K dataset, and it only contains the txt files.
Can you attach your code for downloading the dataset from those txt files?
Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.