compvis / geometry-free-view-synthesis



Is a geometric model required to synthesize novel views from a single image?

Home Page: https://arxiv.org/abs/2104.07652

License: MIT License

Python 90.70% Jupyter Notebook 9.00% Shell 0.30%

novel-view-synthesis transformers

geometry-free-view-synthesis's Issues

No module named 'timm'

While running the Colab demo, I got stuck on this problem.
Is there anything I can do about it?

Downloading: "https://github.com/intel-isl/MiDaS/archive/master.zip" to /root/.cache/torch/hub/master.zip

---------------------------------------------------------------------------

ModuleNotFoundError                       Traceback (most recent call last)

<ipython-input-8-377ddaf3164f> in <module>()
      9     print("Warning: Running on CPU---sampling might take a while...")
     10     device = torch.device("cpu")
---> 11 midas = Midas().eval().to(device)
     12 renderer = Renderer(model=model, device=device)

9 frames

/root/.cache/torch/hub/intel-isl_MiDaS_master/midas/vit.py in <module>()
      1 import torch
      2 import torch.nn as nn
----> 3 import timm
      4 import types
      5 import math

ModuleNotFoundError: No module named 'timm'
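
The traceback above shows that MiDaS' `vit.py` imports `timm`, which is not installed in the runtime. A minimal sketch for checking this before loading the model follows; the helper name `missing_modules` is hypothetical, and the install hint it prints assumes a notebook environment where the `%pip` magic is available.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# 'timm' is the module named in the ModuleNotFoundError above.
required = ["timm"]
missing = missing_modules(required)

if missing:
    # Only prints a hint; nothing is installed automatically.
    print("Missing packages; in a notebook cell, try: %pip install " + " ".join(missing))
else:
    print("All required packages are importable.")
```

After installing the missing package, the cell that constructs `Midas()` would need to be run again.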

training scripts

Hello,

Thanks for open-sourcing! Do you mind adding training/evaluation scripts for RealEstate & ACID?

Why use inverse of intrinsics matrix?

I notice that in the multiembedder, the inverse intrinsic matrix is also included. Since this information should be already determined by K, why is this also used? Did you notice some improvement?

Thanks
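
To make the premise of the question concrete: for a pinhole camera with zero skew (an assumption here, not something confirmed by the repository), the inverse of K has a closed form in the same four parameters, so it carries no information beyond K itself. The sketch below only illustrates that relationship; it does not explain why the authors embed both matrices. The function names and the numeric values are hypothetical.

```python
def intrinsics(fx, fy, cx, cy):
    """Zero-skew pinhole intrinsics matrix K."""
    return [[fx, 0.0, cx],
            [0.0, fy, cy],
            [0.0, 0.0, 1.0]]


def intrinsics_inverse(fx, fy, cx, cy):
    """Closed-form inverse of the zero-skew K, built from the same parameters."""
    return [[1.0 / fx, 0.0, -cx / fx],
            [0.0, 1.0 / fy, -cy / fy],
            [0.0, 0.0, 1.0]]


def matmul(a, b):
    """Plain 3x3 matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


# Illustrative values only.
K = intrinsics(500.0, 500.0, 320.0, 240.0)
K_inv = intrinsics_inverse(500.0, 500.0, 320.0, 240.0)
product = matmul(K, K_inv)  # should be the identity up to rounding
```

Although the two matrices are mathematically redundant, a network may still find one form easier to use than the other, which is the empirical question asked above.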

Colab?

Is it possible to run the demo in Colab? Infinite Nature was able to provide a Colab demo :)

Label assignment after source and destination frame ids are selected?

I am trying to understand the code and have a question about the data loading for the ACID and RealEstate10K datasets. I will explain it using the ACID case. The label for large/small and forward/backward movement is assigned before the source and destination frame ids are actually sampled. As I understand it, the label should be assigned after the frame ids are selected. Can you please let me know if I am missing something?

cpu inference

Is inference on CPU possible?

Minimum GPU VRAM Required

Hi @pesser ,
What is the minimum GPU VRAM required for training on the RealEstate10K dataset?

there is no "points"

When using COLMAP to generate new data, the following line creates an empty points file:

geometry-free-view-synthesis/scripts/sparse_from_realestate_format.py

Line 185 in 00dc639

open(points3D_txt, "w").close()

However, many parts of the training code still process this empty data. Why?

Also, can I train this model without using COLMAP to generate new data? I do not see the point of using it, because we already have the intrinsic parameters and camera poses, and I cannot find any part of the code that uses COLMAP output such as the database.

Thank you!

ERROR: Could not find a version that satisfies the requirement geometry-free-view-synthesis

When I try to run pip install geometry-free-view-synthesis, I get this error. Is this a problem with the pip package source?

OSError: Cannot open resource on ImageFont

I've used the "alternative" install method. I'm running into this when hitting SPACE in the scene.

Training on objects

Hello,

I am curious if the implementation would be capable of novel view synthesis of objects if trained on multiview object data, as opposed to scene data? Do you see any potential blockers?

Thanks!

[Question] Multiple Image Input

How can I use multiple input images for the synthesis? When I start braindance.py path-to-my-imagefolder, a file manager opens that only lets me select a single image.

I'd like to re-create a whole room by combining the information from multiple images.

run braindance.py

Rel10K training images for the first stage

Hi,

Could you share the Rel10K training images for the first stage training?
We would like to train a different first stage model with the identical training data for fair comparison.

Best,

Could not find a version that satisfies the requirement splatting

Hello,

When I run pip install geometry-free-view-synthesis, no matching version of splatting can be found.

Any suggestions for this problem?

downloading RealEstate10K

Hi, I downloaded the RealEstate10K dataset, and it only contains the txt files.
Can you attach your code for downloading the dataset from those txt files?
Thanks!
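
For orientation, here is a minimal sketch of reading one of those txt files. It assumes the layout described in the dataset's public documentation: the first line is the video URL, and each following line holds a timestamp in microseconds, four normalized intrinsics (fx, fy, cx, cy), and a 3x4 camera pose in row-major order as the last twelve values. The function name, the dictionary keys, and the sample text are hypothetical, and the sketch does not download any video frames.

```python
def parse_camera_file(text):
    """Parse one camera txt file into (video_url, list of per-frame dicts)."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    url = lines[0]
    frames = []
    for ln in lines[1:]:
        vals = ln.split()
        fx, fy, cx, cy = (float(v) for v in vals[1:5])
        # The pose is taken from the last twelve values, so any extra
        # columns between the intrinsics and the pose are skipped.
        pose = [float(v) for v in vals[-12:]]
        frames.append({
            "timestamp_us": int(vals[0]),
            "intrinsics": (fx, fy, cx, cy),
            "pose": [pose[0:4], pose[4:8], pose[8:12]],
        })
    return url, frames


# Hypothetical sample with a placeholder URL and made-up numbers.
sample = (
    "https://www.youtube.com/watch?v=VIDEO_ID\n"
    "1000000 0.5 0.9 0.5 0.5 0 0 1 0 0 0.1 0 1 0 0.2 0 0 1 0.3\n"
)
url, frames = parse_camera_file(sample)
```

Extracting the actual images would additionally require fetching each video and grabbing frames at the listed timestamps, which this sketch leaves out.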
