GithubHelp home page GithubHelp logo

Comments (7)

fefespn avatar fefespn commented on August 25, 2024 2

So yea, what I am doing now is like this:
for a basic 3D object we need geometry + texture.
from the wonder3D I get the geometry, it's good enough for my needs.
Then I use the second stage of dreamgaussian repositry/paper, they have a differentiable rendered for texture refinement.
you need to make sure the 3D obj from wonder3d aligned with the renderer.
so I optimize the Object texture using my multi-view images and that renderer.

from wonder3d.

flamehaze1115 avatar flamehaze1115 commented on August 25, 2024

Hello, thanks for your interests in our work.
Currently our diffusion model only support 256x256 resolution, so 512 does not work.
For the OOM problem, you should use 1gpu.yaml, the diffusion won't cost too much memory. Current inference code supports float32, I will change it to float16 later to save memory.
For stage 2, I will write a document about the parameters to favor better reconstruction methods.

from wonder3d.

Jiaozrr avatar Jiaozrr commented on August 25, 2024

First of all, amazing work on this project!

Would it be possible to increase texture quality? I've tried increasing the img_wh under validation_dataset in the config to [512, 512] but I get: RuntimeError: Sizes of tensors must match except in dimension 1. Expected size 32 but got size 64 for tensor number 1 in the list.

As for second phase, what relevant parameters in the 'neuralangelo-ortho-wmask.yaml' config would allow modification to mesh quality?

And finally, less relevant to the above, but I've tried running on multi-GPU and the seconds phase (instant-nsr-pl) works fine when reducing the iterations but when trying to run the diffusion phase with a similar config to 8gpu.yaml but with 4 instead (running on 4xNVIDIA L4) I get OOM. Any tips you can share on this?

Thanks a lot!

Hello! I met this problem, too.
I find that img_wh is corresponding to the size of latents between unet and vae. So changing the sample_size of unet in mvdiffusion-joint-ortho-6views.yaml into 64 will make their size the same and fix the bug. And setting crop_size to 384 leads to a better image proportion. But I can only get inconsistent results, not as amazing as the result of 256x256.
Hope authors can provide a higher-resolution pretrained unet or even training code!

from wonder3d.

fefespn avatar fefespn commented on August 25, 2024

Hello, @flamehaze1115 that would very appreciated if you could share a reconstruction doc !!
@jclarkk, I will try to answer with what I did, I still experiment it so maybe in the future would be better:

  1. The reconstruction stage starts first of all with resizeing the 256X256 images to 1024X1024. Now First Make sure when resizing the resolution wouldn't decrease a lot. Second make sure the object is scaled/aligned exactly to 1024 (and not small in the middle with lot of whot margins) . Third: you can manually change the front image with your input image (make sure it's aligned with the front_normals image, you can make sure of this by crop=-1 in the diffusion first stage).
  2. If you resize it to 2048X2048 it would be better (of course take more time to reconstruction and more memory)
  3. In the config there is a weights for each view, give the front view the highest weight.
  4. (I am not sure of this) you can do manual postprocess using blender or any other software, to map the texture+normals to the front view (this will make sure, depending in the geometry, that your texture will be good)
  5. (I will try in the future) play with the optimizer parameters.
  6. (I will try it in the future and I am not sure if I understand it right (appreciate feedback if you know the theory)) because they use NeUs, they embed the points colors to mlp network ( like nerf? ) and this would give little bit blurry for gentle small details if there is no enough input data (just 6) ), so by removing this mlp and map it as it is, the color would be nice (that wouldn't work for reflections and mirrors...)

from wonder3d.

jclarkk avatar jclarkk commented on August 25, 2024

@fefespn
Thanks a lot for your notes, I'll try to play around with this as well.

from wonder3d.

 avatar commented on August 25, 2024

@fefespn Hi, so have you tried with your suggestions? And, do you have any further good suggestions? Thanks

from wonder3d.

LordLiang avatar LordLiang commented on August 25, 2024

So yea, what I am doing now is like this: for a basic 3D object we need geometry + texture. from the wonder3D I get the geometry, it's good enough for my needs. Then I use the second stage of dreamgaussian repositry/paper, they have a differentiable rendered for texture refinement. you need to make sure the 3D obj from wonder3d aligned with the renderer. so I optimize the Object texture using my multi-view images and that renderer.

@fefespn Hello! Can you share more about your modified second stage of dg? The different camera settings of these two methods are a little confusing.

from wonder3d.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.