GithubHelp home page GithubHelp logo

mosca's Introduction

MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds

Project Page | Video

The code will be released no later than the acceptance of the paper.

mosca's People

Contributors

jiahuilei avatar

Stargazers

SS avatar zhouwy19 avatar Jie Wang avatar Kunhong avatar Jiadai Sun avatar  avatar Wanquan Feng avatar  avatar Nick Maggio avatar Chaoyue Song avatar Lu Ming avatar Abed Rayyan avatar savoki avatar Wenzhao Zheng avatar Nan Huang avatar Ray avatar Wastoon avatar Jeffrey Li avatar  avatar Jiacheng Wei avatar Chenghong Li avatar Diwen Wan avatar Sauradip Nag avatar Tianrui Chen avatar Xin Cai avatar FU ZHOUJIE avatar  avatar Wang Shizun avatar Florin Shen avatar Haoyu Wang avatar  avatar  avatar JIMMY ZHAO avatar  avatar Qi Sun 孙启 avatar  avatar  avatar Sergey Prokudin avatar  avatar Leon avatar  avatar WQS avatar  avatar Zhizheng Liu avatar Lin-Zhuo Chen avatar  avatar  avatar VARUN SAKUNIA avatar Kairun Wen avatar  avatar Kai Cheng avatar Snow avatar Tuan Duc Ngo avatar Shurui Gui avatar jemm avatar  avatar  avatar Jiahao Cui avatar Geralt_of_Rivia avatar I Made Putra Arya Winata avatar Asrar avatar  avatar lizhihao6 avatar 奚子琛(Zichen Xi) avatar  avatar  avatar Ilya Zakharkin avatar Linyi Jin avatar Zhongrui Yu avatar Yufu avatar Yunsheng Luo avatar  avatar Xiaoyuan Wang avatar Weihang Li avatar Weirong Chen avatar 爱可可-爱生活 avatar minami avatar Hang avatar  avatar lizhan avatar Jin-Chuan Shi avatar  avatar Fan Yang avatar Narek Tumanyan avatar Pengfei Xuan avatar Chenxu Zhou avatar Changyeon Won avatar Kellyxiaowei avatar John Casey avatar Rekkles avatar Philippe Gauthier avatar Jiawei Yang avatar Andrew R McHugh avatar Francesco avatar Weining Ren avatar Alexandre Morgand avatar  avatar  avatar Zhixuan Xu avatar Hengkai Guo avatar

Watchers

bro_q_dev avatar  avatar Hengkai Guo avatar Jiaming Sun avatar Phong Nguyen Ha avatar  avatar Yue Gao avatar Snow avatar Wooqy avatar  avatar Pyjcsx avatar SS avatar  avatar lizhan avatar  avatar Weirong Chen avatar jemm avatar Weining Ren avatar Jin Yao avatar Thomas Wimmer avatar Zhongrui Yu avatar  avatar Zhixuan Xu avatar Pablo Dawson avatar Josiah Reeves avatar I Made Putra Arya Winata avatar Zheng Zhang avatar Lihe Ding avatar Oguchi avatar  avatar Haoyu Wang avatar

mosca's Issues

Question about optimizing camera poses

Hello,

Thanks for your fancy work. I'm interested in the dynamic 3D from a monocular video. While reading the paper, I was curious about the part about optimizing camera pose. Since the current Gaussian Splatting renderer does not support gradient propagation of camera poses, it seems that you are not optimizing camera poses with the l1 loss of the rendered image and GT, as shown in Eq. 14 and 15.

However, I'm a little confused about "Note that we will also optimize the camera poses Wt throughout the rendering phases as well with
photometric losses to further adjust the camera". I saw this was also mentioned in another issue. Does this mean that the camera pose is also optimized using Eq. 13 as loss? If so, have you implemented a renderer that can pass the camera pose gradient? If so, that's so cool.

[Question about jointly the complete representation]

Dear authors,

Thanks for your impressive work!
In the "Full Pipeline" section, you mentioned that MoSca performs "Initialize the dynamic Gaussians G and jointly optimize the complete representation including the static background Gaussians, dynamic foreground Gaussians, the deformation motion graph, and cameras".

Will you still optimize the camera poses after the BA process in section 3.5?
Also, in section 3.5, you mention that "we jointly optimize a correction to depth Da[pa], consisting of per-frame global scaling factors and small per-pixel corrections". Can you provide more detailed descriptions for your depth alignment implementation and the camera poses training and representation?

Implementation details on epipolar error map

Hi, thanks for your excellent work MoSca!

Currently I am trying to follow some techniques in MoSca to handle the monocular dynamic scene. And I meet a small problem, which I really hope that you could help me to solve it. Here is my question:

In MoSca, you use the epipolar error map to segment the dynamic region, which is really elegant. I found this part can be refered to the code in robust dynamic radiance field in CVPR2023. And when I use it to segment the dynamic part in nvidia dynamic datasets and iphone datasets, the results seem poor or instable. Here is some example in apple scene (iphone) and skating scene (nvidia). I guess that some parameters should be adjusted to adopt to these scenes. It would be so nice for you to share some experience.

微信图片_20240606215753
微信图片_20240606215757

[Rendering FPS]

Dear authors,

Thanks for your impressive work!
Can you provide detailed rendering FPS for both the iPhone and Nvidia datasets for comparison purposes?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.