yinghaohuang91 / muvs Goto Github PK

This is the code for MuVS (Multi-view SMPLify)

License: MIT License

Python 93.54% Shell 6.46%

muvs's Introduction

MuVS

This is the demo code for the MuVS (Multi-view SMPLify) method presented in the paper:

Yinghao Huang, Federica Bogo, Christoph Lassner, Angjoo Kanazawa, Peter V. Gehler, Javier Romero, Ijaz Akhter and Michael J. Black, Towards Accurate Marker-less Human Shape and Pose Estimation over Time, International Conference on 3D Vision (3DV) 2017.

Dependencies

You need to install the the following packages via pip like shown below:

pip install --upgrade numpy scipy tensorflow-gpu matplotlib
pip install git+https://github.com/mattloper/chumpy.git@refs/pull/18/head # required since the main branch is broken since pip 10
pip install https://github.com/MPI-IS/meshlite/raw/master/download/psbody_meshlite-0.1-cp27-cp27mu-linux_x86_64.whl # meshlite

Additionally, smpl needs to be downloaed from http://smpl.is.tue.mpg.de/downloads and symlinked

ln -s <smpl dir>/smpl_webuser TF_Code/smpl

Deepcut and one human-specific segmentation method are also used (Not used now, since no differentialbe render in Tensorflow availabe). You need to install them from the project links.

Folder structure

Code_TF

This is where the code resides. A brief summary of what each file does in the following:

camera.py: perspective pinhole camera model, which projects 3D points into image space.
smpl_batch.py: Tensorflow verion of SMPL model, which takes in SMPL paramters and yields 3D SMPL joints.
util.py: experimental settings and data loading methods.

Data

Image data, DCT basis, SMPL model and pose prior is stored here. If you want to try your own data, you need to prepare it in the way shown in folder HEVA_Validate:

1, One folder for each view.
2, For each view, images are in *Image* subfolder.
3, 2D pose information in *Pose_2D subfolder*.
4, Create subfolders *Res_1* and *Res_2* to save the results.

How to run

Firstly you have to organize your data like described in the previous setction. Then you need to run Deepercut for all the images, and place them in the right folder. After that you can use these commands to run MuVS from the main folder:

./gen_run_jobs_1.sh # This runs per-frame fitting for all the frames
./gen_run_jobs_2.sh # This runs temporal fitting for all the temporal units

The resultant files (3D SMPL mesh, SMPL shape, pose and translation parameters) will be saved in ply and pkl in Res_1/Res_2 respectively.

Citation

If you find this code useful for your research, please consider citing:

@inproceedings{MuVS:3DV:2017,
  title = {Towards Accurate Marker-less Human Shape and Pose Estimation over Time},
  author = {Huang, Yinghao and Bogo, Federica and Lassner, Christoph and Kanazawa, Angjoo and Gehler, Peter V. and Romero, Javier and Akhter, Ijaz and Black, Michael J.},
  booktitle = {International Conference on 3D Vision (3DV)},
  year = {2017}
}

muvs's People

Contributors

Stargazers

Watchers

muvs's Issues

The dynamic model sequence is shaking

Hello Yinghao,

Thank you for your wonderful job! But there is a problem I met. When I run the demo code on data attached on the respository, the SMPL model fits well on every frame. However, when I render the SMPL models as a consecutive video, the model is shaking. How to deal with this problem? Thank you!

Switch to OpenPose from LSP

Hi @YinghaoHuang91 I am switching to Openpose as a joint detector for MuVS.
I read the codes of SMPL-X, I will refer to their way. I have a question about this line:

MuVS/Code_TF/util.py

Line 28 in 199f51e

TORSO_IDS = [3-1, 4-1, 9-1, 10-1]

I understand that the former line is to map LSP joints to SMPL's, and this torso joints are the 4 SMPL torso joints, but what are these 3-1, 4-1, 9-1, and 10-1?

Would you mind providing the version of each required packages?

Some issues occurred when I am installing the MuVS. Could you please provide the version of required packages and the Python?

About camera pose

How do you get the camera extrinsic from human3.6m dataset?

With a larger picture like 1024*1024, the train results poor loss like 10E6

I replace the data used in the project with 1024*1024 jpg, also 14 2d joints involved.
While the loss result turns out to be extraordinarily large:

Tit   = total number of iterations
Tnf   = total number of function evaluations
Tnint = total number of segments explored during Cauchy searches
Skip  = number of BFGS updates skipped
Nact  = number of active bounds at final generalized Cauchy point
Projg = norm of the final projected gradient
F     = final function value

           * * *

   N    Tit     Tnf  Tnint  Skip  Nact     Projg        F
    6     23     26      1     0     0   1.000D+06   7.801D+06
  F =   7800716.0000000000

Does the picture size matter with the training? Or anything I missed then.

I think There is missing in psbody file or something

I do not read installation guide carefully Sorry for mistake

Can the code be run on CPU?

Can you please let me know whether I can run this code on CPU?

The result shape is strange

Hello Yinghao, thank you for your wonderful idea and open source. When I run the demo without any change, the reconstructed model is strange especially the pose of head, and the loss of optimization is about 5000. I also tested only using 1 view, the result is much worse than the Smplify Tensorflow edition. Is that the reason of configuration? Thank you for your help!

About the test code

Thanks for sharing your code~
but when I only run 'gen_run_jobs_1.sh' with your test data, the result is very strange, it look like this:
/home/yck/桌面/2019-07-21 22-27-33屏幕截图.png
I did't change any code, who can tell me where the problem?

Can not see code about shape.

I have saw 'Prior_Shape', but seems different from the description in paper.

Strange Pose fitting results when using OpenPose J2D input

Hi man, I changed your Deepcut joint input to OpenPose 18 joints.
I removed the Head_top joint of DeepCut and mapped the nose_vertex_ID to openpose nose joint.
I only changed the codes in util.py and smpl_batch.py didn't touch frame_fit.py.
The result of frame_006 looks strange:

the right one use ur original DeepCut input is:

The print output is:

./gen_run_jobs_1.sh
0
../Data/HEVA_Validate/S1_Box_1_C1/Image/frame0006.png
Tensor("strided_slice_136:0", shape=(1, 13, 3), dtype=float32)
WARNING:tensorflow:From frame_fit.py:92: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead.
2020-09-22 14:22:43.737382: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1
2020-09-22 14:22:43.801989: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties:
name: GeForce GTX 1080 Ti major: 6 minor: 1 memoryClockRate(GHz): 1.582
pciBusID: 0000:05:00.0
2020-09-22 14:22:43.803034: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties:
name: GeForce GTX 1080 Ti major: 6 minor: 1 memoryClockRate(GHz): 1.582
pciBusID: 0000:06:00.0
2020-09-22 14:22:43.804058: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties:
name: GeForce GTX 1080 Ti major: 6 minor: 1 memoryClockRate(GHz): 1.582
pciBusID: 0000:09:00.0
2020-09-22 14:22:43.804240: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Could not dlopen library 'libcudart.so.10.0'; dlerror: libcudart.so.10.0: cannot open shared object file: No such file or directory
2020-09-22 14:22:43.804335: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Could not dlopen library 'libcublas.so.10.0'; dlerror: libcublas.so.10.0: cannot open shared object file: No such file or directory
2020-09-22 14:22:43.804423: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Could not dlopen library 'libcufft.so.10.0'; dlerror: libcufft.so.10.0: cannot open shared object file: No such file or directory
2020-09-22 14:22:43.804510: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Could not dlopen library 'libcurand.so.10.0'; dlerror: libcurand.so.10.0: cannot open shared object file: No such file or directory
2020-09-22 14:22:43.804599: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Could not dlopen library 'libcusolver.so.10.0'; dlerror: libcusolver.so.10.0: cannot open shared object file: No such file or directory
2020-09-22 14:22:43.804686: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Could not dlopen library 'libcusparse.so.10.0'; dlerror: libcusparse.so.10.0: cannot open shared object file: No such file or directory
2020-09-22 14:22:43.807522: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7
2020-09-22 14:22:43.807549: W tensorflow/core/common_runtime/gpu/gpu_device.cc:1663] Cannot dlopen some GPU libraries. Skipping registering GPU devices...
2020-09-22 14:22:43.807838: I tensorflow/core/platform/cpu_feature_guard.cc:142] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2020-09-22 14:22:44.152283: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x557cfd57c100 executing computations on platform CUDA. Devices:
2020-09-22 14:22:44.152340: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): GeForce GTX 1080 Ti, Compute Capability 6.1
2020-09-22 14:22:44.152359: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): GeForce GTX 1080 Ti, Compute Capability 6.1
2020-09-22 14:22:44.152373: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): GeForce GTX 1080 Ti, Compute Capability 6.1
2020-09-22 14:22:44.175759: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 3498000000 Hz
2020-09-22 14:22:44.176515: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x557cfc112f80 executing computations on platform Host. Devices:
2020-09-22 14:22:44.176566: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): ,
2020-09-22 14:22:44.176735: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix:
2020-09-22 14:22:44.176770: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187]
WARNING:tensorflow:From frame_fit.py:93: The name tf.global_variables_initializer is deprecated. Please use tf.compat.v1.global_variables_initializer instead.

2020-09-22 14:22:44.234960: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile.

RUNNING THE L-BFGS-B CODE

   * * *

Machine precision = 2.220D-16
N = 6 M = 10

This problem is unconstrained.

At X0 0 variables are exactly at the bounds

At iterate 0 f= 3.32008D+04 |proj g|= 5.35095D+04

At iterate 1 f= 5.12031D+03 |proj g|= 7.11054D+03

At iterate 2 f= 3.77815D+03 |proj g|= 5.47154D+03

At iterate 3 f= 2.42080D+03 |proj g|= 3.95927D+03

At iterate 4 f= 1.36007D+03 |proj g|= 3.27027D+03

At iterate 5 f= 7.52664D+02 |proj g|= 9.48627D+02

At iterate 6 f= 6.89472D+02 |proj g|= 1.83354D+02

At iterate 7 f= 6.84448D+02 |proj g|= 1.01980D+02

At iterate 8 f= 6.76808D+02 |proj g|= 1.95080D+02

At iterate 9 f= 6.55143D+02 |proj g|= 1.02498D+03

At iterate 10 f= 5.22447D+02 |proj g|= 1.65344D+03

At iterate 11 f= 3.32857D+02 |proj g|= 2.76960D+03

At iterate 12 f= 2.82925D+02 |proj g|= 2.34534D+03

At iterate 13 f= 1.92021D+02 |proj g|= 5.68519D+02

At iterate 14 f= 1.68120D+02 |proj g|= 3.90624D+02

At iterate 15 f= 1.62494D+02 |proj g|= 2.04222D+02

At iterate 16 f= 1.56634D+02 |proj g|= 1.65284D+02

At iterate 17 f= 1.49091D+02 |proj g|= 1.42732D+02

At iterate 18 f= 1.45896D+02 |proj g|= 2.77377D+02

At iterate 19 f= 1.45014D+02 |proj g|= 4.93933D+01

At iterate 20 f= 1.44913D+02 |proj g|= 6.80469D+00

       * * *

Tit = total number of iterations
Tnf = total number of function evaluations
Tnint = total number of segments explored during Cauchy searches
Skip = number of BFGS updates skipped
Nact = number of active bounds at final generalized Cauchy point
Projg = norm of the final projected gradient
F = final function value

       * * *

N Tit Tnf Tnint Skip Nact Projg F
6 20 25 1 0 0 6.805D+00 1.449D+02
F = 144.91284179687500

CONVERGENCE: REL_REDUCTION_OF_F_<=_FACTR*EPSMCH

Cauchy time 0.000E+00 seconds.
Subspace minimization time 0.000E+00 seconds.
Line search time 0.000E+00 seconds.

Total User time 0.000E+00 seconds.

RUNNING THE L-BFGS-B CODE

       * * *

Machine precision = 2.220D-16
N = 85 M = 10
This problem is unconstrained.

At X0 0 variables are exactly at the bounds

At iterate 0 f= 5.09612D+04 |proj g|= 2.31368D+04

At iterate 1 f= 4.88808D+04 |proj g|= 2.98825D+04

At iterate 2 f= 4.70862D+04 |proj g|= 2.49123D+04

At iterate 3 f= 4.32866D+04 |proj g|= 2.85334D+04

At iterate 4 f= 4.06109D+04 |proj g|= 2.34374D+04

At iterate 5 f= 3.98855D+04 |proj g|= 4.84759D+03

At iterate 6 f= 3.95298D+04 |proj g|= 4.49670D+03

At iterate 7 f= 3.85606D+04 |proj g|= 1.30690D+04

At iterate 8 f= 3.79381D+04 |proj g|= 8.20185D+03

At iterate 9 f= 3.73606D+04 |proj g|= 2.97926D+03

At iterate 10 f= 3.68244D+04 |proj g|= 7.99022D+03

At iterate 11 f= 3.63427D+04 |proj g|= 6.89147D+03

At iterate 12 f= 3.58900D+04 |proj g|= 5.91139D+03

At iterate 13 f= 3.56545D+04 |proj g|= 9.22863D+03

At iterate 14 f= 3.54833D+04 |proj g|= 1.25315D+04

At iterate 15 f= 3.53099D+04 |proj g|= 3.23353D+03

At iterate 16 f= 3.51866D+04 |proj g|= 2.60039D+03

At iterate 17 f= 3.49947D+04 |proj g|= 7.10795D+03

At iterate 18 f= 3.47829D+04 |proj g|= 5.58012D+03

At iterate 19 f= 3.46971D+04 |proj g|= 9.31179D+03

At iterate 20 f= 3.45636D+04 |proj g|= 3.14364D+03

At iterate 21 f= 3.45150D+04 |proj g|= 8.36188D+02

At iterate 22 f= 3.44934D+04 |proj g|= 1.46735D+03

       * * *

Tit = total number of iterations
Tnf = total number of function evaluations
Tnint = total number of segments explored during Cauchy searches
Skip = number of BFGS updates skipped
Nact = number of active bounds at final generalized Cauchy point
Projg = norm of the final projected gradient
F = final function value

       * * *

N Tit Tnf Tnint Skip Nact Projg F
85 22 25 1 0 0 1.467D+03 3.449D+04
F = 34493.414062500000

CONVERGENCE: REL_REDUCTION_OF_F_<=_FACTR*EPSMCH

Cauchy time 0.000E+00 seconds.
Subspace minimization time 0.000E+00 seconds.
Line search time 0.000E+00 seconds.

Total User time 0.000E+00 seconds.

We can see in the second optimizing round, the F value is big but almost not change, which finally results in converging.
The actor's left arm cannot be optimized until the last. Could you give me any advice about where is the problem?

Update: the problem seems like the 2nd optimization falls into the local minimum problems thus ends too early.
In some frames, such as frame 30, It runs good, 65 iterations and I got the same result with deep-cut input since its optimization.
Could you help me to figure out why the optimization always ends up with D+04, such a big f value?

When I run this code, I get an error

from psbody.mesh import Mesh
ImportError: No module named mesh

Resulting shape and pose is not good.

Hi Yang! thanks for your awesome work!

i have been having issues while testing it. Testing it on the data that you provide (as it is) turns out a pretty good shape and pose. But when I try to test on videos taken from CMU Panoptic dataset or any other video. The resulting shape and pose comes out very poor. I also inputted the right camera parameters. Can it be the format of the camera parameters input or some kind of scaling required? Can you tell me how you represent the camera parameters? . I also used 3 views and also tried on 4 views. None of it seems to work.

I read the camera parameters from the CMU panoptic datasets calibration.json and I convert them to np.float32 before passing to the function. Can it be something there?.

Regards

About the Use of Silhouette

Hi, @YinghaoHuang91 Thanks for your great work! It seems that your codes do not use segmentation results. Can you share that part? Thx!

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.