Light

helioszhao / animate124 Goto Github PK

View Code? Open in Web Editor NEW

157.0 157.0 7.0 10.15 MB

Animate124: Animating One Image to 4D Dynamic Scene

License: Apache License 2.0

Python 82.18% Roff 0.02% C++ 3.52% Cuda 12.02% C 1.57% Shell 0.69%

animate124's Introduction

Hi there, I'm Yuyang Zhao 👋

Contact Me:

✉️ Email: [email protected]

🔗 Website: https://yuyangzhao.com

🔎 Google Scholar: https://scholar.google.com/citations?user=u5M6XPAAAAAJ

animate124's People

Contributors

Stargazers

Watchers

Forkers

hjsybyq dsaurus chnxindong sfidea jo-heejin bruinxiong whuhxb

animate124's Issues

Hello VRAM is too high to me, how do you guys handle this problem?

Sincerely...it is literaily too big so I've been failing again and again.
Also, threestudio's thing doesn't work.
You need to check it.

Now, I met Peter who is a google tech lead, he said that I can get google credit 3000 for startup.

In that case, is it worth to try? or do you have any recommendation about GPU or providers? what's your hardware set up?
Is it possible to inference this for visionPro?
Also, Is the result only available in 2D screen? Even though the contents are 3D...
Because I want to make a animatable 3d asset for spatial computing like an AR environment.

Sorry I don't want to leave this message in this issue section but no choice.

error on running step 2 - Can't load tokenizer for 'damo-vilab/text-to-video-ms-1.7b'

run:
bash teststep2.sh

more teststep2.sh
seed=0
gpu=0
exp_root_dir=outputs
DATA_DIR="panda-dance"
STATIC_PROMPT="a high resolution DSLR image of panda"
DYNAMIC_PROMPT="a panda is dancing"
CN_PROMPT="a is dancing"
lambda_sd_img=0.01

--------- Stage 2 (Dynamic Coarse Stage) ---------

ckpt=outputs/animate124-stage1/${STATIC_PROMPT}@LAST/ckpts/last.ckpt
python launch.py --config custom/threestudio-animate124/configs/animate124-stage2-ms.yaml --train --gpu $gpu
data.image.image_path=custom/threestudio-animate124/load/${DATA_DIR}/_rgba.png
system.prompt_processor.prompt="${DYNAMIC_PROMPT}"
system.weights="$ckpt"

error:

Seed set to 0
[INFO] Loading Stable Diffusion ...
model_index.json: 100%|████████████████████████████████████████████████████████████████| 384/384 [00:00<00:00, 4.08MB/s]
text_encoder/config.json: 100%|████████████████████████████████████████████████████████| 644/644 [00:00<00:00, 2.26MB/s]
scheduler/scheduler_config.json: 100%|█████████████████████████████████████████████████| 465/465 [00:00<00:00, 1.74MB/s]
unet/config.json: 100%|████████████████████████████████████████████████████████████████| 787/787 [00:00<00:00, 8.42MB/s]
vae/config.json: 100%|█████████████████████████████████████████████████████████████████| 657/657 [00:00<00:00, 8.67MB/s]
diffusion_pytorch_model.safetensors: 100%|███████████████████████████████████████████| 335M/335M [00:50<00:00, 6.60MB/s]
model.safetensors: 100%|███████████████████████████████████████████████████████████| 1.36G/1.36G [02:36<00:00, 8.72MB/s]
diffusion_pytorch_model.safetensors: 100%|█████████████████████████████████████████| 5.65G/5.65G [06:45<00:00, 13.9MB/s]
Fetching 8 files: 100%|███████████████████████████████████████████████████████████████████| 8/8 [06:46<00:00, 50.84s/it]
Loading pipeline components...: 100%|█████████████████████████████████████████████████████| 4/4 [00:45<00:00, 11.34s/it]
Traceback (most recent call last):s: 100%|█████████████████████████████████████████| 5.65G/5.65G [06:45<00:00, 18.6MB/s]
File "/home/andy/threestudio/launch.py", line 301, in
main(args, extras)
File "/home/andy/threestudio/launch.py", line 169, in main
system: BaseSystem = threestudio.find(cfg.system_type)(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/andy/threestudio/custom/threestudio-animate124/systems/base.py", line 40, in init
self.configure()
File "/home/andy/threestudio/custom/threestudio-animate124/systems/animate124.py", line 63, in configure
self.guidance_video = threestudio.find(self.cfg.guidance_type)(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/andy/threestudio/threestudio/utils/base.py", line 83, in init
self.configure(*args, **kwargs)
File "/home/andy/threestudio/custom/threestudio-animate124/models/guidance/zeroscope_guidance.py", line 70, in configure
self.tokenizer = CLIPTokenizer.from_pretrained(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/andy/miniconda3/envs/threestudio/lib/python3.11/site-packages/transformers/tokenization_utils_base.py", line 1795, in from_pretrained
raise EnvironmentError(
OSError: Can't load tokenizer for 'damo-vilab/text-to-video-ms-1.7b'. If you were trying to load it from 'https://huggingface.co/models', make sure you don't have a local directory with the same name. Otherwise, make sure 'damo-vilab/text-to-video-ms-1.7b' is the correct path to a directory containing all relevant files for a CLIPTokenizer tokenizer.

No video getting saved while running the run-cn.sh

Hi, thanks for your amazing work. I was wondering where the videos following semantic refinement are getting saved. Although run-dynamic.sh saves the videos in the specified location.

Is it possible to generate a 3d mesh model, for example a .dae file?

I was able to run all 3 steps on my PC after reducing some of the max steps. I used threestudio. I want to include animated models in a game, not just have the video, have a .dae file that you can import and use. Is that possible?

Missing key lambda_sd_img

When I run the following test.sh I get this error:

raise ConfigKeyError(f"Missing key {key!s}")

omegaconf.errors.ConfigKeyError: Missing key lambda_sd_img
full_key: loss.lambda_sd_img
object_type=dict

test.sh:
seed=0
gpu=0
exp_root_dir=outputs
DATA_DIR="panda-dance"
STATIC_PROMPT="a high resolution DSLR image of panda"
DYNAMIC_PROMPT="a panda is dancing"
CN_PROMPT="a is dancing"

--------- Stage 1 (Static Stage) ---------

python launch.py --config custom/threestudio-animate124/configs/animate124-stage1.yaml --train --gpu $gpu
data.image.image_path=custom/threestudio-animate124/load/${DATA_DIR}/_rgba.png
system.prompt_processor.prompt="${STATIC_PROMPT}"

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs

Jooble