Hi there, I'm Yuyang Zhao ๐
Contact Me:
โ๏ธ Email: [email protected]
๐ Website: https://yuyangzhao.com
๐ Google Scholar: https://scholar.google.com/citations?user=u5M6XPAAAAAJ
Animate124: Animating One Image to 4D Dynamic Scene
License: Apache License 2.0
Contact Me:
โ๏ธ Email: [email protected]
๐ Website: https://yuyangzhao.com
๐ Google Scholar: https://scholar.google.com/citations?user=u5M6XPAAAAAJ
Sincerely...it is literaily too big so I've been failing again and again.
Also, threestudio's thing doesn't work.
You need to check it.
Now, I met Peter who is a google tech lead, he said that I can get google credit 3000 for startup.
Sorry I don't want to leave this message in this issue section but no choice.
run:
bash teststep2.sh
more teststep2.sh
seed=0
gpu=0
exp_root_dir=outputs
DATA_DIR="panda-dance"
STATIC_PROMPT="a high resolution DSLR image of panda"
DYNAMIC_PROMPT="a panda is dancing"
CN_PROMPT="a is dancing"
lambda_sd_img=0.01
ckpt=outputs/animate124-stage1/${STATIC_PROMPT}@LAST/ckpts/last.ckpt
python launch.py --config custom/threestudio-animate124/configs/animate124-stage2-ms.yaml --train --gpu $gpu
data.image.image_path=custom/threestudio-animate124/load/${DATA_DIR}/_rgba.png
system.prompt_processor.prompt="${DYNAMIC_PROMPT}"
system.weights="$ckpt"
error:
Seed set to 0
[INFO] Loading Stable Diffusion ...
model_index.json: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 384/384 [00:00<00:00, 4.08MB/s]
text_encoder/config.json: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 644/644 [00:00<00:00, 2.26MB/s]
scheduler/scheduler_config.json: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 465/465 [00:00<00:00, 1.74MB/s]
unet/config.json: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 787/787 [00:00<00:00, 8.42MB/s]
vae/config.json: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 657/657 [00:00<00:00, 8.67MB/s]
diffusion_pytorch_model.safetensors: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 335M/335M [00:50<00:00, 6.60MB/s]
model.safetensors: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 1.36G/1.36G [02:36<00:00, 8.72MB/s]
diffusion_pytorch_model.safetensors: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 5.65G/5.65G [06:45<00:00, 13.9MB/s]
Fetching 8 files: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 8/8 [06:46<00:00, 50.84s/it]
Loading pipeline components...: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 4/4 [00:45<00:00, 11.34s/it]
Traceback (most recent call last):s: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 5.65G/5.65G [06:45<00:00, 18.6MB/s]
File "/home/andy/threestudio/launch.py", line 301, in
main(args, extras)
File "/home/andy/threestudio/launch.py", line 169, in main
system: BaseSystem = threestudio.find(cfg.system_type)(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/andy/threestudio/custom/threestudio-animate124/systems/base.py", line 40, in init
self.configure()
File "/home/andy/threestudio/custom/threestudio-animate124/systems/animate124.py", line 63, in configure
self.guidance_video = threestudio.find(self.cfg.guidance_type)(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/andy/threestudio/threestudio/utils/base.py", line 83, in init
self.configure(*args, **kwargs)
File "/home/andy/threestudio/custom/threestudio-animate124/models/guidance/zeroscope_guidance.py", line 70, in configure
self.tokenizer = CLIPTokenizer.from_pretrained(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/andy/miniconda3/envs/threestudio/lib/python3.11/site-packages/transformers/tokenization_utils_base.py", line 1795, in from_pretrained
raise EnvironmentError(
OSError: Can't load tokenizer for 'damo-vilab/text-to-video-ms-1.7b'. If you were trying to load it from 'https://huggingface.co/models', make sure you don't have a local directory with the same name. Otherwise, make sure 'damo-vilab/text-to-video-ms-1.7b' is the correct path to a directory containing all relevant files for a CLIPTokenizer tokenizer.
Hi, thanks for your amazing work. I was wondering where the videos following semantic refinement are getting saved. Although run-dynamic.sh saves the videos in the specified location.
I was able to run all 3 steps on my PC after reducing some of the max steps. I used threestudio. I want to include animated models in a game, not just have the video, have a .dae file that you can import and use. Is that possible?
When I run the following test.sh I get this error:
raise ConfigKeyError(f"Missing key {key!s}")
omegaconf.errors.ConfigKeyError: Missing key lambda_sd_img
full_key: loss.lambda_sd_img
object_type=dict
test.sh:
seed=0
gpu=0
exp_root_dir=outputs
DATA_DIR="panda-dance"
STATIC_PROMPT="a high resolution DSLR image of panda"
DYNAMIC_PROMPT="a panda is dancing"
CN_PROMPT="a is dancing"
python launch.py --config custom/threestudio-animate124/configs/animate124-stage1.yaml --train --gpu $gpu
data.image.image_path=custom/threestudio-animate124/load/${DATA_DIR}/_rgba.png
system.prompt_processor.prompt="${STATIC_PROMPT}"
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.