Comments (11)
我也刚踩了这个坑,12.2降级到11.8, 重装了 anaconda。最后总结一下,上是pytorch的问题,只要pytorch正确安装就行,cuda和pytorch的版本一致就行。
最主要的是mmcv-full这个包,不能mim方式安装,要pip方式安装。
from facechain.
我的是3090 驱动也是只支持12.2,我猜测可能要卸载驱动,重新安装对应的老驱动
from facechain.
2023-08-16 15:13:09,794 - mmcv - INFO -
pafpn_convs.1.conv.bias - torch.Size([16]):
The value is the same before and after calling init_weights
of PAFPN
2023-08-16 15:13:09,794 - modelscope - INFO - loading model from /home/adu/.cache/modelscope/hub/damo/cv_ddsar_face-detection_iclr23-damofd/pytorch_model.pt
load checkpoint from local path: /home/adu/.cache/modelscope/hub/damo/cv_ddsar_face-detection_iclr23-damofd/pytorch_model.pt
2023-08-16 15:13:09,806 - modelscope - INFO - load model done
2023-08-16 15:13:09,815 - modelscope - INFO - loading model from /home/adu/.cache/modelscope/hub/damo/cv_manual_facial-landmark-confidence_flcm/pytorch_model.pt
2023-08-16 15:13:09,823 - modelscope - INFO - load model done
cathed for image process of 001.jpg
Error: nms_impl: implementation for device cuda:0 not found.
cathed for image process of 002.jpg
Error: nms_impl: implementation for device cuda:0 not found.
cathed for image process of 003.jpg
Error: nms_impl: implementation for device cuda:0 not found.
cathed for image process of 000.jpg
Error: nms_impl: implementation for device cuda:0 not found.
[]
Error: result is empty.
Traceback (most recent call last):
File "/home/adu/anaconda3/envs/facechain/lib/python3.8/site-packages/gradio/routes.py", line 488, in run_predict
output = await app.get_blocks().process_api(
File "/home/adu/anaconda3/envs/facechain/lib/python3.8/site-packages/gradio/blocks.py", line 1431, in process_api
result = await self.call_function(
File "/home/adu/anaconda3/envs/facechain/lib/python3.8/site-packages/gradio/blocks.py", line 1109, in call_function
prediction = await anyio.to_thread.run_sync(
File "/home/adu/anaconda3/envs/facechain/lib/python3.8/site-packages/anyio/to_thread.py", line 33, in run_sync
return await get_asynclib().run_sync_in_worker_thread(
File "/home/adu/anaconda3/envs/facechain/lib/python3.8/site-packages/anyio/_backends/_asyncio.py", line 877, in run_sync_in_worker_thread
return await future
File "/home/adu/anaconda3/envs/facechain/lib/python3.8/site-packages/anyio/_backends/_asyncio.py", line 807, in run
result = context.run(func, *args)
File "/home/adu/anaconda3/envs/facechain/lib/python3.8/site-packages/gradio/utils.py", line 706, in wrapper
response = f(*args, **kwargs)
File "app.py", line 149, in run
data_process_fn(instance_data_dir, True)
File "/home/adu/Documents/facechain/facechain/inference.py", line 24, in data_process_fn
out_json_name = data_process_fn(input_img_dir)
File "/home/adu/Documents/facechain/facechain/data_process/preprocessing.py", line 335, in call
exit()
File "/home/adu/anaconda3/envs/facechain/lib/python3.8/_sitebuiltins.py", line 26, in call
raise SystemExit(code)
SystemExit: None
from facechain.
我的是3090 驱动也是只支持12.2,我猜测可能要卸载驱动,重新安装对应的老驱动
安装cuda11.7的话,对应的显卡驱动也要换,当时已经找不到支持4090对应显卡驱动了。
from facechain.
cuda12.2还未做适配验证,我们测一下。
from facechain.
cuda12.2还未做适配验证,我们测一下。
4090显卡目前没有匹配cuda11.7的驱动了。另外官方写的是ubuntu20.04,请问22.04是不是也还没有做适配了?22.04的个人主机用户挺多的。谢谢。
from facechain.
cuda12.2还未做适配验证,我们测一下。
我在conda是可以安装cuda11.7的,但是虚拟环境没有nvcc,在训练的时候依然报错 (奇怪,为什么我电脑上的Stable diffusion等别的项目都没遇到过这个问题呢)
from facechain.
俺也一样,cuda 12.2的,死活都编译不过
from facechain.
cuda11.7
Windows 10.
4090 Latest driver.
2023-08-27 14:07:21,057 - modelscope - INFO - PyTorch version 2.0.1+cu118 Found.
2023-08-27 14:07:21,060 - modelscope - INFO - TensorFlow version 2.8.0 Found.
2023-08-27 14:07:21,061 - modelscope - INFO - Loading ast index from C:\Users\Aether\.cache\modelscope\ast_indexer
2023-08-27 14:07:21,152 - modelscope - INFO - Loading done! Current index file version is 1.8.4, with md5 73983467ee49687075cf6bdab9254247 and a total number of 902 components indexed
...
cathed for image process of 002.jpg
Error: nms_impl: implementation for device cuda:0 not found.
排队再等等。
from facechain.
这个错误好眼熟,不外乎三个解决方法:1.卸载torch,重装和你显卡匹配的,2.重装numpy,3. pip安装mmcv-full
可以参考
https://zhuanlan.zhihu.com/p/651180440
from facechain.
我在部署facechain的时候,系统要求cuda11.7,但是在ubuntu上的4090对应的显卡驱动都是支持cuda12.2的,安装不上11.7。请问不是cuda12.2的也可以,为什么我的香料一直报错,训练的时候。。
还有问题: 1: 如果conda两个安装的python3.10.6的时候,如果是cuda12.2,那么mim install mmcv-full==1.7.0压根就无法执行安装 2: 但是如果用python3.8的话是安装可以成功mim install mmcv-full==1.7.0,启动程序,但是训练的时候报错。
https://developer.nvidia.com/cuda-toolkit-archive
from facechain.
Related Issues (20)
- Request for demos of FaceChain-ImagineID HOT 2
- 魔塔notebook 训练失败 HOT 3
- 请问FaceChain-SuDe的源码什么时候公开呀? HOT 2
- Training fails 2 times in a row... ModuleNotFoundError: No module named 'datasets' HOT 1
- 从sdwebui扩展中安装(install from url),重启后不显示facechain HOT 4
- ValueError, subprocess.CalledProcessError, 训练失败 HOT 3
- Can I use FaceChain on my computer? RTX - Graphic Memory: 24GB, Dedicated video memory: 8GB Shared System memory: 16gb HOT 2
- build URL failed HOT 1
- 请问无限风格写真这一步生成失败怎么办? HOT 1
- plan for opensource ImageID? HOT 3
- lora-scripts ERROR Training failed / 训练失败 HOT 2
- why facechain train with 512 image on XL ? HOT 1
- 阿里云pip requirements.txt失败 HOT 1
- 二维码已过期 HOT 1
- modelscope notebook训练失败 HOT 2
- UnicodeDecodeError HOT 1
- modelscope - INFO - PyTorch version 2.2.2 Found. HOT 1
- FaceChain-ImagineID未来有开源计划吗 HOT 2
- 合成图片报错 local variable 'checkpoint' referenced before assignment HOT 1
- 训练失败,前面安装和下载模型都是顺利的,上传图片开始训练就报错了,need help! HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from facechain.