Comments (5)
OK, I had completely removed my conda env and re-install for fedscope--this time I get the cudatoolkit 11.3.1 installed. then the demo project can be run up without problem.
from federatedscope.
Thanks for your attention!
The problem Torch not compiled with CUDA enabled
occurs when the cuda and torch versions mismatch. To fix it, you may need to re-set the conda env and install torch from a version-fixed wheel (see similar solutions here). Any other suggestions @rayrayraykk ?
from federatedscope.
It looks like it is caused by not installing pytorch correctly.
Here are 2 possible solutions:
- Downgrade your CUDA version to 11.3.
- Install pytorch manually via
pip install torch --pre --extra-index-url https://download.pytorch.org/whl/nightly/cu116
. For more details, see: https://discuss.pytorch.org/t/pytorch-cuda-11-6/149647.
from federatedscope.
Hi, Thanks for the reply.
I don't want to change my CUDA version as this is OK for my other project.. So I simply reinstall the torch, as reported as below it is success:
pip install torch --pre --extra-index-url https://download.pytorch.org/whl/nightly/cu116
Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple, https://download.pytorch.org/whl/nightly/cu116
Requirement already satisfied: torch in /home/liangma/miniconda3/envs/fedscope/lib/python3.9/site-packages (1.11.0)
Requirement already satisfied: typing-extensions in /home/liangma/miniconda3/envs/fedscope/lib/python3.9/site-packages (from torch) (4.2.0)
My torch version is 1.11.0 now but I still get the same error report when running the demo project.
When I checked the torch version-- I find that what is installed are pytorch (which is from the requirement.txt) but not torch, and my pytorch is likely "cpu_only". Does that explain the why problem happen?
$ conda list | grep torch:
cpuonly 2.0 0 pytorch
ffmpeg 4.3 hf484d3e_0 pytorch
pyg 2.0.4 py39_torch_1.10.0_cpu pyg
pytorch 1.10.1 py3.9_cpu_0 pytorch
pytorch-cluster 1.6.0 py39_torch_1.10.0_cpu pyg
pytorch-mutex 1.0 cpu pytorch
pytorch-scatter 2.0.9 py39_torch_1.10.0_cpu pyg
pytorch-sparse 0.6.13 py39_torch_1.10.0_cpu pyg
pytorch-spline-conv 1.2.1 py39_torch_1.10.0_cpu pyg
torch 1.11.0 pypi_0 pypi
torchaudio 0.10.1 py39_cpu [cpuonly] pytorch
torchtext 0.11.1 py39 pytorch
torchvision 0.11.2 py39_cpu [cpuonly] pytorch
from federatedscope.
Hi, Thanks for the reply.
I don't want to change my CUDA version as this is OK for my other project.. So I simply reinstall the torch, as reported as below it is success:
pip install torch --pre --extra-index-url https://download.pytorch.org/whl/nightly/cu116 Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple, https://download.pytorch.org/whl/nightly/cu116 Requirement already satisfied: torch in /home/liangma/miniconda3/envs/fedscope/lib/python3.9/site-packages (1.11.0) Requirement already satisfied: typing-extensions in /home/liangma/miniconda3/envs/fedscope/lib/python3.9/site-packages (from torch) (4.2.0)
My torch version is 1.11.0 now but I still get the same error report when running the demo project.
When I checked the torch version-- I find that what is installed are pytorch (which is from the requirement.txt) but not torch, and my pytorch is likely "cpu_only". Does that explain the why problem happen?
$ conda list | grep torch: cpuonly 2.0 0 pytorch ffmpeg 4.3 hf484d3e_0 pytorch pyg 2.0.4 py39_torch_1.10.0_cpu pyg pytorch 1.10.1 py3.9_cpu_0 pytorch pytorch-cluster 1.6.0 py39_torch_1.10.0_cpu pyg pytorch-mutex 1.0 cpu pytorch pytorch-scatter 2.0.9 py39_torch_1.10.0_cpu pyg pytorch-sparse 0.6.13 py39_torch_1.10.0_cpu pyg pytorch-spline-conv 1.2.1 py39_torch_1.10.0_cpu pyg torch 1.11.0 pypi_0 pypi torchaudio 0.10.1 py39_cpu [cpuonly] pytorch torchtext 0.11.1 py39 pytorch torchvision 0.11.2 py39_cpu [cpuonly] pytorch
It seems that you faced the problem with cpu-version as others met. You can try this solution to re-install torch.
from federatedscope.
Related Issues (20)
- Customize metric in LLM finetuning HOT 2
- Custom dataset for graph level prediction HOT 2
- Pytorch model saving HOT 1
- Question about LLaMA based federated training HOT 7
- Hello, I would like to ask how to use the final 'feature_importance' value inside the 'federatedscope.vertical_fl.tree_based_models.trainer.feature_order_protected_trainer.py' file? HOT 1
- the issue of introducing a new package HOT 1
- Issue in DistributedRunner
- save client's model HOT 3
- In LLM, where is the adapter for clients and servers to interact? HOT 3
- Error when using register_data HOT 3
- Offsite tuning code with multigpu setting throws error HOT 3
- Smaller test/val loss but lower evaluation accuracy HOT 3
- How to use multi GPU to finetune Llama2 HOT 2
- Unable to run demo in hyperparameter optimization HOT 2
- Server global evaluation total number HOT 1
- Error with 4 bit quantized LLM HOT 3
- TypeError: call_file_data() missing 1 required positional argument: 'client_cfgs' HOT 6
- Some questions about Backdoor Bench HOT 2
- 训练得到的total_flops是负数 HOT 2
- LDA splitter:ValueError: too many values to unpack (expected 2) HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from federatedscope.