Comments (2)
Although this doesn't solve the bug, if you would like to get things working and stop vllm from trying to use your integrated Radeon Graphics, you can set CUDA_VISIBLE_DEVICES=-1. I tried setting --device=cpu as well, and it is working correctly for me.
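For reference, here is a minimal sketch of that workaround through the Python API. The model name is just a placeholder, and I'm assuming the `device` keyword of the `LLM` constructor mirrors the `--device` CLI flag; check the docs for your version.

```python
import os

# Hide all CUDA devices (including the integrated Radeon) before vllm
# initializes. This is a workaround, not a fix for the underlying bug.
os.environ["CUDA_VISIBLE_DEVICES"] = "-1"

from vllm import LLM

# Equivalent of passing --device=cpu on the command line (assumption:
# the LLM constructor forwards `device` to the engine arguments).
llm = LLM(model="facebook/opt-125m", device="cpu")
print(llm.generate("Hello, my name is")[0].outputs[0].text)
```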
+1 to this issue. It seems the error occurs when you install vllm without the CPU build. Currently the attention backend is chosen based on whether the installed version of vllm has the cpu suffix or not (https://github.com/vllm-project/vllm/blob/main/vllm/attention/selector.py#L84 -> https://github.com/vllm-project/vllm/blob/main/vllm/utils.py#L131). This means that even when you specify the device to be cpu, vllm tries to load one of the other attention backends.
#4962 is a potential solution (effectively passing the CPU attention backend flag down from the worker).
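To make the failure mode concrete, here is a hedged paraphrase of the selection logic those two links point at. The function and backend names follow the linked files, but this is a simplified sketch; the exact code may differ by version.

```python
from importlib.metadata import PackageNotFoundError, version

def is_cpu() -> bool:
    # vllm/utils.py: the check looks at the *installed wheel's* version
    # string (e.g. "0.4.2+cpu"), not at the device the user requested.
    try:
        return "cpu" in version("vllm")
    except PackageNotFoundError:
        return False

def select_backend(device: str) -> str:
    # vllm/attention/selector.py (simplified): the CPU attention backend
    # is picked only when the wheel carries the +cpu suffix, so a regular
    # GPU install with --device=cpu still walks the GPU backend path.
    if is_cpu():
        return "TORCH_SDPA"
    return "FLASH_ATTN"  # a GPU backend, which then fails to load on CPU
```

On a default (non-cpu) wheel, `select_backend("cpu")` never returns the CPU backend, which matches the behavior described above.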
Related Issues (20)
- [Bug]: Failed to import from vllm._C with ImportError('/usr/local/lib/python3.8/dist-packages/vllm/_C.abi3.so: undefined symbol: _ZN5torch7LibraryC1ENS0_4KindESsSt8optionalIN3c1011DispatchKeyEEPKcj') HOT 9
- [Usage]: RAG system HOT 8
- [Bug]: ModuleNotFoundError: No module named 'bitsandbytes' HOT 4
- [Bug]: Illegal memory access in CUTLASS FP8 kernels HOT 1
- [Bug]: Docker image versions 0.5.0 and 0.4.3 don't work with 4090s HOT 2
- [Bug]: Error loading FP8 weights for `gpt_bigcode` model
- [Bug]: Excessive Memory Consumption of Cudagraph on A10G/L4 GPUs HOT 3
- [RFC]: Usage Data Enhancement for v0.5.* HOT 2
- [Bug]: Shutdown error when using multiproc_gpu_executor HOT 1
- [Bug]: Loading the qwen2 72b and glm-4-9b-chat-1m models in v0.5.0 is much slower than in v0.4.2. HOT 10
- [Bug]: MoE model with 2-GPU inference raises AssertionError("Invalid device id") HOT 2
- [Bug]: In vLLM v0.4.3 and later, calling list_loras() in a tensor parallelism situation causes the system to hang. HOT 2
- [Bug]: Very slow execution of from_lora_tensors() when using mp instead of ray as --distributed-executor-backend.
- [Usage]: how to use enable-chunked-prefill? HOT 2
- [Performance]: How to use vllm.attention.ops.triton_flash_attention to replace the flash_attn package HOT 1
- [Bug]: Performance : very slow inference for Mixtral 8x7B Instruct FP8 on H100 with 0.5.0 and 0.5.0.post1 HOT 2
- [Bug]: CUDA illegal memory access error when `enable_prefix_caching=True` HOT 4
- [Bug]: vLLM 0.3.0 produces weird output
- [Feature]: LoRA support for Mixtral GPTQ and AWQ HOT 1
- [Feature]: asymmetric tensor parallel