Comments (6)
the same
from self-rag.
I installed fast-attn by following the referenced pypi_link and it works, but I now hit a new issue. When I run
pip install factscore==0.1.5
torch is downgraded from 2.0.1 to 1.13.1, and vllm then fails to import. The error message is:
Traceback (most recent call last):
  File "start.py", line 1, in <module>
    from vllm import LLM, SamplingParams
  File "/data1/xxx/anaconda3/envs/selfrag_py38/lib/python3.8/site-packages/vllm/__init__.py", line 3, in <module>
    from vllm.engine.arg_utils import AsyncEngineArgs, EngineArgs
  File "/data1/xxx/anaconda3/envs/selfrag_py38/lib/python3.8/site-packages/vllm/engine/arg_utils.py", line 6, in <module>
    from vllm.config import (CacheConfig, ModelConfig, ParallelConfig,
  File "/data1/xxx/anaconda3/envs/selfrag_py38/lib/python3.8/site-packages/vllm/config.py", line 9, in <module>
    from vllm.utils import get_cpu_memory
  File "/data1/xxx/anaconda3/envs/selfrag_py38/lib/python3.8/site-packages/vllm/utils.py", line 8, in <module>
    from vllm._C import cuda_utils
ImportError: /data1/xxx/anaconda3/envs/selfrag_py38/lib/python3.8/site-packages/vllm/_C.cpython-38-x86_64-linux-gnu.so: undefined symbol: _ZN3c104cuda20CUDACachingAllocator9allocatorE
My environment:
cuda/nvcc toolkit: 12.0.1
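
A quick way to confirm which packages an install changed is to snapshot the installed versions before and after running pip. The following is only a minimal sketch using the standard-library `importlib.metadata`; the helper names `installed_versions` and `changed_packages` are made up for illustration and are not part of self-rag, vllm, or factscore.

```python
from importlib import metadata
from typing import Dict, Iterable, Optional, Tuple


def installed_versions(packages: Iterable[str]) -> Dict[str, Optional[str]]:
    """Return the installed version of each package, or None if it is missing."""
    versions: Dict[str, Optional[str]] = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


def changed_packages(
    before: Dict[str, Optional[str]],
    after: Dict[str, Optional[str]],
) -> Dict[str, Tuple[Optional[str], Optional[str]]]:
    """Return {name: (old_version, new_version)} for packages whose version differs."""
    return {
        name: (before.get(name), after.get(name))
        for name in after
        if before.get(name) != after.get(name)
    }


# Hypothetical watch list: the packages mentioned in this thread.
WATCHED = ["torch", "vllm", "factscore"]

# Prints the current state of the active environment.
print(installed_versions(WATCHED))
```

Calling `installed_versions(WATCHED)` once before and once after the pip command, then passing both results to `changed_packages`, would show an entry such as `torch: ('2.0.1', '1.13.1')` if torch was replaced as described above.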
For inference you only need to install torch and vllm; factscore is only used for evaluating the Bio task. In fact, the long-form generation code seems incomplete and the adaptive mode doesn't work well, so you can skip the factscore installation if you just want to get it running.
thanks, I will give it a shot🤗
Hi @xhd0728, sorry for my late response! As @Loose-Gu mentions, you can create a separate env for factscore if the issue remains. I also had a similar issue when I was working on the project, so I ended up creating a separate environment. I'll update the readme and requirements.txt shortly. Thanks for your patience!
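
The separate-environment approach can be sketched with Python's standard-library `venv` module. This is an assumption-laden illustration, not the setup used by the authors: the thread itself uses a conda env, the directory name `factscore_env` and the helper names are hypothetical, and the `factscore==0.1.5` pin is simply the version quoted earlier in this thread.

```python
import os
import subprocess
import venv
from typing import List


def create_env(env_dir: str, with_pip: bool = True) -> str:
    """Create an isolated virtual environment and return the path to its Python."""
    venv.create(env_dir, with_pip=with_pip)
    bin_dir = "Scripts" if os.name == "nt" else "bin"
    exe = "python.exe" if os.name == "nt" else "python"
    return os.path.join(env_dir, bin_dir, exe)


def pip_install_command(python_path: str, requirement: str) -> List[str]:
    """Build the pip command that installs `requirement` into that environment."""
    return [python_path, "-m", "pip", "install", requirement]


def install(python_path: str, requirement: str) -> None:
    """Run pip inside the isolated environment (needs network access)."""
    subprocess.run(pip_install_command(python_path, requirement), check=True)


# Example usage, left commented out because it downloads packages:
# python_path = create_env("factscore_env")      # hypothetical directory name
# install(python_path, "factscore==0.1.5")       # version quoted in this thread
```

Because pip runs through the new environment's own interpreter, whatever torch version factscore pulls in stays inside `factscore_env`, and the torch build that vllm was compiled against in the inference environment is left untouched.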
FYI: A new PR has been merged to fix the conflicts (thanks @zlwang-cs!). Please git pull again if you still see the issue.
#32