Comments (5)
Could it be that you are running out of RAM while trying to load the model? Can you try python server.py --auto-devices?
from text-generation-webui.
With python server.py --auto-devices I get the same thing:
Loading pygmalion-6b...
Auto-assiging --gpu-memory 7 for your GPU to try to prevent out-of-memory errors.
You can manually set other values.
Killed
By the way, the cudatoolkit version installed in my WSL2 is 12.0. Could that have something to do with this error?
One more thing: RAM usage spikes to almost 90% and GPU usage to around 5%. Once the process gets killed, everything drops back to the usual levels, RAM to 50% and GPU to 1%. I have 16 GB of RAM and an RTX 3060 Ti (8 GB VRAM only).
Maybe this information helps a bit, so I noted it here.
Can you try running git pull inside text-generation-webui to update the repository and then run python server.py --auto-devices again? I added back low_cpu_mem_usage=True. Maybe it will help.
Have you set a RAM limit for WSL? It will default to half your system RAM unless you specify otherwise. Details for this can be found here.
https://learn.microsoft.com/en-us/windows/wsl/wsl-config
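For anyone hitting the same limit, here is a minimal sketch of what raising it could look like. It assumes the settings file is `.wslconfig` in the Windows user profile folder and uses the `memory` key under the `[wsl2]` section, as described on the linked page. The value of 12 GB and the helper name `build_wslconfig` are only illustrations, not recommendations from this thread.

```python
import configparser
import io


def build_wslconfig(memory_gb: int) -> str:
    """Return the text of a .wslconfig that caps WSL 2 memory at memory_gb."""
    config = configparser.ConfigParser()
    config.optionxform = str  # keep key names exactly as written
    config["wsl2"] = {"memory": f"{memory_gb}GB"}
    buffer = io.StringIO()
    config.write(buffer, space_around_delimiters=False)
    return buffer.getvalue()


# Hypothetical value: leave a few GB for Windows on a 16 GB machine.
wslconfig_text = build_wslconfig(12)
print(wslconfig_text)
```

The printed text would be saved as `.wslconfig` in the Windows user profile folder. WSL has to be restarted (for example with `wsl --shutdown`) before the new limit applies.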
Thanks, it fixed the problem. But I don't know why my whole PC freezes when I try --load-in-8bit. Does this have to do with WSL2 or with my GPU?
Edit: Nothing seems to fix this. It's most probably due to my GPU; I guess an RTX 3060 Ti is not enough for 8-bit.
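A rough back-of-the-envelope estimate may help put the numbers in this thread in perspective. The sketch below assumes that "6b" in the model name means roughly 6 billion parameters, and it counts only the raw weights: no activations, no loader overhead, nothing else running on the GPU. It is not a diagnosis of the freeze, only arithmetic.

```python
GIB = 1024 ** 3


def weights_gib(n_params: float, bytes_per_param: float) -> float:
    """Approximate size of the raw weights in GiB (overhead not included)."""
    return n_params * bytes_per_param / GIB


# Assumption: pygmalion-6b has about 6e9 parameters.
N_PARAMS = 6e9

fp16_gib = weights_gib(N_PARAMS, 2)  # 16-bit weights, 2 bytes each
int8_gib = weights_gib(N_PARAMS, 1)  # 8-bit weights, 1 byte each

# Figures quoted in the thread.
system_ram_gib = 16
wsl_default_gib = system_ram_gib / 2  # "half your system RAM" by default
vram_gib = 8

print(f"fp16 weights: ~{fp16_gib:.1f} GiB (WSL default limit: {wsl_default_gib:.0f} GiB)")
print(f"int8 weights: ~{int8_gib:.1f} GiB (VRAM: {vram_gib} GiB)")
```

Under these assumptions the 16-bit weights alone exceed the default WSL memory limit, which is consistent with the process being killed before the limit was raised. The 8-bit weights would fit in 8 GB of VRAM on paper, but with little headroom for everything else.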