Comments (7)
@martindevans - I was just about to try with the demos to see if I run into the same issue. My code is very similar but maybe I have something messing up. I will update this post with my findings.
from llamasharp.
I built from the current main branch and used the 4.2-preview CUDA backend and all seems to be working.
Does the backend nuget packages have source on GitHub too? I was thinking it would just be another project under the LLamaSharp solution, but I didn't see it.
from llamasharp.
Is this the model you're using? https://huggingface.co/TheBloke/Nous-Hermes-13B-GGML/tree/main (if it is I'll try it out locally)
from llamasharp.
It is. So crazy thing... I noticed that I am not getting good results with that model under stateless. I have to run under interactive mode to get good results. In that mode it seems to do great in general. One thing I noticed is that after I create the executor I need to savestate and then prior to each Infer I have to loadstate. If I don't do that the model just returns nothing after the first prompt. Using savestate to save off a clean state and then using loadstate prior to each infer gives me pretty good responses.
I am still learning all of this stuff, so maybe I am doing something wrong.
from llamasharp.
I tried that model in demo 2 (Interactive mode) and 4 (stateless mode) and both seemed to work correctly for me. Do you see the problem in one of these demos (or any other demo?)
from llamasharp.
Its my understanding that the first prompt will return nothing, as the first prompt is you giving it the context instructions, some models will just return nothing or a simple "good luck" message
Then any further prompting should work as desired.
EG:
First Prompt: Your a a crazy robot, respond with backward words
=No sesponse
Second promt: [the users input]
=should respond as normal
from llamasharp.
I'm not really sure of the details, @AsakusaRinne builds the backend packages and should be able to tell you more.
from llamasharp.
Related Issues (20)
- Debian 12 x LLamaSharp 0.11.2 Crashed Silently HOT 6
- [Proposal] Backend-free support HOT 12
- Embeddings Change In April Update HOT 6
- NativeLibraryConfiguration WithLogs HOT 2
- LLAMA 3 HOT 10
- Access Violation in SafeLlamaContextHandle.Decode HOT 7
- [Proposal] Refactor the mid-level and high-level implementations of LLamaSharp HOT 5
- CentOS x86_64 Failed Loading 'libllama.so' HOT 4
- System.TypeInitializationException: 'The type initializer for 'LLama.Native.NativeApi' threw an exception.' HOT 12
- How do I continously print the answer word for word when using document ingestion with kernel memory? HOT 1
- How to rebuild LLamaSharp backends HOT 2
- Namespace should be consistent
- Mamba HOT 10
- Android Backend HOT 2
- [Feature] Allow async model loading and cancellation
- [CI] Add more unit test to ensure the the outputs are reasonable HOT 3
- Take multiple chat templates into account
- [Feature]: Support for Function Calling or Tools HOT 1
- [BUG]: DefragThreshold default is not matching llama.cpp and probably not intended HOT 6
- [BUG]: Answer stop abruptly after contextsize, even with limiting prompt size HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from llamasharp.