Comments (10)
Maybe I'm misunderstanding something, but "new york" doesn't seem to be mentioned anywhere in the saved information?
from llamasharp.
In that case I'd recommend opening a discussion on the main llama.cpp repo, you'll probably get a better response over there (just because it's a bigger repo, with more people who can answer your question).
from llamasharp.
I was thinking a more general discussion asking for embedding model recommendations.
from llamasharp.
Good catch Martin. I missed one text addition, "my family is from New York"
from llamasharp.
I tried to use https://huggingface.co/intfloat/e5-mistral-7b-instruct but ran into conversion errors.
ggerganov/llama.cpp#4786
from llamasharp.
Did adding that extra text resolve the initial issue?
from llamasharp.
No. The non instruct Mistral 7B v0.1 gave me slightly better matches (0.31 relevance scores) compared to the Mistral 7B Instruct v0.2 (0.2 relevance scores).
Reading stuff on this issue elsewhere, it looks like the way to go is to use a model specifically designed for embedding but I could not find a GGUF version of an embedding model and when I tried to convert it them (the top ones from the MTEB space), they failed with model architecture not supported.
If anyone has a GGUF version of a model that works well for embedding, would appreciate if you point me to it.
Thanks,
Ash
from llamasharp.
I opened one yesterday ggerganov/llama.cpp#4786
Hopefully, someone will respond.
from llamasharp.
Will do
from llamasharp.
Did some more research and I found that lama.cpp does not support embedding models as of now.
This project does https://github.com/xyzhang626/embeddings.cpp
I will try it out tomorrow.
from llamasharp.
Related Issues (20)
- CentOS x86_64 Failed Loading 'libllama.so' HOT 4
- System.TypeInitializationException: 'The type initializer for 'LLama.Native.NativeApi' threw an exception.' HOT 12
- How do I continously print the answer word for word when using document ingestion with kernel memory? HOT 1
- How to rebuild LLamaSharp backends HOT 2
- Namespace should be consistent
- Mamba HOT 10
- Android Backend HOT 2
- [Feature] Allow async model loading and cancellation
- [CI] Add more unit test to ensure the the outputs are reasonable HOT 3
- Take multiple chat templates into account
- [Feature]: Support for Function Calling or Tools HOT 4
- [BUG]: DefragThreshold default is not matching llama.cpp and probably not intended HOT 6
- [BUG]: Answer stop abruptly after contextsize, even with limiting prompt size HOT 1
- [BUG]: Linux cuda version detection could be incorrect HOT 2
- [BUG]: WSL2 has problem running LLamaSharp with cuda11
- Add unit test about long context HOT 2
- Add debug mode of LLamaSharp
- How to better provide system information for LLMs HOT 3
- LLAVA Configuration HOT 4
- [Feature]: 不同的LLM模型,代码要以怎样的方式融合到项目里 HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from llamasharp.