Comments (4)
Unfortunately, it looks like GitHub Actions doesn't have Windows+ARM available (docs) :(
Edit: Note that this isn't a total blocker. I think we could cross-compile the DLLs from Windows x64 Actions; it just means we couldn't test them.
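Since a cross-compiled DLL could not be run on an x64 runner, one cheap sanity check is to confirm that the binary at least targets ARM64. The following is a minimal sketch, not part of LLamaSharp or its CI: it reads the machine field from the PE/COFF header of a file. The function names `pe_machine` and `dll_machine` are hypothetical, and the check says nothing about whether the library actually works.

```python
import struct

# Machine values defined by the PE/COFF specification.
MACHINE_NAMES = {
    0x014C: "x86",
    0x8664: "x64",
    0xAA64: "arm64",
}


def pe_machine(data: bytes) -> str:
    """Return the target architecture recorded in a PE image's COFF header."""
    if len(data) < 0x40 or data[:2] != b"MZ":
        raise ValueError("not a PE image: missing MZ header")
    # The 4-byte field at offset 0x3C (e_lfanew) holds the offset of the PE signature.
    (pe_offset,) = struct.unpack_from("<I", data, 0x3C)
    if data[pe_offset:pe_offset + 4] != b"PE\0\0":
        raise ValueError("not a PE image: missing PE signature")
    if len(data) < pe_offset + 6:
        raise ValueError("truncated COFF header")
    # The 2-byte Machine field follows the signature immediately.
    (machine,) = struct.unpack_from("<H", data, pe_offset + 4)
    return MACHINE_NAMES.get(machine, hex(machine))


def dll_machine(path: str) -> str:
    """Hypothetical helper: inspect the first 4 KiB of a DLL on disk."""
    with open(path, "rb") as f:
        return pe_machine(f.read(4096))
```

In a build job, something like `dll_machine("llama.dll")` could be compared against `"arm64"` before the artifact is uploaded; the path here is only an example.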
from llamasharp.
Hi, we're certainly willing to add such a backend. Could you please test whether it works well with the current LLamaSharp? Although we could compile and test it with GitHub Actions, debugging would take much longer since we don't have an ARM Windows device. So if you'd like to help, it'll save a lot of time. :)
Here are some steps for it:
- Clone llama.cpp and check out `3ab8b3a92ede46df88bc5a2dfca3777de4a2b2b6`.
- Compile it on your PC to get a `llama.dll` file.
- Modify the code here to use `NativeLibraryConfig.Instance.WithLibrary` to load your compiled library file.
- Run the examples one by one and see if they all work well.
If all of the above goes well, all we need to do is add it to CI and release the next version. :)
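The first two steps above can be sketched as a small script. This is only an illustration under stated assumptions: the repository URL, the CMake invocation with `-DBUILD_SHARED_LIBS=ON`, and the helper names `build_plan` and `run_plan` are not from this thread, and the exact flags needed on a Windows ARM machine may differ.

```python
import subprocess
from pathlib import Path

# Assumption: the upstream llama.cpp repository URL (not stated in this thread).
LLAMA_CPP_URL = "https://github.com/ggerganov/llama.cpp"
# The commit named in the steps above.
COMMIT = "3ab8b3a92ede46df88bc5a2dfca3777de4a2b2b6"


def build_plan(workdir, commit=COMMIT):
    """Return the commands to clone, check out, and build llama.cpp as a shared library."""
    src = str(Path(workdir) / "llama.cpp")
    build = str(Path(workdir) / "llama.cpp" / "build")
    return [
        ["git", "clone", LLAMA_CPP_URL, src],
        ["git", "-C", src, "checkout", commit],
        # Assumption: a shared-library CMake build is what produces llama.dll.
        ["cmake", "-S", src, "-B", build, "-DBUILD_SHARED_LIBS=ON"],
        ["cmake", "--build", build, "--config", "Release"],
    ]


def run_plan(plan):
    """Run each command in order, stopping at the first failure."""
    for argv in plan:
        subprocess.run(argv, check=True)
```

Calling `run_plan(build_plan("work"))` would execute the commands in sequence. Pointing LLamaSharp at the resulting file is then done on the C# side through `NativeLibraryConfig.Instance.WithLibrary`, as described in the third step.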
And using the ARM DLL from llama.cpp's official releases isn't a viable workaround?
> And using the ARM DLL from llama.cpp's official releases isn't a viable workaround?

Using DLLs from llama.cpp's releases would complicate the automatic binary updates in CI, but I think we could make it work once you've confirmed that LLamaSharp works well with that DLL.