Comments (9)
Thanks for the list! WizardMath and OpenHermes can reuse the wasm of Mistral (as shown in prebuiltAppConfig in src/config.ts); CodeLlama should be able to reuse that of Llama-2, as long as they share the same quantization (e.g. q4f16_1) and number of params (e.g. 7B or 13B).
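To make the reuse rule concrete, here is a minimal sketch of the matching logic. It does not use the WebLLM API: the `ModelEntry` shape, its field names, the `reuseWasm` helper, the model IDs, and the `example.invalid` URL are all hypothetical and chosen only for illustration. It also assumes that "reuse the wasm of Mistral" means the models share a base architecture, in addition to the quantization and parameter count named above.

```typescript
// Hypothetical record shape; NOT the WebLLM config schema.
interface ModelEntry {
  modelId: string;
  architecture: string; // assumed base architecture, e.g. "mistral" or "llama-2"
  quantization: string; // e.g. "q4f16_1"
  paramSize: string;    // e.g. "7B" or "13B"
  wasmUrl?: string;     // undefined when no wasm has been built for this entry
}

// Two entries are treated as wasm-compatible only if all three fields match.
function libKey(m: ModelEntry): string {
  return [m.architecture, m.quantization, m.paramSize].join("/");
}

// Fill in missing wasm URLs from entries that share the same key.
function reuseWasm(models: ModelEntry[]): ModelEntry[] {
  const known = new Map<string, string>();
  for (const m of models) {
    if (m.wasmUrl !== undefined && !known.has(libKey(m))) {
      known.set(libKey(m), m.wasmUrl);
    }
  }
  return models.map((m) => ({
    ...m,
    wasmUrl: m.wasmUrl ?? known.get(libKey(m)),
  }));
}

// Placeholder entries; the URL is not a real download location.
const resolved = reuseWasm([
  {
    modelId: "Mistral-7B-q4f16_1",
    architecture: "mistral",
    quantization: "q4f16_1",
    paramSize: "7B",
    wasmUrl: "https://example.invalid/mistral-7b-q4f16_1.wasm",
  },
  { modelId: "OpenHermes-7B-q4f16_1", architecture: "mistral", quantization: "q4f16_1", paramSize: "7B" },
  { modelId: "CodeLlama-13B-q4f16_1", architecture: "llama-2", quantization: "q4f16_1", paramSize: "13B" },
]);
```

In this sketch the OpenHermes entry picks up the Mistral wasm because all three fields match, while the CodeLlama 13B entry stays unresolved until a Llama-2 13B entry with the same quantization supplies one.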
from web-llm.
I think I just found the files:
https://huggingface.co/OO8/1_6B_dev/tree/main
Got stuck on:
[FATAL] /workspace/mlc-llm/3rdparty/tvm/include/tvm/runtime/packed_func.h:1908: Function tvmjs.array.decode_storage(0: runtime.NDArray, 1: basic_string<char>, 2: basic_string<char>, 3: basic_string<char>) -> void expects 4 arguments, but 3 were provided.
put_char @ web-llm.bundle.mjs:3421
This is likely due to an old version of the web-llm npm package (if you are not building from source). If you are building from source, it is likely because the repo is not up to date; try pulling the recent changes.
It would be fantastic if this model could become part of the default supported models.
The multi-language ability is fantastic. I'm very impressed with it, especially for its size.
Awesome, it seems the model has already become available in the Huggingface repo. The chunks exist:
However, the .wasm files are missing from binary-mlc-llm-libs. I've created an issue about that: mlc-ai/binary-mlc-llm-libs#111
Thanks for the request! We should be able to add the prebuilt wasm files shortly. cc @YiyanZhai
Fantastic! Thank you!
For the record, I think there are more models for which the shards are available, but the wasm files are not (yet).
- Music
- WizardMath
- Gorilla
- Gemma 7B
- CodeLlama
- OpenHermes