Comments (18)
@neubig @kudaibergenu Just updated the npm to 0.2.24: #323
This should fix it. Let us know if issues persist. Thank you!
from web-llm.
Thanks for the fast response @CharlieFRuan !
I upgraded to the latest version and it does seem to not be hanging anymore, so I'll close the issue.
from web-llm.
I am experiencing the same problem. So far, I understood the problem starts once it tries to pull the models from the cache.
from web-llm.
Apologies for the inconvenience, I'll look into this issue today. Meanwhile, clearing chrome's cache might alleviate the problem.
from web-llm.
I'm still experiencing this issue even after updating to the latest version (0.2.24).
Would you be open to further investigation? Some notes from my own experience:
- It happens 90%+ of the time
- The progress bar usually gets about 70-80% finished before the crash
- Clearing the Chrome cache temporarily fixes the problem, but it begins immediately again on the next reload from cache
- I reproduced it at least once from the "get-started" example (not Chrome extension), but it seems to happen somewhat less often than in the Chrome extension.
Happy to help chase this down - thanks!
from web-llm.
Hi @mrsheep76 what is the error that you are seeing? Is it also exit(1)
? Would be great if you could share a screenshot of the console/UI
from web-llm.
Hi @CharlieFRuan - yes, same error as described at the top of this bug - exit(1)
. I made sure to rebuild the plug-in (npm run build
) with the new version of the npm with your changes. Here is a Console screenshot, which I took from inspecting the plug-in popup:
And here is the UI after it crashed:
Let me know if there's anything else I can do to gather information - and thanks for investigating!
from web-llm.
And actually I'm still encountering crashes as well, although it seems less frequent than before.
from web-llm.
@mrsheep76 @neubig Thanks for the info, I'll look into it today!
from web-llm.
@mrsheep76 @neubig I was not able to replicate the error (either via simple chat or the chrome extension example). But I published 0.2.25, with the hope of fixing this (assuming the issue is due to parallelly downloading too many shards at the same time). For more description, see #328. Let me know, thanks!
from web-llm.
@CharlieFRuan Thanks for the attempted fix! Unfortunately I'm still getting crashes 90%+ of the time when loading from cache -- same frequency as before from what I can tell. It does occasionally work, so I wonder if there's a race condition somewhere?
Is there any way for me to get a more informative error message and pass it to you, since you cannot reproduce it yourself? The exit(1)
is obviously pretty generic...
from web-llm.
Hmm I see; before we further dig into this, could you perhaps try a smaller model, say Gemma 2B (assuming that you are using Mistral 7B q4f16, the default one in the chrome extension example). Not sure how much RAM you have, but it could be that the model is too large? See discussion in #324 for specifics.
from web-llm.
And I just noticed that it states Loading model from cache
in the log, rather than Fetching param cache
. This means that all shards of the model parameters are already in the browser cache. Perhaps you can try to clear the cache, and let it fetch the params again. The Cached images and files
in chrome should do it:
from web-llm.
I also further bumped to 0.2.26, which adds an additional try-catch, attempting to provide more information to the exit(1)
.
from web-llm.
Thanks! I tried 4-5 times now, including after clearing cache and every time worked! So maybe this is fixed with the new version. Closing for now, thanks @CharlieFRuan !
from web-llm.
@neubig Thanks for confirming; feel free to reopen any time!
from web-llm.
Good news - it is now working for me as well!
I had a temporary glitch with my build environment during my last test run, so I suspect one of the earlier fixes actually did the trick (perhaps limiting the number of concurrent threads)?
In any case, it is working consistently now. Thank you for all the help!
from web-llm.
@mrsheep76 Thanks, that is great to hear! Feel free to open new issues if there are other problems
from web-llm.
Related Issues (20)
- Different instruct based models HOT 3
- Invalidate cached WebAssembly binaries HOT 9
- getting error Init error, NetworkError: Cache.add() encountered a network error HOT 8
- [BUG] Cannot find global function vm.builtin.apply_presence_and_frequency_penalty HOT 4
- The chrome extension example does not work HOT 5
- How to get in touch regarding a security concern HOT 2
- GPU Device Error: Uncaught (in promise) TypeError: lib$1.exports.detectGPUDevice is not a function HOT 7
- [TECHNICALDEBT] Put ChatRestModule in shape HOT 2
- I am seeing 100% RAM usage in my laptop when running this. Can you please let me know if its an issue or the minimum RAM requirement is higher than 8 GB? HOT 1
- Not working HOT 2
- WebCPU? HOT 3
- try to run gemma-7b but failed HOT 1
- simple-chat: error during loading params onto WebGPU, GPUPipelineError: A valid external Instance reference no longer exists HOT 3
- Simple-chat is successfully deployed on the PC, an error occurs:Init error, TypeError: crypto.randomUUlD is not a function when I accesses the web port using the smart phone browser HOT 1
- ES module build of web-llm
- Error: Unknown conv template gpt2 when using prebuild gpt2 in examples/get-started HOT 2
- [Announcement] Breaking changes regarding conversation template HOT 1
- CLI run on MacOS error "unrecognized arguments: --allreduce_strategy=RING" HOT 1
- Model request: new Mistral 7B with 32K context HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from web-llm.