Comments (2)
I've been seeing this again with the following setup: local dev, single luke@prime runner.
# Fire 10 chat completion requests concurrently against the local API
for i in $(seq 1 10); do curl --request POST \
  --url http://localhost:8080/v1/chat/completions \
  --header 'Authorization: Bearer hl-95i-ORkQBZmUbVJHWIo04yNdylT8w3gBCLX_L3zPF7k=' \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "llama3:instruct",
    "messages": [
      {
        "role": "user",
        "content": "Write a 1000 word essay on Yorkshire."
      }
    ]
  }' & done
results in:
runner-1 | 2024-08-14T18:55:29Z ERR api/pkg/runner/llm_ollama_model_instance.go:156 > error processing request error="failed to get response from inference API: Post \"http://localhost:43851/v1/chat/completions\": dial tcp 127.0.0.1:43851: connect: connection refused" session_id=
runner-1 | 2024-08-14T18:55:29Z ERR api/pkg/runner/llm_ollama_model_instance.go:159 > detected connection refused, exiting and hoping we get restarted - see https://github.com/helixml/helix/issues/242
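The loop above backgrounds every request and never reports how many of them failed. A minimal sketch of a wrapper that does, in case it helps anyone else reproduce this: `run_parallel` is a hypothetical helper name (not part of helix), and it only counts non-zero exit codes from whatever command it is given.

```shell
#!/usr/bin/env bash
# run_parallel N CMD [ARGS...]
# Hypothetical helper: starts N copies of CMD in the background,
# waits for all of them, and prints how many exited non-zero.
run_parallel() {
  local n=$1; shift
  local pids=() failed=0 i pid
  for ((i = 0; i < n; i++)); do
    "$@" >/dev/null 2>&1 &   # discard output, keep only the exit status
    pids+=("$!")
  done
  for pid in "${pids[@]}"; do
    wait "$pid" || failed=$((failed + 1))
  done
  echo "$failed"
}
```

Used with the same request as above, something like `run_parallel 10 curl --fail --silent --request POST --url http://localhost:8080/v1/chat/completions ...` would print the number of requests that did not succeed; `--fail` makes curl exit non-zero on HTTP error responses.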
from helix.
Fixed in https://github.com/helixml/helix/releases/tag/0.10.2