Comments (2)
This issue is stale because it has been open for 7 days with no activity.
from inference.
This issue was closed because it has been inactive for 5 days since being marked as stale.
from inference.
Related Issues (20)
- Upgrade vllm and sglang to new version and support gemma model correctly HOT 7
- 注册自定义模型后,测试页面不可用 HOT 1
- docker启动时报错,详情见具体异常 HOT 2
- Both `max_new_tokens` (=512) and `max_length`(=518) seem to have been set. `max_new_tokens` will take precedence. Please refer to the documentation for more information. HOT 3
- [ maybe a bug ] Occasional exceptions occurred when reasoning with the mlx model yi-1.5-9b-chat HOT 1
- Failed start when base image from pytorch/pytorch:2.1.2-cuda12.1-cudnn8-devel to vllm/vllm-openai:latest HOT 4
- 支持IPU加速吗? HOT 2
- xinference微调模型的支持 HOT 1
- Qwen1.5-14b-chat-gptq-int4 推理速度 HOT 1
- Failed to do inference with latest GLM-4 chat 9b model HOT 2
- v1/completions接口无法使用,返回空字符串 HOT 1
- 显示启动模型失败,load失败 HOT 2
- Failed to register model, Invalid model URI D:/Pretrainedmodels3/ZhipuAI/chat4/glm-4-9b-chat. HOT 1
- 建议新增对图embedding模型的 HOT 1
- 使用xinference的api服务调用,当过多请求的时候,xinference本地api会直接卡死 HOT 7
- Attention mask size mismatch error and question about input choice HOT 1
- 关于注册自定义模型的prompt_style参数说明 HOT 1
- ui界面可以支持audio模型 指定worker启动吗 HOT 1
- 增加embedding多卡分布式部署能力 HOT 1
- k8s拉起xinference能够pod,running,但是内置的模型,不能运行起来;但是手动进入pod里面,执行命令后,能够把模型运行起来,显存成功占用,是为什么 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from inference.