Comments (4)
@ellis-bigelow is working on a ROADMAP document that will be available in the next couple of days.
from kserve.
Hi Ce,
We're hoping to implement a CRD for a "unit of model serving". We're limiting our scope to just this for now, but eventually we may shift towards model server standardization, explainability, and payload logging.
We hope to cover 80% of use cases. See my PR for more details.
If you're curious about the origin of this, see kubeflow/kubeflow#2306
/close
@ellis-bigelow: Closing this issue.
In response to this:
/close