We want to allow the user to run kfserving components without the dependency on KNativ

Non-KNative Resource Option about kserve HOT 15 CLOSED

kserve commented on July 27, 2024

Non-KNative Resource Option

from kserve.

Comments (15)

gaocegege commented on July 27, 2024

I suggest giving this feature high priority. We (Caicloud) have an internal version of a Serving CRD, which is based on Kubernetes Ingress/Service and Istio. I think we should support different serving backends to be more general.

How about defining an interface for serving backends and using annotations or CLI flags in the controller to choose which backend (KNative, Istio, Linkerd, Kubernetes native Ingress/Service) is used?
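A minimal sketch of what such a pluggable backend interface could look like, written in Python for brevity. Everything here is an assumption made for illustration: the annotation key `serving.example.org/backend`, the `ServingBackend` protocol, the two backend classes, and the resource names they return are hypothetical and do not come from the kfserving code base.

```python
from typing import Dict, List, Protocol

# Hypothetical annotation key; the thread does not name one.
BACKEND_ANNOTATION = "serving.example.org/backend"


class ServingBackend(Protocol):
    """What a controller would need from any serving backend (illustrative)."""

    def resources_for(self, service_name: str) -> List[str]:
        """Return the resources this backend would create for a service."""
        ...


class KnativeBackend:
    def resources_for(self, service_name: str) -> List[str]:
        # A single higher-level resource; routing and scaling are delegated.
        return [f"knative-service/{service_name}"]


class RawKubernetesBackend:
    def resources_for(self, service_name: str) -> List[str]:
        # Plain Deployment plus Service, with no canary support.
        return [f"deployment/{service_name}", f"service/{service_name}"]


# Registry of known backends, keyed by the annotation value.
BACKENDS: Dict[str, ServingBackend] = {
    "knative": KnativeBackend(),
    "kubernetes": RawKubernetesBackend(),
}


def select_backend(annotations: Dict[str, str],
                   default: str = "knative") -> ServingBackend:
    """Pick a backend from resource annotations, falling back to the default."""
    name = annotations.get(BACKEND_ANNOTATION, default)
    try:
        return BACKENDS[name]
    except KeyError:
        raise ValueError(f"unknown serving backend: {name!r}") from None
```

A resource without the annotation would get the default backend, while one annotated with `kubernetes` would get plain Deployments and Services. A CLI flag on the controller could feed the `default` argument in the same way.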


gaocegege commented on July 27, 2024

/kind feature


yuzisun commented on July 27, 2024

We actually discussed the annotation approach in our WG meeting today.


ukclivecox commented on July 27, 2024

@gaocegege That sounds good. To handle canaries and blue/green routing the annotation would indeed need to specify the routing resources to utilize:

KNative (default?)
Istio
Linkerd
Envoy


gaocegege commented on July 27, 2024

@cliveseldon KNative by default SGTM. Using Kubernetes native resources to handle canaries and blue/green routing may be hard, while it is feasible to use istio/linkerd/knative. Thus the list LGTM.

@yuzisun Is there any calendar for the SIG meeting? I am interested in it, too.


ellistarn commented on July 27, 2024

@gaocegege, I've added you to the meetings.


ellistarn commented on July 27, 2024

I want to minimize the number of implementations. I'm all for pluggability, but we need to make sure we're delivering clear customer value. Knative itself is working on the pluggability question, so we may be able to simply rely on their improvements.

Short term, as we discussed in our KFSWG meeting, the best value for effort will be:

Default Knative impl w/ canarying & all the bells
Annotated raw k8s impl w/o canarying


ellistarn commented on July 27, 2024

Also @gaocegege, in our discussions so far, we've limited the networking responsibilities of KFServing to in-cluster communication. If you have Knative/Istio installed, you can use their LoadBalancer Service for ingress, but we're not providing an ingress solution ourselves.

I'm not sure we want to be creating ingress resources, as they don't map 1:1 with a KFService. If there's some disagreement here, this might be a great topic for a KFSWG Special Topics meeting.


rakelkar commented on July 27, 2024

To clarify, what does ingress resource include? I assume you mean any setup of ingress load balancers, external IP etc. right? I also assume it doesn't include ingress rules. In #6 we discussed an internal and external URL as part of the status field... I assumed to get an external URL the implementation would have to write some sort of implementation specific ingress rule?


ukclivecox commented on July 27, 2024

I think creating raw k8s resources (svcs, deployments) only makes sense if the whole kfserving spec can be satisfied. If canaries, blue/green, and scaling can't be handled, then the spec is broken for people who don't want to use KNative.

Just looking at istio, it seems they don't have Go types yet. See here. KNative uses their own versions which is probably ok to reuse for now.

We should decide whether to:

Make this issue a non-goal and make the top-level README clear that this is a KNative-only implementation
Leave it for later incorporation
Handle full functionality, which might need to include Istio and Linkerd as well as HPAs for autoscaling


yuzisun commented on July 27, 2024

The value of implementing the spec with Istio is still a bit unclear to me. Correct me if I am wrong:

To have canary rollout, you need to create an Istio VirtualService, set up Istio networking, and have some way to clean up the deployments, like Knative revision GC. I am not sure it is worth the effort to reimplement exactly what Knative Serving has done.
Knative Serving is pretty lightweight, and you get autoscaling down to zero for free.
Future integration with KF Pipelines would probably need Knative Eventing.

I think Knative is the serverless layer on top of Istio, and kfserving needs a serverless solution. Whether Istio or Linkerd sits underneath can be Knative's choice, and I know there is a conversation about Knative supporting other service meshes.
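To make the canary point concrete, here is a rough sketch of the traffic-splitting part only, assuming the default and canary versions are exposed as two separate Kubernetes Services. The function name, its parameters, and the host names are hypothetical. The manifest uses the Istio `VirtualService` weighted-route fields. Cleaning up old deployments, which is the harder part mentioned above, is not covered.

```python
from typing import Any, Dict


def canary_virtual_service(name: str, default_host: str, canary_host: str,
                           canary_percent: int) -> Dict[str, Any]:
    """Build an Istio VirtualService manifest that splits traffic by weight.

    Hypothetical helper: a non-Knative implementation would still have to
    create, update, and garbage-collect this resource itself.
    """
    if not 0 <= canary_percent <= 100:
        raise ValueError("canary_percent must be between 0 and 100")
    return {
        "apiVersion": "networking.istio.io/v1alpha3",
        "kind": "VirtualService",
        "metadata": {"name": name},
        "spec": {
            "hosts": [name],
            "http": [{
                # Weights across the routes of one rule must sum to 100.
                "route": [
                    {"destination": {"host": default_host},
                     "weight": 100 - canary_percent},
                    {"destination": {"host": canary_host},
                     "weight": canary_percent},
                ],
            }],
        },
    }
```

For example, `canary_virtual_service("my-model", "my-model-default", "my-model-canary", 10)` would send 90% of requests to the default Service and 10% to the canary.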


gaocegege commented on July 27, 2024

@yuzisun Knative is built on top of Istio and offers a higher-level abstraction. In China, however, AFAIK few companies run Knative in production. They may have Istio installed but do not use Knative, so they cannot use our Serving CRD.

In the early stage, I agree that we could focus on the Knative implementation, but I hope we keep the design extensible so that more backends can be supported to meet the needs of the majority of applications.


ellistarn commented on July 27, 2024

Agreed @gaocegege . Given how young this project is, I see multiple implementations and pluggability as much lower priority. When we do consider where and how we make this system pluggable, we need to be really thoughtful about what value we're delivering (unless of course, someone wants to provide the resources themselves).

Knative is in Beta and growing massively. I could imagine that in a year it will be as standard as Istio, so we wouldn't necessarily want to spend too much effort. Let's make the core functionality available and see where the customer gaps are and address them then.


gaocegege commented on July 27, 2024

SGTM. thanks.


ellistarn commented on July 27, 2024

Given the direction of this project, its growing feature set, and our impending reliance on eventing, I'm closing this as a non-starter.
