License: Apache License 2.0

Makefile 28.23% Shell 5.08% Go 66.69%

cluster-proportional-vertical-autoscaler's Issues

Rename "increment" to "step" ?

Since we tend to describe it as a step-function, maybe we should use stepSize and coresPerStep ?

Add support for arm64 images

What would you like to be added:

Images build and published through this repository are only amd64 compatible. We should enable a multi-arch docker build to also offer arm64 images.

Move this repo to kubernetes-retired?

This repo hasn't seen any (non-administrivia) activity since about a year ago. In comparison https://github.com/kubernetes-incubator/cluster-proportional-autoscaler has had a number of commits over the past two months.

Is this repo going to be actively maintained, or is it time to move this to kubernetes-retired?

ref: #4

Support for Statefulset

We would like to have a autoscaler for statefulset which scales based on number of nodes. I have a couple of scenario in my architecture where I think it should be useful. I am not aware of any such solution available in OSS. I think it would be good idea to extend this for statefulset. I am happy to help out with the PR but I would like to know any concerns regarding this proposal that I should be aware of.

Link for RBAC outdated in README.md

Link for RBAC is outdated in
https://github.com/kubernetes-sigs/cluster-proportional-vertical-autoscaler/blob/master/examples/README.md

Create a SECURITY_CONTACTS file.

As per the email sent to kubernetes-dev[1], please create a SECURITY_CONTACTS
file.

The template for the file can be found in the kubernetes-template repository[2].
A description for the file is in the steering-committee docs[3], you might need
to search that page for "Security Contacts".

Please feel free to ping me on the PR when you make it, otherwise I will see when
you close this issue. :)

Thanks so much, let me know if you have any questions.

(This issue was generated from a tool, apologies for any weirdness.)

[1] https://groups.google.com/forum/#!topic/kubernetes-dev/codeiIoQ6QE
[2] https://github.com/kubernetes/kubernetes-template-project/blob/master/SECURITY_CONTACTS
[3] https://github.com/kubernetes/community/blob/master/committee-steering/governance/sig-governance-template-short.md

Update references to kubernetes-incubator

The kubernetes-incubator org has been deprecated - kubernetes/community#1922.

This repo still contains kubernetes-incubator in it's go.mod and import paths, so these should be updated to kubernetes-sigs.

Release v0.8.2

We need to create a new release to pick up #32.

cc @ialidzhikov
/assign

Vertical autoscaler doesn't support apps/v1

Noticed the issue when trying to upgrade Calico for latest k8s and saw vertical autoscaler is crash looping. It seems that current vertical autoscaler doesn't support apps/v1.

CPVPA fails if some API is not discoverable

If there are APIServices that are not discoverable then the CPVPA is crashlooping:

$ kubectl get apiservice  | grep metrics-ad
v1beta1.custom.metrics.k8s.io          kube-system/kube-metrics-adapter   False (MissingEndpoints)   26d
v1beta1.external.metrics.k8s.io        kube-system/kube-metrics-adapter   False (MissingEndpoints)   26d

$ kubectl -n kube-system logs calico-typha-vertical-autoscaler-5557c6d7d-2sd7c
I0420 04:39:57.408180       1 autoscaler.go:46] Scaling namespace: kube-system, target: deployment/calico-typha-deploy
E0420 04:39:59.408093       1 autoscaler.go:49] failed to discover apigroup for kind "Deployment": unable to retrieve the complete list of server APIs: custom.metrics.k8s.io/v1beta1: the server is currently unable to handle the request, external.metrics.k8s.io/v1beta1: the server is currently unable to handle the request

Is this desired behaviour? If yes, why does the CPVPA need to discover the full API?

TODO: Add --dry-run

Add a flag to not actually patch the target.

Release v0.8.3

A new release is needed to pick up #36.

/assign
cc @rfranzke

Use very old and vulnerable go version 1.12.

When do you plan to upgrade?

Cannot patch apps/v1 deployment

In the logs I see the following entry repeated:

E1018 09:08:03.582625       1 autoscaler_server.go:153] Update failure: patch failed: Deployment.apps "foo" is invalid: spec.template.spec.containers[0].image: Required value

Kubernetes version - v1.16.2, v1.15.6
Deployment apiVersion - apps/v1
kubernetes-incubato/cluster-proportional-vertical-autoscaler version - v0.8.1 (k8s.gcr.io/cpvpa-amd64:v0.8.1)

cpva fails with "unknown target kind: Tap"

What happened:
cpva fails to start when deployment is available under multiple API groups.

How to reproduce it (as minimally and precisely as possible):

Install linkerd. See https://linkerd.io/2/getting-started/
Ensure that there are multiple API groups serving resource deployments:

$ k api-resources
NAME                              SHORTNAMES   APIGROUP                       NAMESPACED   KIND
# ...
daemonsets                        ds           apps                           true         DaemonSet
deployments                       deploy       apps                           true         Deployment
# ...
daemonsets                        ds           tap.linkerd.io                 true         Tap
deployments                       deploy       tap.linkerd.io                 true         Tap

Ensure that cpva fails with

$ k logs cpva -n kube-system
I0217 20:41:29.699612       1 autoscaler.go:46] Scaling namespace: kube-system, target: deployment/calico-typha-deploy
E0217 20:41:30.799782       1 autoscaler.go:49] unknown target kind: Tap

Anything else we need to know?:

Environment:

Kubernetes version (use kubectl version):

$ k version
Client Version: version.Info{Major:"1", Minor:"17", GitVersion:"v1.17.4", GitCommit:"8d8aa39598534325ad77120c120a22b3a990b5ea", GitTreeState:"clean", BuildDate:"2020-03-12T23:41:24Z", GoVersion:"go1.14", Compiler:"gc", Platform:"darwin/amd64"}
Server Version: version.Info{Major:"1", Minor:"17", GitVersion:"v1.17.4", GitCommit:"8d8aa39598534325ad77120c120a22b3a990b5ea", GitTreeState:"clean", BuildDate:"2020-03-12T20:55:23Z", GoVersion:"go1.13.8", Compiler:"gc", Platform:"linux/amd64"}

Image: k8s.gcr.io/cpvpa-amd64:v0.8.1

TODO: merge with CPHPA?

Multiple targets for scale up

Currently as far as I understand the component allows scaling only 1 target per deployment i.e. 1 instance of CPVA can scale 1 deployment/replicaset. Can we allow specifying a list of targets in --targets along with a list of configmap, this will save us resources in launching separate containers for each scaling requirement. I can help in implementing the same, I would like to know if there is any concern or suggestion around this feature.

CHangelog not updated

cluster-proportional-vertical-autoscaler changelog is not updated for current release
last updated release was 0.0.0 and current release is 0.8.3

Use dep or glide instead of godep

Outdated release steps and image location in RELEASE.md

image location and release steps mentioned in https://github.com/kubernetes-sigs/cluster-proportional-vertical-autoscaler/blob/master/RELEASE.md are outdated

Log necessary info on scale event and avoid logging when doing nothing

The autoscaler currently doesn't do a very good job on logging from what we have observed. Specifically it would be helpful to log the current node/cpu count when the scale event happens, so that it would be clear how that decision is made. Also it would be great to not log too frequently when doing nothing to reduce the noise.

cc @lzang

Strategic Merge Patch changes the order of the containers list

Problem

The patch being generated from a map, which are unordered, can cause the order of the containers list to change:

cluster-proportional-vertical-autoscaler/pkg/autoscaler/k8sclient/k8sclient.go

Lines 328 to 335 in 4a01437

 func (k *k8sClient) UpdateResources(resources map[string]apiv1.ResourceRequirements) error { 

 ctrs := []interface{}{} 

 for ctrName, res := range resources { 

 ctrs = append(ctrs, map[string]interface{}{ 

 "name": ctrName, 

 "resources": res, 

 }) 

 }

This creates a perpetual diff, especially noticeable in tools monitoring drift at all time like Argo CD.

Potential solutions

I have thought of 4 potential solutions:

Load the configuration as an ordered map with a library supporting it, maybe challenging to find something parsing JSON directly into it
Add an optional configuration field to specify the order, for example:

"containerA": {
  // ...
  "order": 1
}
"containerB": {
  // ...
  "order": 2
}

Get the order from the deployment. It would not require any additional configuration from the user, but would require get permission on the deployment
Change the configuration from a map to a list (breaking change):

[
  {
    "name": "containerA",
    "requests": {
      "cpu": {
        "base": "10m", "step": "1m", "coresPerStep": 1
      },
      "memory": {
        "base": "8Mi", "step": "1Mi", "coresPerStep": 1
      }
    }
  },
  {
    "name": "containerB",
    "requests": {
      "cpu": {
        "base": "250m", "step": "100m", "coresPerStep": 10
      }
    }
  }
]

Additional information

This is similar to the issue faced here: kubernetes/kubernetes#62830

	func (k *k8sClient) UpdateResources(resources map[string]apiv1.ResourceRequirements) error {
	ctrs := []interface{}{}
	for ctrName, res := range resources {
	ctrs = append(ctrs, map[string]interface{}{
	"name": ctrName,
	"resources": res,
	})
	}

kubernetes-sigs / cluster-proportional-vertical-autoscaler Goto Github PK

cluster-proportional-vertical-autoscaler's Issues

What would you like to be added:

Problem

Potential solutions

Additional information

Recommend Projects

Recommend Topics

Recommend Org

Jobs