Comments (9)
Kured should ignore this error and reboot the Node anyway
I'm not sure this is the correct default - presumably one has configured a disruption budget for a reason 😄 But certainly we can have it as an option.
I'd rather avoid parsing the output of kubectl drain
, so this will have to wait until after we migrate away from embedding kubectl
to using the library version of drain.
from kured.
Same problem for a standard cluster without any service mesh. There are too cases : the pod ("managed" by the PDB) can't be scheluded on another node, so the situation is blocked in this state, or the pod is rescheduled by Kubernetes on another node, but it takes time so Kured timeouts.
from kured.
Faced a similar issue but for me the pod could have easily been rescheduled by k8s on another node. The PDB is quite simple and only states that min 1 pod is running.
@awh I think that it is the correct default. AFAIU when deleting a pod that has a PDB such as mine, k8s will simply make sure that a second pod is running before killing the old one. Maybe I'm not getting PDBs here but my understanding was, that k8s would take care of fulfilling the PDB.
In the end I simply killed the pod by hand and sure enough a second one was spun up before the first one shut down.
from kured.
One more thing: if this isn't changed I would expect kured to at least not set the node to SchedulingDisabled
. ATM the node is basically drained of pods except for the ones with PDBs.
from kured.
One more thing: if this isn't changed I would expect kured to at least not set the node to
SchedulingDisabled
. ATM the node is basically drained of pods except for the ones with PDBs.
Shouldn't drain first step be "cordon"ing the node (which effectively makes it 'SchedulingDisabled')? I thought that was its internal behaviour.
from kured.
An other issue related to PDB, when this happended, CPU usage is up to 100% and it fills logs with hundred of error per seconds.
from kured.
This issue was automatically considered stale due to lack of activity. Please update it and/or join our slack channels to promote it, before it automatically closes (in 7 days).
from kured.
Should this issue be closed?
from kured.
from kured.
Related Issues (20)
- Usage of host networking and ports should be restricted on stage and perf clusters on kubernetes clusters
- Containers sharing sensitive host namespaces should be avoided HOT 4
- Kured Helm hostNetwork problem HOT 2
- stderr/stdout of custom reboot command HOT 1
- Team Kured is looking for help! HOT 1
- [Feature request] Use cloud APIs to increase node pool size before reboots and decrease after HOT 6
- [Bug] The new dockerhub yaml is broken, ServiceAccount is declared twice HOT 2
- Add Support for Kubernetes 1.28.0
- Unable to receive teams notification. We have the http_proxy as environment variable in kured pod HOT 4
- HashiCorp license change to BSL HOT 2
- Lock TTL not being honored HOT 5
- Build fails on aarch64
- Kured pods crash looping on clusters running Cilium Network plugin HOT 2
- Kured pods are not deployed to all nodes having taints on it in AKS ckuster HOT 3
- KURED supportability of the 6.2 kernel version HOT 2
- kured cordon node before checking if just 1 up HOT 8
- [DOC] Make Control Plane use kured
- AKS Node not rebooted with lock held for not existing node HOT 11
- If a sentinel command is configured, it runs every minute. HOT 8
- path-based reboot mechanism HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from kured.