Comments (10)
Reopening to track unpinning per #45577 (comment). Changing milestone so it doesn't appear it's blocking 2.9.0 release.
from rancher.
@Oats87 , yes - as this one is for the flaky tests that were addressed by pinning the version so using this one to track version unpinning. #45577 (comment)
from rancher.
While I am not seeing the flakiness of the cert rotation I see a more general breakage reported around cluster provisioning, it seems.
This is for a PR sitting on release/v2.9
: #45269
In case it matters, a local k3s-based Rancher starts up just fine for that PR.
My latest drone logs are at
The message is generally the same across various failures:
... failed on: prov cluster is not ready: timeout waiting condition: context deadline exceeded
Question: Is each test creating its own cluster ? And removing it later ?
Because I see that there are passing tests too.
I see only 2 tests fail in in each of the 5 provisioning stages, mostly different across the stages.
Failing: Test_Provisioning_Custom_OneNodeWithDelete
, Test_Provisioning_MP_SingleNodeAllRolesWithDelete
, Test_Provisioning_Custom_ThreeNode
, Test_Operation_SetA_Custom_CertificateRotation
, Test_Operation_SetA_MP_CertificateRotation
, Test_Operation_SetB_Custom_EtcdSnapshotOperationsOnNewCombinedNode
, Test_Operation_SetB_MP_EtcdSnapshotOperationsWithThreeEtcdNodesOnNewNode
In the build-pr
failures I see the same failed on ...
message, after the unit tests were run and passed.
from rancher.
Created a PR without material code changes (comment fix).
Seeing failed builds there too, see https://drone-pr.rancher.io/rancher/rancher/39168
However none of the context deadline exceeded
from my branch :(
Now wondering if the addition of the status field, and its handling slowed something down enough to trigger these timeouts.
from rancher.
Yeah, I'm still getting Test_Operation_SetB_MP_EtcdSnapshotOperationsWithThreeEtcdNodesOnNewNode
failing.
For the nature of the change in this PR #45572, this should just not be affected.
from rancher.
Doing investigation into this, I'm seeing that there were issues with operations taking significantly longer after v1.27.11+rke2r1
was released. It was almost a 200 second difference in my benchmark setup.
As such, as a temporary workaround, we can pin the RKE2 version to v1.27.10+rke2r1
for now, which should hopefully unblock CI.
from rancher.
Adding to a milestone and some additional labels to ensure we circle back on this and address version pinning.
from rancher.
no QA required - closing this issue
from rancher.
@snasovich do we want to have two issues that track one PR...?
from rancher.
Moving to v2.9-Next2 as the prerequisite issue (#46034 (comment)) was moved to that milestone.
from rancher.
Related Issues (20)
- [2.9] Add support for docker 27.2.x
- Add GH workflow to update system agent on tag
- Feature Charts: Add Longhorn 1.7.1 Chart in 2.7.x
- Feature Charts: Add Longhorn 1.7.1 Chart in 2.8.x
- Feature Charts: Add Longhorn 1.7.1 Chart in 2.9.x
- [BUG] Failed to remove etcd+controlplane node from RKE1 cluster HOT 2
- High cpu spike and cluster unresponsiveness after enabling ui server-side pagination in Rancher v2.9.1 on k3s cluster
- [RFE] Add the flag delete_on_termination when creation Openstack Node
- What is the Technology Evolution Logic of redhat openshift container platform and rancher?
- Reduce `webhook`'s dependency on RKE1 type changes
- [Backport v2.9] Reduce `webhook`'s dependency on RKE1 type changes
- [Backport v2.8] Reduce `webhook`'s dependency on RKE1 type changes
- [BUG] Upgrade from 2.8.4 to 2.9.1 - Deletion of existing projects gets stuck in "removing" state
- [BUG] Keycloak OIDC configuration forces using obsolete /auth/ path
- [BUG] Argo CD cannot sync GlobalRoles
- Wrangler codegen prevents updating to Go 1.23
- [RFE] Opening/Viewing ClusterIP/NodePort Services directly from the Rancher's UI
- [BUG] Rancher GitHub release assets and Docker Hub container images are missing for releases v2.8.6, v2.8.7 and v2.8.8 HOT 3
- [BUG] Rancher upgrade to v2.9.2 failed HOT 1
- Feature charts: Need to add NeuVector chart 103.0.6+up2.8.0 to 2.8x and NeuVector chart 104.0.2+up2.8.0 to 2.9.x
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rancher.