Comments (4)
Hey @canonikamal, first off, this is indeed a bug in Charmed Kubeflow. We missed the Training Operator roles in the charm that creates the aggregation ClusterRoles, which is why the default-editor ends up without permissions for PyTorchJob CRs.
We'll keep this issue open and work on fixing this in the next release.
To unblock you, I'd suggest applying the upstream ClusterRoles directly:
https://github.com/kubeflow/training-operator/blob/v1.5-branch/manifests/overlays/kubeflow/kubeflow-training-roles.yaml
Lastly, regarding the recommended approach: it's always preferred to create ClusterRoles that get aggregated to Kubeflow's edit/view/admin ClusterRoles. This way the permissions get propagated to all user namespaces, since in each Profile namespace the default-editor is bound to the kubeflow-edit ClusterRole through a RoleBinding.
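For illustration, here is a minimal sketch of what such an aggregated ClusterRole could look like, built as a Python dict and printed as JSON. The role name `example-training-edit` is hypothetical, and the aggregation label and the rule list are assumptions based on the usual Kubeflow convention; check them against the upstream manifest linked above before relying on them.

```python
import json

# Assumed label: Kubeflow's kubeflow-edit ClusterRole is expected to aggregate
# any ClusterRole carrying this label. Verify it against your deployment.
AGGREGATE_TO_EDIT_LABEL = "rbac.authorization.kubeflow.org/aggregate-to-kubeflow-edit"


def build_training_edit_role(name="example-training-edit"):
    """Return a ClusterRole manifest (as a dict) granting edit access to PyTorchJobs.

    The name is hypothetical and the rules are an illustrative subset,
    not a copy of the upstream manifest.
    """
    return {
        "apiVersion": "rbac.authorization.k8s.io/v1",
        "kind": "ClusterRole",
        "metadata": {
            "name": name,
            # The label is what lets the aggregation rule pick this role up.
            "labels": {AGGREGATE_TO_EDIT_LABEL: "true"},
        },
        "rules": [
            {
                "apiGroups": ["kubeflow.org"],
                "resources": ["pytorchjobs", "pytorchjobs/status"],
                "verbs": ["create", "delete", "get", "list", "patch", "update", "watch"],
            }
        ],
    }


if __name__ == "__main__":
    # Print the manifest as JSON, which kubectl accepts in place of YAML.
    print(json.dumps(build_training_edit_role(), indent=2))
```

If the label matches your cluster's aggregation rule, the printed JSON could be saved to a file and applied with `kubectl apply -f`. The rules would then show up in kubeflow-edit without touching any per-namespace RoleBinding.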
from training-operator.
Thanks for the response and for confirming the bug, @kimwnasptd.
I've shared the workaround with the customer. Thanks again for sharing the recommended approach on the aggregated roles.
Best regards
Kamal Bhaskar
This got closed erroneously while I was adding the last comment. Re-opened, as the bug is still being tracked.
Small clarification, based on my updated understanding: the aggregated ClusterRole from our side will need to live inside the Training Operator repo: https://github.com/canonical/training-operator/tree/main/src/templates
The pattern is that these Roles should live alongside the charm. For this to work, the charm must follow the Sidecar pattern rather than PodSpec.
The Training Operator charm is already a Sidecar charm, so we can just put the template in the correct folder.