-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: ODH Operator Fails to create KfDef Instance and accociated elements. #111
Comments
Hi @VaishnaviHire - sorry no. The slack channel thread, for the issue I have now resolved was to get the alternative install of Open Data Hub, which I have then used to compare to the original install of Open Data Hub, to see if I could find any reason its failing to create the resources during instance creation of KfDef. That issue still needs to be resolved. Yes I also have a slack channel conversation going for this issue, but so far, its not had any helpful postings, and others reading this issue may wish to know how to solve the same problem. |
Taken from the slack channel discussion of this problem:
Was there any changes in these labels for your openshift-operators namespace? |
Hi @VaishnaviHire
And for the working version:
Which are the same, and almost the same is true for the namespaces
and
I do note on the cluster thats failing I have the following SCC which I am not sure how or by what got created:
Which is the scc thats being picked up by the install of the Operator |
Just to follow up, I attempted to reproduce this issue on two fresh OCP 4.12.35 clusters. The ODH operator install does not explicitly request the
It is safe to assume that the odh-operator can not create the
Judging by the content of |
Hi @LaVLaS - I agree with your assessment, and that the Any suggestions? |
Hello, ST4SD does create a SCC but it has the name Tim, one option would be to manually assign the |
I have managed to now install ODH - I reduced the priority to |
Judging by the ServiceAccount that uses the ibo SCC (from the definiton of the SCC : |
Thanks @VassilisVassiliadis - but there is no |
ODH Component
ODH Operator
Current Behavior
odh
.odh
and enter Installed Open Data Hub Operator's console.Conditions: Degraded, Available
and none of the other components are created.Expected Behavior
As above, but for step 4 - the additional elements get created, i.e. pods, deployment resources etc.
Steps To Reproduce
See notes above.
Workaround (if any)
None yet.
What browsers are you seeing the problem on? (If applicable)
Firefox
Open Data Hub Version
Anything else
I note the following issues in the logs for the Operator Pod..
2023-09-22T12:07:11.171Z INFO controllers.KfDef Reconciling KfDef resources {"Request.Namespace": "odh", "Request.Name": "opendatahub"}
2023-09-22T12:07:11.171Z INFO controllers.KfDef Creating a new KubeFlow Deployment {"KubeFlow.Namespace": "odh"}
2023-09-22T12:07:11.172Z ERROR controllers.KfDef Failed to create the app directory {"error": "mkdir /tmp/odh: read-only file system"}
github.com/opendatahub-io/opendatahub-operator/controllers/kfdef%2eapps%2ekubeflow%2eorg.(*KfDefReconciler).Reconcile
/workspace/controllers/kfdef.apps.kubeflow.org/kfdef_controller.go:236
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Reconcile
/opt/app-root/src/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.10.0/pkg/internal/controller/controller.go:114
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).reconcileHandler
/opt/app-root/src/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.10.0/pkg/internal/controller/controller.go:311
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem
/opt/app-root/src/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.10.0/pkg/internal/controller/controller.go:266
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func2.2
/opt/app-root/src/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.10.0/pkg/internal/controller/controller.go:227
2023-09-22T12:07:11.172Z ERROR controllers.KfDef failed to load KfApp {"error": "mkdir /tmp/odh: read-only file system"}
Which I note is the same / similar to issue opendatahub-operator#259
I also note that the Operator has been created with what looks to be the wrong scc values, (taken from the yaml file installed).
On a different install of ODH I have values:
openshift.io/scc: anyuid
but on my install I have value:
openshift.io/scc: ibo.std-scc
I also have values:
fsGroup: 1000
runAsUser: 101
readOnlyRootFilesystem: true
Which look to be taken from a different Operator also installed in the OpenShift-Operator namespace.
Coould / should this operator be effecting the ODH operator?
The text was updated successfully, but these errors were encountered: