SB5.1 - OCI - Ansible variables for Deploy/managed-learn/ml-* jobs #394
Replies: 14 comments 24 replies
-
any update on this |
Beta Was this translation helpful? Give feedback.
-
For Cloud storage configuration in ml-core-service in OCI please refer to the section "# Oracle Cloud Configuration " at the end -https://github.com/project-sunbird/ml-core-service/blob/release-5.1.0/.env.sample For ML Project service errors do check and confirm if Sunbird RC deployment is done as per release notes in 5.1 here - https://ed.sunbird.org/use/updating-sunbird-releases/5.0.0-to-5.1.0#sunbird-rc Also in case it is not done to skip the same during the setup we can set the key PROJECT_CERTIFICATE_ON_OFF to be "OFF". Refer- https://github.com/project-sunbird/ml-projects-service/blob/release-5.1.0/.env.sample |
Beta Was this translation helpful? Give feedback.
-
@Shakthieshwari Can you check for the issue mentioned for ml-analytics service once? |
Beta Was this translation helpful? Give feedback.
-
Thanks for all the response.. I am putting together the issue status so far.
Add the host details in the group [ml-analytics-service] and ml_analytics_version = release-5.1.0_RC14. The job still failed with the below error
Update the bucket details for the dictionary ml_cloud_config The job got successful, but there are errors in the pods logs like below. Its still referring to an azure based url . Where is this configured
I verified that Sunbird RC deployment is done , but still getting the errors |
Beta Was this translation helpful? Give feedback.
-
The logs from ml-core-service
|
Beta Was this translation helpful? Give feedback.
-
This is the value in the configmap
|
Beta Was this translation helpful? Give feedback.
-
@aks30
I tried to add the flag but it didnt help I want to know how does it get certificate issuer kid
i am not passing any variable named certificate_issuer_kid so its taking as "" |
Beta Was this translation helpful? Give feedback.
-
@aks30 ..so it looks like registry service is running..but on checking registry logs.
my question which jobs provisions the database named registry |
Beta Was this translation helpful? Give feedback.
-
i created the database name registry. Now i am not seeing above error..but still the ml-project-service is showing the same error. |
Beta Was this translation helpful? Give feedback.
-
Now the error i am getting is slight different
|
Beta Was this translation helpful? Give feedback.
-
In registry database i can see the below tables
|
Beta Was this translation helpful? Give feedback.
-
@aks30
|
Beta Was this translation helpful? Give feedback.
-
@aks30 @ddevadat - I believe this thread needs to be resolved via a call. @anandvarada - Can you facilitate this discussion please? |
Beta Was this translation helpful? Give feedback.
-
Based on the information provided, it seems that there are some issues with the ml-projects-service deployment. The error message suggests that the connection to the registry service was refused, which may indicate that the service is not running or is not accessible from the pod. To resolve this issue, you can try the following steps: Check if the registry service is running and accessible from the pod. You can do this by running a ping or curl command from the pod to the service endpoint. Check if the correct registry service endpoint URL is provided in the KID issuer configuration. You can verify this by checking the ml_cloud_config dictionary in the playbook for the ml-projects-service deployment. If the above steps do not resolve the issue, check the logs of the registry service and the ml-projects-service pods for any error messages that may provide more information about the root cause of the issue. Once you have identified and resolved the issue, you should be able to successfully deploy the ml-projects-service. Meanwhile I would like to recommend a book on AWS |
Beta Was this translation helpful? Give feedback.
-
We are deploying sunbird on OCI at release 5.1
For the below jobs we need the ansible variables values
Deploy/managed-learn/ml-analytics-service
Deploy/managed-learn/ml-core-service
Deploy/managed-learn/ml-projects-service
When we run this job, what should be the value of ml_analytics_version ?
Deploy/managed-learn/ml-core-service
We need to provide the bucket name for the dictionary ml_cloud_config. what should be the value
Deploy/managed-learn/ml-projects-service
For this job we are seeing the below errors in the pods
Beta Was this translation helpful? Give feedback.
All reactions